ISSUE #10 Integrate COMPL-AI Benchmark Evaluation Suite by fdidonato · Pull Request #11 · fdidonato/moralstack

fdidonato · 2026-05-06T14:18:36Z

-Added extensive tests for refusal focus, contextualization, grounding, domain prefilter, and intent falsification. -Resolved an issue in domain detection end refusal specificity. -Added openai compatible server to expose openai-like apis and test the framework with compl-ai

fdidonato linked an issue May 6, 2026 that may be closed by this pull request

Integrate COMPL-AI Benchmark Evaluation Suite #10

Closed

5 tasks

fdidonato merged commit 1f81d5b into main May 6, 2026
2 of 6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ISSUE #10 Integrate COMPL-AI Benchmark Evaluation Suite#11

ISSUE #10 Integrate COMPL-AI Benchmark Evaluation Suite#11
fdidonato merged 1 commit into
mainfrom
10-integrate-compl-ai-benchmark-evaluation-suite

fdidonato commented May 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

fdidonato commented May 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant