Skip to content

ISSUE #10 Integrate COMPL-AI Benchmark Evaluation Suite#11

Merged
fdidonato merged 1 commit into
mainfrom
10-integrate-compl-ai-benchmark-evaluation-suite
May 6, 2026
Merged

ISSUE #10 Integrate COMPL-AI Benchmark Evaluation Suite#11
fdidonato merged 1 commit into
mainfrom
10-integrate-compl-ai-benchmark-evaluation-suite

Conversation

@fdidonato
Copy link
Copy Markdown
Owner

-Added extensive tests for refusal focus, contextualization, grounding, domain prefilter, and intent falsification. -Resolved an issue in domain detection end refusal specificity. -Added openai compatible server to expose openai-like apis and test the framework with compl-ai

-Added extensive tests for refusal focus, contextualization, grounding, domain prefilter, and intent falsification.
-Resolved an issue in domain detection end refusal specificity.
-Added openai compatible server to expose openai-like apis and test the framework with compl-ai
@fdidonato fdidonato linked an issue May 6, 2026 that may be closed by this pull request
5 tasks
@fdidonato fdidonato merged commit 1f81d5b into main May 6, 2026
2 of 6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Integrate COMPL-AI Benchmark Evaluation Suite

1 participant