LLM quality gates for every PR — run @eval_case suites automatically and block merge if quality drops below threshold
python testing machine-learning ai ci quality-assurance eval github-actions llm llmops llm-framework llm-evaluation synapsekit evalci
-
Updated
May 30, 2026 - Python