Multi-judge LLM evaluation that returns typed disagreement structure instead of collapsing it to a scalar.
Updated May 24, 2026 - Python