A community catalog of autonomous agents and bundles, each certified by passing TraceCore deterministic episode runs in public CI.
open-source benchmarking evaluation multi-agent deterministic ai-agents developer-tools-test agent-benchmark tracecore
Updated Mar 7, 2026 · Python