[nightly-eval] Nightly regression: cli_args (2026-06-11)

Nightly eval REGRESSION: benchmark 'cli_args' has a passing baseline but failed BOTH trials on 2026-06-11.
Error category: [logic_error]
Model: opencode-qwen3-5-35b-a3b-mxfp8 (local, tiers:smoke,core)
Results: /tmp/nightly_eval_20260611
It passed before, so this is a genuine regression — not a known gap. Investigate.

---
Binary info (auto-attached):
ailang version: v0.24.2-42-ge54841f4
git commit: e54841f40a0102c51cfb6dc0d205bfcd393123ef

---
_Reported by: nightly-eval via ailang messages_

[nightly-eval] Nightly regression: cli_args (2026-06-11) #296
