Treat failed Codex reviews as infra-only#286
Conversation
|
Codex review: needs real behavior proof before merge. Reviewed June 13, 2026, 4:44 PM ET / 20:44 UTC. Summary Reproducibility: Source-reproducible. Current main unconditionally renders PR readiness and rating label justifications for pull-request reports, while failed review reports still carry PR rating fields; I did not execute tests because this review is read-only. Review metrics: 3 noteworthy metrics.
Merge readiness Overall follows the weaker of proof and patch quality, so missing proof can cap an otherwise strong patch. Rank-up moves:
Proof guidance:
Risk before merge
Maintainer options:
Next step before merge
Security Review detailsBest possible solution: Land the narrow failed-review infra-only handling after real rendered-comment or label-sync proof is added and the normal merge gates are acceptable, while keeping complete-review proof blocking unchanged. Do we have a high-confidence way to reproduce the issue? Source-reproducible. Current main unconditionally renders PR readiness and rating label justifications for pull-request reports, while failed review reports still carry PR rating fields; I did not execute tests because this review is read-only. Is this the best way to solve the issue? Yes. The PR uses narrow AGENTS.md: found and applied where relevant. Codex review notes: model internal, reasoning high; reviewed against 0c3e8876ae04. Label changesLabel changes:
Label justifications:
Evidence reviewedWhat I checked:
Likely related people:
What the crustacean ranks mean
Shiny media proof means a screenshot, video, or linked artifact directly shows the changed behavior. Runtime, network, CSP, and security claims still need visible diagnostics. How this review workflow works
|
Summary
review_status: failed.Verification
pnpm run build:all && node --test --test-name-pattern 'failed Codex review comments suppress PR readiness ratings|mock-only real behavior proof blocks repair markers|proof-blocked PR comments show proof cap while preserving patch quality|public PR review details justify derived rating label changes' test/clawsweeper.test.tspnpm run test:repair && pnpm run test:coverage:changed && pnpm run format:checkpnpm run checkwas also attempted. It stopped intest:uniton the existing local filesystem-mode assertionread-only checkout mode restores file modes and leaves git metadata writable, wherestat(...).mode & 0777returned0777instead of the expected0555. The changed failed-review tests passed before that failure, and the independent downstream repair/coverage/format gates above passed.Notes
This is the backend side of the #91210/#92499 re-review failure cleanup: exhausted Codex transport failures should remain infra-only and should not leave patch-quality or readiness labels behind.