Skip to content

reimpl: prevent orchestrator re-evaluation#378

Merged
Nicola Franco (franconicola) merged 2 commits into
mainfrom
reimpl/prevent-orchestrator-re-evaluation
May 22, 2026
Merged

reimpl: prevent orchestrator re-evaluation#378
Nicola Franco (franconicola) merged 2 commits into
mainfrom
reimpl/prevent-orchestrator-re-evaluation

Conversation

@RPaolino

Copy link
Copy Markdown
Contributor

Add _already_evaluated guard to run_full_evaluation that detects when input data already carries evaluation scores (eval_jb, eval_hb, best_score, etc.) and skips redundant re-evaluation steps 1-6 while preserving metrics summary generation (steps 7-8).

Fixes #342

Add _already_evaluated guard to run_full_evaluation that detects when
input data already carries evaluation scores (eval_jb, eval_hb, best_score,
etc.) and skips redundant re-evaluation steps 1-6 while preserving metrics
summary generation (steps 7-8).

This fixes the AdvPrefix bug where aggregated rows lack the completion
field, causing judges to evaluate empty strings and overwrite correct
statuses with FAILED_JAILBREAK.

Fixes #342
@franconicola Nicola Franco (franconicola) temporarily deployed to reimpl/prevent-orchestrator-re-evaluation - Docs PR #378 May 22, 2026 10:45 — with Render Destroyed
@codecov

codecov Bot commented May 22, 2026

Copy link
Copy Markdown

Codecov Report

❌ Patch coverage is 96.42857% with 1 line in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
hackagent/attacks/evaluator/evaluation_step.py 96.42% 1 Missing ⚠️

📢 Thoughts on this report? Let us know!

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@franconicola Nicola Franco (franconicola) merged commit 1b47797 into main May 22, 2026
21 checks passed
@franconicola Nicola Franco (franconicola) deleted the reimpl/prevent-orchestrator-re-evaluation branch May 22, 2026 18:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Fix: prevent orchestrator re-evaluation from zeroing jailbreak counts

2 participants