Checkpointing workflow.json incrementally for nat eval #1631

@alessiodevoto

Description

Is this a new feature, an improvement, or a change to existing functionality?

Improvement

How would you describe the priority of this feature request

Critical (currently preventing usage)

Please provide a clear description of problem this feature solves

When running nat eval on a dataset, workflow_output.json is written (or at least becomes visible) only after the entire dataset finishes. If the evaluation hangs or fails, or we have to interrupt it, we lose the results from items that already completed, and the only way to recover them is to re-run the whole dataset. It would be very helpful to persist each item's intermediate output to disk as soon as that dataset item completes (before the evaluation step), so that partial results are always recoverable.

Describe your ideal solution

Add incremental output writing for nat eval: for example, write one JSONL record per completed item (dataset_item_..json), and optionally generate the aggregated workflow_output.json at the end.
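
To make the request concrete, here is a minimal sketch of the pattern in plain Python. It is not NAT code and does not use any NAT API: `run_item`, the `id` field, and the file name `workflow_output.partial.jsonl` are hypothetical names chosen for illustration. Each completed item is appended as one JSON line and flushed to disk right away; on restart, items already present in the file are skipped, and the aggregated `workflow_output.json` is written only once every item has finished.

```python
import json
import os
from pathlib import Path


def _ends_with_newline(path: Path) -> bool:
    """True if the file is missing, empty, or its last byte is a newline."""
    if not path.exists() or path.stat().st_size == 0:
        return True
    with path.open("rb") as f:
        f.seek(-1, os.SEEK_END)
        return f.read(1) == b"\n"


def append_record(path: Path, record: dict) -> None:
    """Append one completed item as a single JSON line and force it to disk."""
    # If a previous run died mid-write, start on a fresh line so the
    # truncated fragment cannot swallow the new record.
    needs_newline = not _ends_with_newline(path)
    with path.open("a", encoding="utf-8") as f:
        if needs_newline:
            f.write("\n")
        f.write(json.dumps(record) + "\n")
        f.flush()
        os.fsync(f.fileno())


def load_records(path: Path) -> list:
    """Read back every complete record, skipping lines that fail to parse."""
    if not path.exists():
        return []
    records = []
    with path.open(encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError:
                continue  # partial line left behind by an interrupted write
    return records


def run_with_checkpoints(items, run_item, out_dir: Path) -> list:
    """Run `run_item` (hypothetical callable) per item, checkpointing each result."""
    out_dir.mkdir(parents=True, exist_ok=True)
    partial = out_dir / "workflow_output.partial.jsonl"  # hypothetical file name
    done = {r["id"] for r in load_records(partial)}
    for item in items:
        if item["id"] in done:
            continue  # already completed in an earlier, interrupted run
        output = run_item(item)
        append_record(partial, {"id": item["id"], "output": output})
    # Aggregate only after every item has a record on disk.
    records = load_records(partial)
    (out_dir / "workflow_output.json").write_text(
        json.dumps(records, indent=2), encoding="utf-8"
    )
    return records
```

Appending to a single JSONL file keeps each write small and makes a crash cost at most the item that was in flight. Writing one file per item would work just as well if items complete concurrently and a shared file handle is undesirable.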

Additional context

No response

Code of Conduct

  • I agree to follow this project's Code of Conduct
  • I have searched the open feature requests and have found no duplicates for this feature request
