Checkpointing `workflow.json` incrementally for nat eval

### Is this a new feature, an improvement, or a change to existing functionality?

Improvement

### How would you describe the priority of this feature request

Critical (currently preventing usage)

### Please provide a clear description of problem this feature solves

When running `nat eval` on a dataset, workflow_output.json is only written/visible after the entire dataset finishes. If evaluation hangs/fails or we have to interrupt it, we lose access to results from tasks that already completed, and the run isn’t recoverable without re-running. It would be very helpful to persist per-task intermediate outputs to disk as soon as each dataset item completes (before eval) so partial results are always recoverable.

### Describe your ideal solution

Add incremental output writing for nat eval, e.g. write one JSONL record per completed item (dataset_item_<idx>..json) and optionally generate the aggregated workflow_output.json at the end.


### Additional context

_No response_

### Code of Conduct

- [x] I agree to follow this project's Code of Conduct
- [x] I have searched the [open feature requests](https://github.com/NVIDIA/NeMo-Agent-Toolkit/issues?q=is%3Aopen+is%3Aissue+label%3A%22feature+request%22%2Cimprovement%2Cenhancement) and have found no duplicates for this feature request

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Checkpointing `workflow.json` incrementally for nat eval #1631

Is this a new feature, an improvement, or a change to existing functionality?

How would you describe the priority of this feature request

Please provide a clear description of problem this feature solves

Describe your ideal solution

Additional context

Code of Conduct

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Checkpointing workflow.json incrementally for nat eval #1631

Description

Is this a new feature, an improvement, or a change to existing functionality?

How would you describe the priority of this feature request

Please provide a clear description of problem this feature solves

Describe your ideal solution

Additional context

Code of Conduct

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Checkpointing `workflow.json` incrementally for nat eval #1631