Autoresearch: Workflow engine node execution optimization loop

## Context

Adapting [Karpathy's Autoresearch pattern](https://github.com/karpathy/autoresearch) to autonomously optimize workflow node execution.

### Why

The workflow engine executes DAGs of AI nodes — LLM calls, tool executions, branching logic. As workflows grow more complex (10+ nodes, nested sub-workflows), execution time and memory compound. Optimizing scheduling, parallelization, and node implementation directly impacts user experience and compute costs.

### What

Set up the autoresearch loop:

| File | Role | Who edits |
|------|------|-----------|
| `benchmark.py` | Runs test workflow suite, measures execution time + memory + correctness | Nobody (read-only) |
| `engine.py` | Scheduler, node implementations, parallelization logic | Agent only |
| `program.md` | Optimization targets, constraints, test workflows | Human only |

### Search space

- **Scheduling strategies**: Topological sort variants, priority queues, speculative execution
- **Parallelization**: asyncio concurrency limits, node-level vs branch-level parallelism
- **Memory management**: Streaming intermediate results vs buffering, context window packing
- **Node execution**: Batch compatible nodes, reuse connections, cache deterministic outputs

### Evaluation metric

**Primary**: Total workflow execution time (seconds)
**Secondary**: Peak memory (MB)
**Tertiary**: Correctness (all node outputs match expected)

### Prerequisites

- [ ] Define test workflow suite (simple linear, branching, nested, 20+ node DAGs)
- [ ] Implement `benchmark.py` with timing + memory + correctness checks
- [ ] Baseline measurement of current engine performance
- [ ] Scaffold `program.md`

---
*Generated by Claude*

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Autoresearch: Workflow engine node execution optimization loop #91

Context

Why

What

Search space

Evaluation metric

Prerequisites

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

File	Role	Who edits
`benchmark.py`	Runs test workflow suite, measures execution time + memory + correctness	Nobody (read-only)
`engine.py`	Scheduler, node implementations, parallelization logic	Agent only
`program.md`	Optimization targets, constraints, test workflows	Human only

Autoresearch: Workflow engine node execution optimization loop #91

Description

Context

Why

What

Search space

Evaluation metric

Prerequisites

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions