Skip to content

feat(rl-env): support parallel rollout collection via VM pool #76

@abrichr

Description

@abrichr

RLEnvironment.collect_rollout is sequential. GRPO requires N=8 parallel rollouts from the same VM starting state. Sequential rollouts violate the same-state assumption (each rollout mutates the VM).

Need ParallelRolloutCollector using PoolManager with VM snapshots or synchronized resets. Currently documented as "future work" in rollout_collector.py.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions