Add num_generators field to RLTrainer.Config #3624

Open
pzhan9 wants to merge 1 commit into pytorch:main from pzhan9:num-generator-main

@pzhan9 pzhan9 commented Jun 10, 2026

Contributor

GeneratorRouter was added in #3583, but we currently have no way to spawn multiple generators. As a result, the router cannot be covered by an integration test in CI.

To unblock that, this PR adds a num_generators field to RLTrainer.Config. The field can only be used in single-host mode.

This field is meant to be temporary, just to unblock the integration test (as suggested in this comment). In the long run, we need a "placement" config that describes how to map GPUs/hosts to the different RL roles, such as the trainer and the generators; num_generators should live there too. Hopefully we can get that done in the next few weeks.
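For readers who have not opened the diff, the single-host restriction described above could be modeled roughly as follows. This is only a sketch: `RLTrainerConfigSketch`, the `num_hosts` field, and the default values are hypothetical stand-ins, not the actual `RLTrainer.Config` definition or the validation this PR implements.

```python
from dataclasses import dataclass


@dataclass
class RLTrainerConfigSketch:
    """Hypothetical stand-in for RLTrainer.Config; not the real class."""

    # Assumed field, used here only to model "single-host mode".
    num_hosts: int = 1
    # The field this PR adds; the default of 1 is an assumption.
    num_generators: int = 1

    def __post_init__(self) -> None:
        if self.num_generators < 1:
            raise ValueError("num_generators must be at least 1")
        # Multiple generators are only allowed when everything runs on one host.
        if self.num_generators > 1 and self.num_hosts != 1:
            raise ValueError(
                "num_generators > 1 is only supported in single-host mode"
            )
```

Under these assumptions, `RLTrainerConfigSketch(num_generators=2)` is accepted, while `RLTrainerConfigSketch(num_hosts=2, num_generators=2)` raises a `ValueError`.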

meta-cla Bot added the CLA Signed label Jun 10, 2026
],
"RL GRPO 2 generators TP=1 no compile",
"rl_grpo_2gen_tp1_no_compile",
ngpu=4,

Contributor

We should make this an 8-GPU test: 4 GPUs for the trainer and 2 generators, each at TP=2. Seems fine to land this first, with a TODO.

Contributor

I would do 3 generators at TP=2. Rollout is the bottleneck.
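To compare the two layouts proposed in this thread, here is a rough GPU-budget check. It assumes the trainer and the generators never share GPUs and that each generator occupies exactly its TP degree in GPUs; `gpus_required` is a hypothetical helper, and the 2-GPU trainer in the second layout is inferred from the 8-GPU budget rather than stated by the reviewer.

```python
def gpus_required(trainer_gpus: int, num_generators: int, generator_tp: int) -> int:
    """Total GPUs needed, assuming no GPU is shared between roles."""
    return trainer_gpus + num_generators * generator_tp


# First suggestion: trainer on 4 GPUs, 2 generators at TP=2.
layout_a = gpus_required(trainer_gpus=4, num_generators=2, generator_tp=2)

# Second suggestion: 3 generators at TP=2. The trainer size of 2 is an
# assumption chosen so the layout still fits on 8 GPUs.
layout_b = gpus_required(trainer_gpus=2, num_generators=3, generator_tp=2)
```

Under these assumptions both layouts come to 8 GPUs; the second moves two GPUs from the trainer to rollout.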

Labels

ciflow/rl, ciflow/8gpu, CLA Signed