[feat] Add SP and PP kernels for qwen_moe true on policy by maocheng23 · Pull Request #31 · radixark/Megatron-LM

maocheng23 · 2026-05-07T21:31:41Z

Stacked on top of #30.

Summary

Reword the attention-TP-without-SP error message to flag the missing true-on-policy path.
Add _under_sp_or_attn_tp guard in miles_megatron_plugins/true_on_policy/moe_layer_ext.py so the SGLang local-masked EP forward path is allowed even when padding is present (under SP or attention-TP), and padding compaction is skipped under those topologies.
Wire SP/PP-aware token routing, p2p communication, schedules, transformer block changes, and bridge schema flag.
Add MoE PP/SP test fixtures under tests/unit_tests.

Test plan

Run tests/unit_tests/transformer/moe/test_moe_layer.py against PP=2 and SP=on configurations.
Verify zero logprob diff for Qwen3-MoE TP=1 EP=4 PP=2 SP=on against rollout.

Add a clean Megatron backend that calls SGLang-compatible math under a flag: - sglang.py: SGLangLinear, SGLangRMSNorm, SGLangFlashAttention and related modules - matmul_tp_inv.py: TP-invariant matmul dispatch for Megatron layers - transformer_config.py: use_sglang config flag - arguments.py: --use-sglang CLI arg - layers.py: conditional SGLang backend selection in TP layers - gpt_layer_specs.py: SGLang-compatible layer spec builder - test_sglang_extension.py: import, config, and default-path-unchanged tests Default training path remains unchanged when use_sglang is off. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Match SGLang's TP reduction order and full-vocab logprob contract: - mappings.py: tree_all_reduce_sum for deterministic TP reduction - layers.py: conditional tree allreduce in RowParallelLinear - gpt_model.py: full-vocab logprob gather/truncate/log-softmax - transformer_config.py: true_on_policy_logits config - test_tree_all_reduce.py: TP tree-allreduce tests - test_true_on_policy_logits.py: full-vocab gather/truncate tests Default NCCL allreduce path unchanged when flags are off. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

isort/black reformatting on files touched by the true-on-policy substrate, runtime contract, and Qwen3-dense parity path. No semantic changes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

maocheng23 and others added 25 commits April 23, 2026 12:17

Align Megatron true-on-policy Qwen3 dense path

be972a2

Bypass SGLang Ulysses CP full recompute

ab4d98f

Add Megatron true-on-policy namespace

d9bfff1

Split Megatron true-on-policy backend modules

7de0d6b

Add Megatron true-on-policy runtime contract

b8a0bd2

Throttle implicit true-on-policy contract warnings

356770a

Add true-on-policy contract schema adapter

ce180c5

Route low-risk true-on-policy checks through runtime policy

f0f92be

Route GPT true-on-policy setup through runtime policy

c73d0d3

Move transformer block true-on-policy behavior into runtime policy

52a7f1d

Route transformer layer residual contract through runtime policy

f3f2590

Move attention dtype boundaries into true-on-policy contract

d5719b8

Retire use_sglang true-on-policy switch

e58f537

Sync true-on-policy schema file

c8def60

chore: apply pre-commit auto-fixes to true-on-policy stack

b709289

isort/black reformatting on files touched by the true-on-policy substrate, runtime contract, and Qwen3-dense parity path. No semantic changes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

address cmts

afa4590

fix

41e5015

Rename true-on-policy plugin package

4a5418a

Add Qwen3 MoE true-on-policy Megatron parity

81c6e34

Fix MoE true-on-policy weight refresh path

ebbe843

Split true-on-policy MoE helpers

9c390a1

Extract true-on-policy MoE layer extensions and combine ordering

c2a98e1

Support true-on-policy MoE PP and SP kernels

61b608e

maocheng23 force-pushed the feat/true_on_policy_qwen_moe branch 4 times, most recently from 7ac2a80 to b0679e2 Compare May 23, 2026 02:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feat] Add SP and PP kernels for qwen_moe true on policy#31

[feat] Add SP and PP kernels for qwen_moe true on policy#31
maocheng23 wants to merge 25 commits into
feat/true_on_policy_qwen_moefrom
feat/true_on_policy_qwen_moe_sppp

maocheng23 commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

maocheng23 commented May 7, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant