[graph_trainer] Enable ChunkedCELoss for deepseek_v3 and qwen3 (#3248) #3614

Draft
SherlockNoMad wants to merge 1 commit into gh/SherlockNoMad/50/base from gh/SherlockNoMad/50/head

@SherlockNoMad SherlockNoMad commented Jun 10, 2026

Contributor

[ghstack-poisoned]

Labels

ciflow/8gpu, CLA Signed

Projects

None yet

1 participant