Skip to content

[graph_trainer] Add DSv3 scaling run/profiling helper script#3606

Draft
SherlockNoMad wants to merge 1 commit into
gh/SherlockNoMad/42/basefrom
gh/SherlockNoMad/42/head
Draft

[graph_trainer] Add DSv3 scaling run/profiling helper script#3606
SherlockNoMad wants to merge 1 commit into
gh/SherlockNoMad/42/basefrom
gh/SherlockNoMad/42/head

Conversation

@SherlockNoMad

@SherlockNoMad SherlockNoMad commented Jun 10, 2026

Copy link
Copy Markdown
Contributor

[ghstack-poisoned]
Comment thread run_graph_trainer_dsv3.sh
# logged automatically.
{

# --- DeepSeek-v3 16B (FSDP+TP+EP) ---

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what's the incentive to host this non-general script at root folder?

Convenience launcher for the DeepSeek-v3 671B scaling run

It's not even a true model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants