Skip to content

[WIP] Add non-gated colvec_scale support to gemm_dact#143

Draft
JungHoyoun wants to merge 2 commits into
Dao-AILab:mainfrom
JungHoyoun:hy/gemm-dact-colvec-support
Draft

[WIP] Add non-gated colvec_scale support to gemm_dact#143
JungHoyoun wants to merge 2 commits into
Dao-AILab:mainfrom
JungHoyoun:hy/gemm-dact-colvec-support

Conversation

@JungHoyoun
Copy link
Copy Markdown

I would like to add a colvec_scale argument to the gemm_dact signature, similar to gemm_dgated.

The motivation is to add squared_relu activation support to sonic-moe. Currently, sonic-moe only supports GLU-family activations, and this change is intended to extend support to architectures such as Nemotron 3.

The initial commit was written with the help of Codex. I will clean up and verify the code properly before moving this PR out of draft.

So far, testing has only been run on an SM90 environment.

Also, #142 was accidentally pushed while I was working with an AI tool. I have closed that PR, but I would appreciate it if it could be deleted from the repository history if possible. Sorry for the noise.

@JungHoyoun JungHoyoun marked this pull request as draft May 19, 2026 13:32
@JungHoyoun JungHoyoun changed the title [WIP] Add colvec_scale support to gemm_dact [WIP] Add non-gated colvec_scale support to gemm_dact May 19, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant