Skip to content

Feat/cuda pad#2347

Open
kali wants to merge 2 commits into
mainfrom
feat/cuda-pad
Open

Feat/cuda pad#2347
kali wants to merge 2 commits into
mainfrom
feat/cuda-pad

Conversation

@kali
Copy link
Copy Markdown
Collaborator

@kali kali commented Jun 5, 2026

No description provided.

kali added 2 commits June 5, 2026 15:32
Pad had no GPU implementation, so any model using it (e.g. the Nemotron/Parakeet
preprocessor's signal-centering and real-to-complex pads) bounced through the host
mid-graph. Add GpuPad, a backend-agnostic copy-based op: broadcast the pad value
across the output, then copy the input into the interior, both via the existing
copy_nd. Constant mode only; Reflect/Edge stay on the host. The preprocessor signal
path now runs entirely on GPU.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant