[WIP] Fa4 d256 varlen zero seq #149

Draft
umiswing wants to merge 9 commits into PaddlePaddle:main from umiswing:fa4_d256_varlen_zero_seq

Conversation

@umiswing
Member

umiswing added 9 commits May 19, 2026 22:03
pick branch jshah/hdim256-varlen-zero-lengths, commit: 75db52f
…-doc)

This fixes a bug in the output-padding method when causal=False.
When causal=False, zeroing the padded rows of the output cannot mask out
the padding region of the attention scores along the column dimension,
while the causal=True branch happens to bypass this problem.
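
A minimal sketch of the failure mode described above, in plain Python rather than the actual FA4 kernel. It assumes that padding sits at the end of the sequence and that the causal mask is top-left aligned; the helper names (`attention`, `zero_padded_rows`, `kv_len`) are hypothetical and do not come from this PR. The idea is that zeroing padded output rows only touches padded queries, so valid queries still attend to padded key columns unless those columns are masked or the causal mask already excludes them.

```python
import math

def attention(q, k, v, causal, kv_len=None):
    """Naive single-head attention over lists of vectors (illustration only).

    kv_len is a hypothetical parameter: key columns j >= kv_len are masked.
    """
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for i, qi in enumerate(q):
        scores = []
        for j, kj in enumerate(k):
            masked = (causal and j > i) or (kv_len is not None and j >= kv_len)
            dot = sum(a * b for a, b in zip(qi, kj))
            scores.append(-math.inf if masked else dot * scale)
        m = max(scores)
        w = [math.exp(s - m) for s in scores]  # masked columns get weight 0.0
        z = sum(w)
        out.append([sum(w[j] * v[j][d] for j in range(len(v))) / z
                    for d in range(len(v[0]))])
    return out

def zero_padded_rows(out, valid_len):
    """Output padding: zero the rows that belong to padded queries."""
    return [row if i < valid_len else [0.0] * len(row)
            for i, row in enumerate(out)]

def close(a, b, tol=1e-9):
    return all(abs(x - y) <= tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

valid_len = 2  # rows 2 and 3 below are padding with arbitrary contents
q = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [0.3, -0.2]]
k = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [-1.0, 3.0]]
v = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0], [-50.0, 70.0]]

def valid_rows(causal, kv_len=None):
    out = zero_padded_rows(attention(q, k, v, causal, kv_len), valid_len)
    return out[:valid_len]

def reference(causal):
    # Ground truth: run attention on the unpadded sequence only.
    return attention(q[:valid_len], k[:valid_len], v[:valid_len], causal)

noncausal_ok = close(valid_rows(causal=False), reference(causal=False))
causal_ok = close(valid_rows(causal=True), reference(causal=True))
masked_ok = close(valid_rows(causal=False, kv_len=valid_len),
                  reference(causal=False))

print("non-causal, output zeroing only:", noncausal_ok)
print("causal, output zeroing only:", causal_ok)
print("non-causal, key columns masked:", masked_ok)
```

Under these assumptions, only the non-causal path that relies on output zeroing alone disagrees with the unpadded reference. In the causal case, a valid query at row i only sees columns j <= i, which are all valid, so the padded columns never contribute.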
