fix: pad ids_dst allocation to prevent MMQ stream-k OOB reads #13

Merged: davide221 merged 1 commit into Luce-Org:luce-dflash from howard0su:crash on May 28, 2026
@howard0su
Fix a crash when used in a MoE graph

The stream-k kernel loads a full mmq_x-wide tile from ids_dst without
bounds-checking the load (only the write-back is bounded). When the
number of get_rows is small, the unpadded allocation caused OOB reads.
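
The read pattern described above can be modeled on the host. The following is a minimal sketch, not the kernel itself: it assumes tiles start at multiples of `mmq_x` (the real stream-k tile layout may differ), and `count_oob_reads` is a hypothetical helper that only counts how many element reads would fall outside an allocation of a given size.

```cpp
#include <cstddef>

// Hypothetical model of the unchecked tile load from ids_dst.
// Assumption: tiles start at 0, mmq_x, 2*mmq_x, ... while tile_start < n_ids,
// and each tile reads mmq_x consecutive elements without comparing against n_ids.
// Returns the number of element reads that land at or beyond `allocated`.
std::size_t count_oob_reads(std::size_t n_ids, std::size_t mmq_x, std::size_t allocated) {
    if (mmq_x == 0) {
        return 0; // no tile width, nothing is read
    }
    std::size_t oob = 0;
    for (std::size_t tile_start = 0; tile_start < n_ids; tile_start += mmq_x) {
        for (std::size_t j = 0; j < mmq_x; ++j) {
            if (tile_start + j >= allocated) {
                ++oob; // this read is outside the allocation
            }
        }
    }
    return oob;
}
```

With 3 ids, a tile width of 64, and an allocation of exactly 3 elements, the single tile reads 61 elements past the end; the bounded write-back does not help because the fault happens on the load.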

Pad by get_mmq_x_max_host(cc) elements to ensure the tile load is
always within allocated memory.
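
The padding arithmetic can be sketched as follows. This is an illustration under stated assumptions, not the patch itself: `kAssumedMmqXMax` is a made-up stand-in for whatever `get_mmq_x_max_host(cc)` returns on a given device, the element type of `ids_dst` is assumed to be `int32_t`, and the helper names are hypothetical.

```cpp
#include <cstddef>
#include <cstdint>

// Assumption: placeholder for the value of get_mmq_x_max_host(cc);
// the real value depends on the compute capability.
constexpr std::size_t kAssumedMmqXMax = 128;

// Element count to allocate: the logical ids plus one maximum-width tile of padding.
std::size_t padded_ids_dst_elems(std::size_t n_ids, std::size_t mmq_x_max) {
    return n_ids + mmq_x_max;
}

// Byte size of the padded allocation, assuming int32_t elements.
std::size_t padded_ids_dst_bytes(std::size_t n_ids, std::size_t mmq_x_max) {
    return padded_ids_dst_elems(n_ids, mmq_x_max) * sizeof(std::int32_t);
}

// Checks the worst case: a tile of width mmq_x starting at the last valid id
// (index n_ids - 1) must end inside the padded allocation.
bool widest_tile_fits(std::size_t n_ids, std::size_t mmq_x, std::size_t mmq_x_max) {
    if (n_ids == 0) {
        return true; // no tile is loaded
    }
    return (n_ids - 1) + mmq_x <= padded_ids_dst_elems(n_ids, mmq_x_max);
}
```

Because a tile starts at an index below `n_ids` and is at most `mmq_x_max` wide, padding by `mmq_x_max` elements covers every tile load regardless of how few ids there are.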

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@davide221 davide221 merged commit 3890634 into Luce-Org:luce-dflash May 28, 2026
11 of 50 checks passed