Skip to content

fix(qwen35moe): sub-batch hybrid prefill FFN to avoid MMQ mul_mat_id OOB

caf2b11
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

feat(qwen35moe): pipelined hybrid MoE decode with GPU/CPU overlap #289

fix(qwen35moe): sub-batch hybrid prefill FFN to avoid MMQ mul_mat_id OOB
caf2b11
Select commit
Loading
Failed to load commit list.

Annotations

1 warning
uv workspace (lock + sync + import smoke)
succeeded May 30, 2026 in 45s