-
Notifications
You must be signed in to change notification settings - Fork 213
Pull requests: Luce-Org/lucebox-hub
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[LuceBox][DFlash][lucebox-pr314-common-empty-fallback][2/n] Default empty spec retry in backend calls
#319
opened May 31, 2026 by
OmarB97
Contributor
Loading…
fix: capture daemon stderr in DflashClient error messages
#316
opened May 31, 2026 by
Oxygen56
Loading…
1 task done
[codex] Recover dflash spec-decode agent stalls
#315
opened May 31, 2026 by
OmarB97
Contributor
Loading…
feat(server): reduce layer-split activation memory with backend precision policy
#310
opened May 29, 2026 by
weicj
Collaborator
Loading…
feat(dflash): reduce feature mirror memory with dtype policy
#309
opened May 29, 2026 by
weicj
Collaborator
Loading…
fix(server): route Qwen3.6/Laguna think-mode reasoning to reasoning_content channel
#308
opened May 29, 2026 by
easel
Collaborator
Loading…
refactor(server): share target layer-split runtime helpers
#306
opened May 29, 2026 by
weicj
Collaborator
Loading…
refactor: extract MoE hybrid mode into common layer for qwen and laguna
#305
opened May 29, 2026 by
howard0su
Contributor
Loading…
feat(server): add Laguna target-layer-split adapter
#297
opened May 28, 2026 by
weicj
Collaborator
Loading…
fix(server): support sampled requests in target layer split
#295
opened May 28, 2026 by
weicj
Collaborator
Loading…
feat(server): passthrough proxy, piecewise keep-ratio curve, query survival check
#294
opened May 28, 2026 by
smpurkis
Contributor
Loading…
feat(qwen35moe): pipelined hybrid MoE decode with GPU/CPU overlap
#289
opened May 28, 2026 by
howard0su
Contributor
Loading…
feat(lucebox): docker stack + CLI + bench/profile + harness + luce-bench in-tree
#285
opened May 27, 2026 by
easel
Collaborator
Loading…
fix(server): Qwen3.6-27B tool calling for claude-code Anthropic path
#276
opened May 25, 2026 by
dusterbloom
Collaborator
Loading…
5 of 7 tasks
feat(drafter): ee3 as production default (depends on #274)
#275
opened May 24, 2026 by
dusterbloom
Collaborator
•
Draft
feat(pflash): prefill compress up to 128k -> 2-12× prefill (content-dependent), decode at parity
#274
opened May 24, 2026 by
dusterbloom
Collaborator
Loading…
feat(harness): typed adapters + format-aware session-inject proxy + multi-turn bandit driver
#266
opened May 23, 2026 by
dusterbloom
Collaborator
Loading…
mtp: prefix-cache WARM hit (perfect + partial via range-warm)
#221
opened May 18, 2026 by
dusterbloom
Collaborator
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-04-30.