Pull requests: Luce-Org/lucebox-hub

Pull requests list

Add speed profiler
#431 opened Jun 20, 2026 by officialgr
feat(kvflash): pager serialize/deserialize + critical-chunk pinning
#429 opened Jun 20, 2026 by dusterbloom Collaborator
perf(qwen35): fixed-width verify graph for CUDA-graph replay
#424 opened Jun 19, 2026 by cheese-cakee Contributor
spec-decode: eliminate replay pass via fast-rollback
#390 opened Jun 15, 2026 by howard0su Contributor Draft
perf(server): reduce MoE expert-compute IPC overhead
#388 opened Jun 15, 2026 by weicj Collaborator Draft
[codex] fix prefix cache for user-first prompts
#387 opened Jun 15, 2026 by easel Collaborator Draft
[codex] unify cache capacity config
#381 opened Jun 14, 2026 by easel Collaborator Draft
feat(dflash): add DeepSeek V4 Flash backend
#353 opened Jun 9, 2026 by howard0su Contributor Draft
test(server): CPU-only HTTP server test rig (stub backend + scenarios)
#343 opened Jun 4, 2026 by easel Collaborator
4 tasks
feat(server): soft-close thinking termination (qwen35 + gemma4)
#339 opened Jun 3, 2026 by easel Collaborator
5 tasks
feat(luce-bench): in-tree bench harness + multi-turn agent_recorded + LLM judge
#337 opened Jun 3, 2026 by easel Collaborator
4 tasks
feat(lucebox): hub CLI + autotune/sweep/profile + harness adapters + shell wrapper
#335 opened Jun 3, 2026 by easel Collaborator
6 tasks
Add LLM Auto Context Compaction
#304 opened May 29, 2026 by howard0su Contributor Draft
fix(server): Qwen3.6-27B tool calling for claude-code Anthropic path
#276 opened May 25, 2026 by dusterbloom Collaborator
5 of 7 tasks