-
Notifications
You must be signed in to change notification settings - Fork 270
Pull requests: ml-explore/mlx-swift-lm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Gemma4Text: load conversions that omit redundant KV-shared-layer K/V projections
#350
opened Jun 13, 2026 by
jfranknichols
Contributor
Loading…
Expose ModelTypeRegistry.contains(_:) for pre-load support checks
#349
opened Jun 13, 2026 by
jfranknichols
Contributor
Loading…
Add TranslateGemma support (Gemma 3 translation template)
#348
opened Jun 13, 2026 by
beshkenadze
Loading…
Add Hunyuan dense V1 (hunyuan_v1_dense): Hunyuan-MT-7B and Hy-MT2-7B
#347
opened Jun 13, 2026 by
beshkenadze
Loading…
Gemma 3: chunked prompt prefill, skip lm_head on prompt positions
#346
opened Jun 13, 2026 by
beshkenadze
Loading…
Qwen2-VL: implement M-RoPE in the language model (mirror of #239)
#345
opened Jun 13, 2026 by
mnmly
Loading…
fix MLXVLM prepare(): honor windowSize for chunked prefill (FastVLM / Gemma3 / Qwen2VL / LFM2VL / Pixtral / Mistral3)
#344
opened Jun 12, 2026 by
john-rocky
Contributor
Loading…
Gemma 4: add audio and video multimodal support
#343
opened Jun 12, 2026 by
atlascodesai
Loading…
4 tasks done
Fix Gemma 4 QAT load: KV-shared layers carry no k_proj/v_proj/k_norm
#342
opened Jun 12, 2026 by
Flo5k5
Loading…
1 of 4 tasks
remove references per author request
#341
opened Jun 11, 2026 by
davidkoski
Collaborator
Loading…
3 of 4 tasks
Raise swift-syntax floor to 602.0.0 for prebuilt artifact resolution
#340
opened Jun 11, 2026 by
roryford
Loading…
Use MaterializedArray for Sendable conformance
#335
opened Jun 9, 2026 by
davidkoski
Collaborator
Loading…
2 of 4 tasks
Add MLXFoundationModels: an MLX-backed FoundationModels LanguageModel
#334
opened Jun 9, 2026 by
ctymoszek
Loading…
Fix Gemma4 QAT (E-series) load: KV-shared layers have no k_proj/v_proj/k_norm
#330
opened Jun 5, 2026 by
Flo5k5
Loading…
Add variance-normalized KV cache
#329
opened Jun 5, 2026 by
aleroot
Contributor
Loading…
3 of 4 tasks
fix speculative decode tests
#326
opened Jun 3, 2026 by
davidkoski
Collaborator
Loading…
3 of 4 tasks
Improve Qwen3.5 recurrent cache handling
#323
opened May 30, 2026 by
aleroot
Contributor
Loading…
4 tasks done
Fix Gemma4 E-series load: make per_layer_model_projection quantizable
#320
opened May 28, 2026 by
nyteshade
Loading…
Add Swift model conversion API
#318
opened May 28, 2026 by
aleroot
Contributor
Loading…
3 of 4 tasks
Add speculative decoding telemetry and memory gating
#314
opened May 27, 2026 by
aleroot
Contributor
Loading…
4 tasks done
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-11.