Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

CUDA: enable FA for FP32 KV cache ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16546 opened Oct 12, 2025 by JohannesGaessler Loading…
vulkan: Improve build time for MSVC ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16545 opened Oct 12, 2025 by jeffbolznv Loading…
tests: increase NMSE threshold for q5_1 MUL_MAT tests testing Everything test related
#16544 opened Oct 12, 2025 by Erics38 Loading…
vulkan: Support FA with K/V in F32 ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#16543 opened Oct 12, 2025 by jeffbolznv Loading…
Add metal conv transpose 2d Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#16542 opened Oct 12, 2025 by iliailmer Loading…
1 task done
CUDA: fix numerical issue in tile FA kernel ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16540 opened Oct 12, 2025 by JohannesGaessler Loading…
metal: add support for opt_step_sgd Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#16539 opened Oct 12, 2025 by cern1710 Loading…
Vulkan MMQ Integer Dot Refactor and K-Quant support ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16536 opened Oct 12, 2025 by 0cc4m Draft
1 of 5 tasks
Update close-issue.yml devops improvements to build systems and github actions
#16535 opened Oct 12, 2025 by barneysspeedshop Draft
metal : FA support F32 K and V Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#16531 opened Oct 12, 2025 by ggerganov Loading…
metal: add support for LOG op (f32, f16) Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#16530 opened Oct 12, 2025 by RD-zhang1234 Loading…
graph : support cacheless embeddings with FA and iSWA ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16528 opened Oct 12, 2025 by ggerganov Loading…
Leverage the existing GGML_F32_VEC helpers to vectorize ggml_vec_set_f32 for faster fills ggml changes relating to the ggml tensor library for machine learning
#16522 opened Oct 11, 2025 by sirus20x6 Loading…
ggml : fix build broken with -march=armv9-a on MacOS ggml changes relating to the ggml tensor library for machine learning
#16520 opened Oct 11, 2025 by DamonFool Loading…
CUDA: add fp kernel for larger batch size MoE ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#16512 opened Oct 11, 2025 by am17an Loading…
vendor : sync minja
#16500 opened Oct 10, 2025 by CISC Loading…
Switch to using Ubuntu 25.10 vulkan/mesa devops improvements to build systems and github actions
#16497 opened Oct 10, 2025 by ericcurtin Loading…
graph : reuse SSM graphs
#16490 opened Oct 9, 2025 by ggerganov Loading…
ProTip! Add no:assignee to see everything that’s not assigned.