-
Notifications
You must be signed in to change notification settings - Fork 13.1k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
CANN: improve ACL graph matching
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#16166
opened Sep 22, 2025 by
noemotiovon
Loading…
ggml-cpu: Respect cpumask settings with OpenMP
ggml
changes relating to the ggml tensor library for machine learning
#16164
opened Sep 22, 2025 by
wishstudio
Loading…
vulkan: support set_rows with i32 index type
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16162
opened Sep 22, 2025 by
jeffbolznv
Loading…
vulkan: support arbitrary KV dimension in flash attention
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16160
opened Sep 21, 2025 by
jeffbolznv
Loading…
ggml : implement set_rows with i32 index
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#16159
opened Sep 21, 2025 by
CISC
Loading…
6 of 8 tasks
webui: switch to hash-based routing (alternative of #16079)
examples
server
#16157
opened Sep 21, 2025 by
isaac-mcfadyen
Loading…
vulkan: throw system error instead of SIGABRT during init on older devices
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16156
opened Sep 21, 2025 by
DmyMi
Loading…
sycl: add PAD_REFLECT_D1 operator support
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16145
opened Sep 21, 2025 by
ye-NX
Loading…
README.md : Added link to llama-cpp-jna Java binding
#16144
opened Sep 21, 2025 by
romantal
Loading…
[metal] Add fused RMS_NORM + MUL + SWIGLU for Qwen3Next
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16143
opened Sep 21, 2025 by
MemoryIt
Loading…
clang-tidy : disable warning about braces around statements
#16139
opened Sep 21, 2025 by
haiyuewa
Loading…
vulkan: 64-bit im2col
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#16135
opened Sep 20, 2025 by
jeffbolznv
Loading…
CUDA: add a fused top-K MoE kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16130
opened Sep 20, 2025 by
am17an
Loading…
rpc : use GGML_LOG_* for logging
examples
ggml
changes relating to the ggml tensor library for machine learning
#16129
opened Sep 20, 2025 by
rgerganov
Loading…
codeowners : update ownership for @ngxson and @allozuar
#16128
opened Sep 20, 2025 by
ngxson
Loading…
clang-tidy : disable warning about performance enum size
#16127
opened Sep 20, 2025 by
haiyuewa
Loading…
mtmd: more optimized build_rope_2d
examples
testing
Everything test related
#16126
opened Sep 20, 2025 by
ngxson
Loading…
codeowners : claim responsibility for ci, models, gguf-py and convert
#16124
opened Sep 20, 2025 by
CISC
Loading…
ggml : extend ggml_can_fuse to work with non-sequential nodes
ggml
changes relating to the ggml tensor library for machine learning
#16123
opened Sep 20, 2025 by
ggerganov
Loading…
ggml : add ggml_op_is_empty
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#16122
opened Sep 20, 2025 by
ggerganov
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.