Pull requests: ggml-org/llama.cpp
Mamba2 SSD
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16982
opened Nov 3, 2025 by
gabe-l-hart
•
Draft
vulkan: Use spec constants for conv2d s/d/p and kernel W/H
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16978
opened Nov 3, 2025 by
jeffbolznv
vulkan: fuse rms_norm + mul + rope (+ view + set_rows)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#16977
opened Nov 3, 2025 by
jeffbolznv
sycl: flash-attention implementation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16969
opened Nov 3, 2025 by
ye-NX
s390x: disable vxe for cross-compilation by default
ggml
changes relating to the ggml tensor library for machine learning
#16966
opened Nov 3, 2025 by
AlekseiNikiforovIBM
Refactor llm_chat_template_from_str to avoid throwing exceptions
#16965
opened Nov 3, 2025 by
AnonN10
Fix garbled output with REPACK at high thread counts
ggml
changes relating to the ggml tensor library for machine learning
#16956
opened Nov 2, 2025 by
NoahOksuz
CUDA: add implicit conv3d
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16948
opened Nov 2, 2025 by
bssrdf
Model: Minimax M2 - chat support
testing
Everything test related
#16946
opened Nov 2, 2025 by
pwilkin
Model: add openPangu-Embedded
python
python script changes
#16941
opened Nov 2, 2025 by
Lpzhan931
Add e2e tests for embedding raw flag
devops
improvements to build systems and github actions
examples
python
python script changes
testing
Everything test related
#16940
opened Nov 2, 2025 by
SamMalayek
•
Draft
doc: Windows + clang/ninja build guide format cleanup
documentation
Improvements or additions to documentation
#16939
opened Nov 2, 2025 by
jsjtxietian
CUDA: avoid mul + bias fusion when buffers are split
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16935
opened Nov 2, 2025 by
am17an
server: add minimax-m2 reasoning format override for MiniMax-M2 compatibility
examples
server
#16933
opened Nov 2, 2025 by
ServeurpersoCom
•
Draft
common: Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS)
testing
Everything test related
#16932
opened Nov 2, 2025 by
hksdpc255
hparams : add n_embd_inp() to support extended embed
examples
#16928
opened Nov 1, 2025 by
CISC
vulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle fusion
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16919
opened Nov 1, 2025 by
jeffbolznv
add TheRock HIP backend build instructions
documentation
Improvements or additions to documentation
#16915
opened Nov 1, 2025 by
lihaofd
ggml-hexagon: replace sprintf with snprintf in ops-utils.h
ggml
changes relating to the ggml tensor library for machine learning
#16913
opened Nov 1, 2025 by
chraac