Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

opencl: fix couple crashes ggml changes relating to the ggml tensor library for machine learning
#12795 opened Apr 7, 2025 by linehill Loading…
Support for OuteTTS 1.0 examples python python script changes
#12794 opened Apr 7, 2025 by edwko Draft
llama : Support llama 4 text-only (WIP) python python script changes
#12791 opened Apr 7, 2025 by ngxson Draft
1 of 3 tasks
SYCL: Add fp16 type support to unary op kernels ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12788 opened Apr 7, 2025 by qnixsynapse Draft
[CANN]Support Opt CONV_TRANSPOSE_1D and ELU Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#12786 opened Apr 7, 2025 by noemotiovon Loading…
vulkan: Use fp16 for the flash attention P*V multiplication ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12783 opened Apr 6, 2025 by jeffbolznv Loading…
ci: fix issue in android build(https://github.com/ggml-org/llama.cpp/issues/12638) devops improvements to build systems and github actions
#12775 opened Apr 6, 2025 by zhouwg Loading…
ggml: use _mm[512/256]_dpbusd[_avx]_epi32 to directly accumulate into the result register ggml changes relating to the ggml tensor library for machine learning
#12773 opened Apr 5, 2025 by SongXiaoXi Loading…
opencl: better identify Adreno GPU ggml changes relating to the ggml tensor library for machine learning
#12760 opened Apr 4, 2025 by lhez Loading…
Added all CPU to Docker GPU images for 'token_embd.weight' compatibility devops improvements to build systems and github actions
#12749 opened Apr 4, 2025 by rudiservo Loading…
(wip) support ultravox audio input examples python python script changes
#12745 opened Apr 3, 2025 by ngxson Draft
sycl:remove redundant memcopy in function ggml_backend_sycl_buffer_set_tensor ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12734 opened Apr 3, 2025 by zhouwg Loading…
sync : ggml ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs script Script related
#12732 opened Apr 3, 2025 by ggerganov Loading…
Update llama-quant.cpp llama_tensor_get_type with DeepSeek friendly modifications ggml changes relating to the ggml tensor library for machine learning
#12727 opened Apr 3, 2025 by bartowski1182 Loading…
Fix: Abnormal exit on Android devices ggml changes relating to the ggml tensor library for machine learning
#12712 opened Apr 2, 2025 by biyou Loading…
WIP: Add support for CogAgent examples python python script changes server
#12679 opened Mar 31, 2025 by Tianyue-Zhao Draft
update rope_multi: ggml changes relating to the ggml tensor library for machine learning
#12665 opened Mar 31, 2025 by foldl Loading…
llama : nit, DeepSeek V1 MoE is 16B and GigaChat is 20B
#12652 opened Mar 30, 2025 by CISC Loading…
tts : implement sesame CSM + Mimi decoder examples python python script changes
#12648 opened Mar 29, 2025 by ngxson Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.