
Publish the latest llama.cpp? #1951

Open
curvedinf opened this issue Feb 27, 2025 · 2 comments

Comments

@curvedinf

Hello, I run an AMD card, and there have been very significant ROCm support updates (flash attention, quants, massive speed improvements) since the llama.cpp version currently bundled with llama-cpp-python.

Could you do us a big one and publish a new llama-cpp-python with the latest llama.cpp? It would be much appreciated! Thank you!

@ekcrisp

ekcrisp commented Mar 11, 2025

+1, would love to see an update to the latest llama.cpp.

@handshape

Up until a couple of weeks ago, the bindings were still close enough that you could pull upstream llama.cpp into the vendor directory and build locally. It looks like there's now a breaking change in the contract for libllama: llama_load_model_from_file got renamed to llama_model_load_from_file.
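For anyone patching their local bindings in the meantime, here is a minimal sketch of one way to absorb the rename on the Python side. The resolve_model_loader helper and its fallback logic are illustrative, not part of llama-cpp-python; it only assumes a ctypes handle to libllama, which is how the project loads the shared library.

```python
import ctypes


def resolve_model_loader(lib: ctypes.CDLL):
    """Pick whichever model-loading symbol this build of libllama exports.

    Newer llama.cpp builds export llama_model_load_from_file; older builds
    export llama_load_model_from_file. Both take a model path plus a
    llama_model_params struct and return a pointer to a llama_model.
    """
    for name in ("llama_model_load_from_file", "llama_load_model_from_file"):
        try:
            fn = getattr(lib, name)
        except AttributeError:
            # Symbol not present in this build of libllama; try the other name.
            continue
        # argtypes/restype would be declared here in the same style as the
        # other bindings in llama_cpp/llama_cpp.py.
        return fn
    raise RuntimeError("libllama exports neither model-loading symbol")
```

Binding against whichever symbol exists keeps a locally vendored, newer llama.cpp working without waiting for the package release, at the cost of carrying the fallback until the official bindings catch up.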
