
Publish the latest llama.cpp? #1951

Open
curvedinf opened this issue Feb 27, 2025 · 2 comments

Comments

@curvedinf

Hello, I run an AMD card, and there have been very significant ROCm support updates (flash attention, quants, massive speed improvements) since the llama.cpp version currently bundled with llama-cpp-python.

Could you do us a big one and publish a new llama-cpp-python with the latest llama.cpp? It would be much appreciated! Thank you!

@ekcrisp

ekcrisp commented Mar 11, 2025

+1, would love to see an update to the latest llama.cpp.

@handshape

Up until a couple of weeks ago, the bindings were still close enough that you could pull upstream llama.cpp into the vendor directory and build locally. It looks like there's now a breaking change in the contract for libllama: llama_load_model_from_file got renamed to llama_model_load_from_file.
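For anyone patching their local bindings in the meantime, here is a minimal sketch of one way to absorb the rename on the Python side. The resolve_model_loader helper and its fallback logic are illustrative, not part of llama-cpp-python; it only assumes a ctypes handle to libllama, which is how the project loads the shared library.

```python
import ctypes


def resolve_model_loader(lib: ctypes.CDLL):
    """Pick whichever model-loading symbol this build of libllama exports.

    Newer llama.cpp builds export llama_model_load_from_file; older builds
    export llama_load_model_from_file. Both take a model path plus a
    llama_model_params struct and return a pointer to a llama_model.
    """
    for name in ("llama_model_load_from_file", "llama_load_model_from_file"):
        try:
            fn = getattr(lib, name)
        except AttributeError:
            # Symbol not present in this build of libllama; try the other name.
            continue
        # argtypes/restype would be declared here in the same style as the
        # other bindings in llama_cpp/llama_cpp.py.
        return fn
    raise RuntimeError("libllama exports neither model-loading symbol")
```

Binding against whichever symbol exists keeps a locally vendored, newer llama.cpp working without waiting for the package release, at the cost of carrying the fallback until the official bindings catch up.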
