Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Package request: TensorRT-LLM #28973

Open
2 tasks done
ehfd opened this issue Jan 29, 2025 · 0 comments
Open
2 tasks done

Package request: TensorRT-LLM #28973

ehfd opened this issue Jan 29, 2025 · 0 comments

Comments

@ehfd
Copy link
Member

ehfd commented Jan 29, 2025

Package name

TensorRT-LLM

Package version

Newest

Package website

https://docs.nvidia.com/tensorrt-llm/index.html

https://github.com/NVIDIA/TensorRT-LLM

Package availability

https://pypi.org/project/tensorrt-llm/

https://github.com/NVIDIA/TensorRT-LLM/releases

Additional comments

This package is a key interface for large language model inference on NVIDIA GPUs, and benefits the inference of various language and multimodal models including Llama and Stable Diffusion.

This depends on TensorRT (#25661).

Package is not available

  • The package is not available on conda-forge.

No previous issues or open PRs

  • No previous issue exists and no PR has been opened.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

No branches or pull requests

1 participant