You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This package is a key interface for large language model inference on NVIDIA GPUs, and benefits the inference of various language and multimodal models including Llama and Stable Diffusion.
Package name
TensorRT-LLM
Package version
Newest
Package website
https://docs.nvidia.com/tensorrt-llm/index.html
https://github.com/NVIDIA/TensorRT-LLM
Package availability
https://pypi.org/project/tensorrt-llm/
https://github.com/NVIDIA/TensorRT-LLM/releases
Additional comments
This package is a key interface for large language model inference on NVIDIA GPUs, and benefits the inference of various language and multimodal models including Llama and Stable Diffusion.
This depends on TensorRT (#25661).
Package is not available
No previous issues or open PRs
The text was updated successfully, but these errors were encountered: