Quickstart - Inference Use Case #26

benwilcock · 2024-07-08T10:17:38Z

Hi There!

If you just wanted to deploy RHEL AI as an inference server using Granite as the LLM and vLLM as the API server, what is the correct procedure? Is this easy to do? Is there already a boot image or container image for this? If it can be done, would you be prepared to document it for those of us needing a quickstart guide?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quickstart - Inference Use Case #26

Quickstart - Inference Use Case #26

benwilcock commented Jul 8, 2024 •

edited

Loading

Quickstart - Inference Use Case #26

Quickstart - Inference Use Case #26

Comments

benwilcock commented Jul 8, 2024 • edited Loading

benwilcock commented Jul 8, 2024 •

edited

Loading