-
Hi, I am a very inexperienced user when it comes to LLMs in general. I tried to follow the guide as closely as I could, but I noticed the CPU is being used instead of the GPU. Is it not possible to use an AMD GPU with Qwen2.5, or where can I find the config file that controls which hardware is used? Kind regards
Replies: 4 comments 1 reply
-
LLM inference is not part of Open-LLM-VTuber itself. If you are following the quick-start guide, I assume you are using ollama for your LLM inference. I did some digging, and apparently ollama does not support your GPU on Windows as of now. Once that support gets merged, you will probably be able to use GPU acceleration in ollama. For now, you can try:
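As a quick diagnostic in the meantime (assuming a reasonably recent ollama release), you can ask ollama itself which processor a loaded model ended up on:

```shell
# With the ollama server running and a model loaded,
# list running models; the PROCESSOR column shows
# e.g. "100% GPU" or "100% CPU".
ollama ps
```

If that column reports CPU, it confirms the GPU support gap rather than a misconfiguration on the Open-LLM-VTuber side.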
-
Thanks a lot for the answer. If I use LM Studio it utilizes my GPU; with Qwen2.5 7B I get up to 102 tokens/sec. Is it documented anywhere how to use LM Studio instead of Ollama? I read through it a few times but couldn't find an answer. Edit: I meant how to use LM Studio as the backend (if that's the right term) in LLM VTuber
-
Yeah, I just realized we don't have that in our docs; we should add it. It's pretty easy, though: there is an option called llm_provider, and you set that to LM Studio. There are also some LM Studio options in conf.yaml where you can change the relevant settings. In LM Studio itself, you might have to switch to developer mode and start the API server. I will add docs later today, so if this doesn't make sense yet, you can wait a bit.
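As a rough sketch of what that edit looks like (the key names below, other than `llm_provider`, are illustrative assumptions; match them to the LM Studio section that already exists in your own `conf.yaml`):

```yaml
# conf.yaml (sketch): point the project at LM Studio instead of ollama.
# "lmstudio_llm" and the keys under it are hypothetical names; use the
# ones already present in your config file.
llm_provider: "lmstudio_llm"

lmstudio_llm:
  # LM Studio's local server (started from its developer mode) exposes
  # an OpenAI-compatible API, by default at http://localhost:1234/v1
  base_url: "http://localhost:1234/v1"
  model: "qwen2.5-7b-instruct"  # whichever model you have loaded in LM Studio
```

The key point is that LM Studio speaks the OpenAI API format, so any backend option that accepts an OpenAI-compatible base URL can be pointed at it.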
-
I am using LM Studio. To use it, it's just a simple edit of config.yaml