Ability to get a list of loaded models and unload a model by request #3378

Open
Nyralei opened this issue Aug 25, 2024 · 4 comments
Assignees
Labels
enhancement New feature or request

Comments

@Nyralei
Contributor

Nyralei commented Aug 25, 2024

No description provided.

Nyralei added the enhancement label Aug 25, 2024
@dave-gray101
Collaborator

https://github.com/mudler/LocalAI/blob/master/core/http/endpoints/localai/backend_monitor.go

These endpoints already show which backends are loaded and allow them to be unloaded?

@Nyralei
Contributor Author

Nyralei commented Aug 25, 2024

https://github.com/mudler/LocalAI/blob/master/core/http/endpoints/localai/backend_monitor.go

These endpoints already show which backends are loaded and allow them to be unloaded?

Thanks for pointing out /backend/shutdown, but it only works when the model name ends with ".bin" (https://github.com/mudler/LocalAI/blob/master/core/services/backend_monitor.go#L42); otherwise it appends ".bin" to the model name. In my case the model name ends with ".gguf".

As for the /backend/monitor endpoint: it doesn't show which models are currently loaded. It only returns some metrics, and only for the single model set in the request (which also has to end with ".bin").
I tried calling both with

{
    "model": "ggml-whisper-large-v3.bin"
}

1. /backend/monitor responds with
{
    "state": 1,
    "memory": {
        "total": 53627682816,
        "breakdown": {
            "gopsutil-RSS": 681861120
        }
    }
}
2. /backend/shutdown shuts the backend down properly.

With "model": "gemma-2-27b-it-Q5_K_S.gguf", both calls fail:

{
    "error": {
        "code": 500,
        "message": "backend gemma-2-27b-it-Q5_K_S.gguf.bin is not currently loaded",
        "type": ""
    }
}
{
    "error": {
        "code": 500,
        "message": "model gemma-2-27b-it-Q5_K_S.gguf.bin not found",
        "type": ""
    }
}
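For illustration, the errors above ("gemma-2-27b-it-Q5_K_S.gguf.bin") are consistent with the lookup unconditionally appending ".bin" to the requested name. A minimal sketch of how the name resolution could instead behave, assuming a fix along these lines (modelKey and the extension list are hypothetical, not LocalAI's actual API):

```go
package main

import (
	"fmt"
	"strings"
)

// modelKey returns the key used to look up a loaded backend.
// Instead of always appending ".bin", it leaves names with a
// recognized model extension untouched, so ".gguf" models resolve.
func modelKey(model string) string {
	for _, ext := range []string{".bin", ".gguf"} {
		if strings.HasSuffix(model, ext) {
			return model
		}
	}
	// Preserve the legacy behavior for extensionless names.
	return model + ".bin"
}

func main() {
	fmt.Println(modelKey("ggml-whisper-large-v3.bin"))  // unchanged
	fmt.Println(modelKey("gemma-2-27b-it-Q5_K_S.gguf")) // unchanged
	fmt.Println(modelKey("legacy-model"))               // gets ".bin" appended
}
```

This is only a sketch of the suffix handling; the real fix would live in core/services/backend_monitor.go and would need to match however LocalAI actually keys loaded backends.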

@dave-gray101 dave-gray101 self-assigned this Aug 25, 2024
@dave-gray101
Collaborator

Thanks for the updated comment!

That sounds like a big bug to me - I'll see if I can investigate this soon.

@jokerosky

It seems the endpoint only reports the status of a single requested model (https://github.com/mudler/LocalAI/blob/master/core/http/endpoints/localai/backend_monitor.go#L23C34-L23C39).
Maybe listing all models should be a separate endpoint, or the existing one could accept something like '*' instead of a model name?

3 participants