fix(amd): dashboard token metrics via Lemonade inner llama-server by Lightheartdevs · Pull Request #607 · Light-Heart-Labs/DreamServer

Lightheartdevs · 2026-03-24T14:18:45Z

Summary

Dashboard "Tokens/sec" and "Tokens Generated" always showed — and 0 on AMD/Lemonade
Lemonade wraps llama.cpp but doesn't proxy /metrics through its main port (8080)
Pass --metrics --host 0.0.0.0 to the inner llama-server via --llamacpp-args
Expose port 8001 (inner llama-server) on the Docker network
LLAMA_METRICS_PORT=8001 tells dashboard-api to query the inner process directly

Changes

docker-compose.amd.yml — added --llamacpp-args, expose: ["8001"], LLAMA_METRICS_PORT
helpers.py — metrics_port = int(os.environ.get("LLAMA_METRICS_PORT", port))

Backwards compatibility

LLAMA_METRICS_PORT defaults to the main service port. NVIDIA/CPU setups don't set it — zero change.

Test plan

AMD: dashboard shows tokens/sec after first inference
NVIDIA: dashboard metrics unchanged
Verify first-inference delay (Lemonade lazily spawns inner llama-server)

🤖 Generated with Claude Code

Lemonade wraps llama.cpp and doesn't proxy /metrics through its main API port. Pass --metrics --host 0.0.0.0 to the inner llama-server via --llamacpp-args, expose port 8001, and add LLAMA_METRICS_PORT env var to dashboard-api so it queries the inner process directly. Backwards-compatible: LLAMA_METRICS_PORT defaults to the main service port on non-Lemonade setups. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Lightheartdevs merged commit 5afa637 into main Mar 24, 2026
15 of 22 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(amd): dashboard token metrics via Lemonade inner llama-server#607

fix(amd): dashboard token metrics via Lemonade inner llama-server#607
Lightheartdevs merged 1 commit intomainfrom
fix/dashboard-lemonade-metrics

Lightheartdevs commented Mar 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Lightheartdevs commented Mar 24, 2026

Summary

Changes

Backwards compatibility

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant