
feat: lazy daemon mode with voxtream-server streaming #36

Merged
pszymkowiak merged 1 commit into main from feat/daemon-voxtream-tui on Mar 18, 2026
Conversation

@pszymkowiak
Contributor

Summary

  • Add vox daemon start/stop/status — keeps heavy TTS models warm in memory
  • Daemon manages voxtream-server (FastAPI) as child process, proxies via WebSocket
  • Transparent routing: vox -b voxtream "text" auto-routes through daemon if running
  • Auto-shutdown after idle timeout (5min default)
  • Real streaming benchmark: 170ms first-frame on RTX 4070 Ti SUPER (paper: 74ms)

Benchmark (streaming, model warm)

| Platform                 | First audio frame |
| ------------------------ | ----------------- |
| M2 Pro (CPU)             | 3.3s              |
| RTX 4070 Ti SUPER (CUDA) | 170ms             |

Closes #30

Test plan

  • cargo build --features metal passes
  • `vox daemon start` → `vox daemon status` → `vox daemon stop` lifecycle
  • voxtream via daemon generates and plays audio
  • Idle auto-shutdown after timeout
  • Streaming benchmark on Mac and CUDA

- Add `vox daemon start/stop/status` — local HTTP server keeps models warm
- Daemon manages voxtream-server as child process (WebSocket proxy)
- Transparent routing: heavy backends auto-route through daemon if running
- Idle auto-shutdown (default 5min, configurable)
- Update README with real benchmark data (streaming: 170ms first-frame CUDA)
- Add .cargo/config.toml with Metal aliases for macOS dev builds
- Add tokio rt-multi-thread, macros, signal, time features

Benchmark results (voxtream streaming, model warm):
  M2 Pro CPU:           3.3s first audio
  RTX 4070 Ti SUPER:    170ms first audio (paper: 74ms)

Closes #30
@pszymkowiak pszymkowiak merged commit 47a7609 into main Mar 18, 2026
0 of 3 checks passed
@pszymkowiak
Contributor Author

pszymkowiak commented Mar 18, 2026

🧞 wshm · Automated triage by AI

📊 Automated PR Analysis

Type: feature
Risk: 🔴 high

Summary

Adds a lazy daemon mode (vox daemon start/stop/status) that keeps heavy TTS models warm in memory, manages voxtream-server (FastAPI) as a child process with WebSocket proxying, and transparently routes vox -b voxtream calls through the daemon when running. Includes auto-shutdown after an idle timeout (5min default) and achieves 170ms first-frame latency on RTX 4070 Ti SUPER.

Review Checklist

  • Tests present
  • Breaking change
  • Docs updated

Linked issues: #30


🤖 Analyzed automatically by wshm · This is an automated analysis, not a human review.



Development

Successfully merging this pull request may close these issues.

feat: lazy daemon for warm model inference (~1-2s instead of 20-60s)
