feat: add VoXtream2 backend and interactive TUI #31

Merged: pszymkowiak merged 1 commit into main from feat/voxtream-backend-tui on Mar 18, 2026

Conversation

@pszymkowiak
Contributor

Summary

Benchmark results

| Backend | M2 Pro (CPU) | RTX 4070 Ti SUPER |
|---|---|---|
| say | 3s | macOS only |
| kokoro | 10s | ~10s |
| voxtream | 68s / 8s warm | 44s / 22s warm |
| qwen-native | 11m33s / 3s warm | ~30s / ~2s warm |

Test plan

  • `cargo build` passes
  • `vox -b voxtream "test"` generates and plays audio (macOS)
  • `vox -b voxtream "test"` works on Linux CUDA (maria, RTX 4070 Ti SUPER)
  • `vox setup` launches TUI, navigates, saves preferences
  • voxtream auto-detected in `~/.local/venvs/voxtream/bin/`
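
For reviewers who want to reason about the auto-detection item, here is a minimal sketch of how such a lookup could work using only the Rust standard library. It is not taken from the PR's diff: the function names `find_in_dirs` and `detect_voxtream`, the use of `HOME`, and the "PATH first, then venv" ordering are all assumptions made for illustration.

```rust
use std::env;
use std::path::{Path, PathBuf};

/// Return the first `<dir>/<name>` that exists as a regular file.
/// Note: this sketch does not check the executable bit.
fn find_in_dirs<I>(name: &str, dirs: I) -> Option<PathBuf>
where
    I: IntoIterator<Item = PathBuf>,
{
    dirs.into_iter().map(|d| d.join(name)).find(|p| p.is_file())
}

/// Hypothetical detection order (an assumption, not the PR's code):
/// every directory on PATH, then ~/.local/venvs/voxtream/bin.
fn detect_voxtream() -> Option<PathBuf> {
    let path_dirs = env::var_os("PATH")
        .map(|p| env::split_paths(&p).collect::<Vec<_>>())
        .unwrap_or_default();
    let venv_dir = env::var_os("HOME")
        .map(|h| Path::new(&h).join(".local/venvs/voxtream/bin"));
    find_in_dirs("voxtream", path_dirs.into_iter().chain(venv_dir))
}
```

Calling `detect_voxtream()` returns `Some(path)` when a `voxtream` file is found and `None` otherwise, which lets the caller report a missing backend instead of failing at spawn time.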

- Add voxtream backend (0.5B zero-shot TTS, CC-BY-4.0)
  - Auto-detects voxtream binary in PATH or ~/.local/venvs/voxtream/
  - Generates default prompt audio via macOS say for non-clone usage
  - Supports voice cloning via ref_audio and speaking rate control
- Add `vox setup` TUI (ratatui + crossterm) for human configuration
  - 5-panel layout: Backend, Voice, Language, Style, Config
  - Live voice testing (T), save preferences (S), vim-style navigation
  - AI agents continue using CLI flags instead
- Update README with 5-backend benchmark (M2 Pro + RTX 4070 Ti SUPER)
- Register voxtream as clone-capable backend in MCP tool descriptions
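
The 5-panel layout with vim-style navigation described above can be modeled as a small state machine that is independent of the rendering library. The following is a sketch under stated assumptions, not the PR's implementation: the `Nav`, `Panel`, and `Action` types, the `h`/`j`/`k`/`l` and `q` bindings, and the wrap-around between panels are hypothetical; only the panel names and the T (test) and S (save) keys come from the description.

```rust
/// The five panels named in the PR description.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Panel { Backend, Voice, Language, Style, Config }

const PANELS: [Panel; 5] =
    [Panel::Backend, Panel::Voice, Panel::Language, Panel::Style, Panel::Config];

/// What the caller should do after a key press (hypothetical type).
#[derive(Debug, PartialEq, Eq)]
enum Action { None, TestVoice, Save, Quit }

/// Cursor state: which panel is focused and which row inside it.
struct Nav { panel: usize, row: usize }

impl Nav {
    fn new() -> Self { Nav { panel: 0, row: 0 } }

    fn current(&self) -> Panel { PANELS[self.panel] }

    /// Apply one key. `rows_in_panel` is the item count of the focused panel.
    /// Assumed bindings: h/l switch panels (wrapping), j/k move within a panel.
    fn on_key(&mut self, key: char, rows_in_panel: usize) -> Action {
        match key {
            'h' => { self.panel = (self.panel + PANELS.len() - 1) % PANELS.len(); self.row = 0; }
            'l' => { self.panel = (self.panel + 1) % PANELS.len(); self.row = 0; }
            'j' => if self.row + 1 < rows_in_panel { self.row += 1 },
            'k' => self.row = self.row.saturating_sub(1),
            't' | 'T' => return Action::TestVoice,
            's' | 'S' => return Action::Save,
            'q' => return Action::Quit,
            _ => {}
        }
        Action::None
    }
}
```

In a real ratatui + crossterm loop, the `char` would come from the key event and the draw code would read `nav.current()` and `nav.row` to highlight the focused item. Keeping the state machine free of terminal types makes it straightforward to unit test.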
@pszymkowiak merged commit 5c7799c into main on Mar 18, 2026
0 of 3 checks passed
@pszymkowiak
Contributor Author

pszymkowiak commented Mar 18, 2026

🧞 wshm · Automated triage by AI

📊 Automated PR Analysis

Type feature
🟡 Risk medium

Summary

Adds VoXtream2 as a 5th TTS backend with zero-shot voice cloning support, introduces an interactive TUI (vox setup) built with ratatui for voice configuration, and updates the README with benchmark data across M2 Pro and RTX 4070 Ti SUPER hardware.

Review Checklist

  • Tests present
  • Breaking change
  • Docs updated

Linked issues: #30


🤖 Analyzed automatically by wshm · This is an automated analysis, not a human review.
