Skip to content

skip_vocoder and voice preset save/load#10

Merged
TroyHernandez merged 4 commits into
mainfrom
skip-vocoder-voice-io
Jun 12, 2026
Merged

skip_vocoder and voice preset save/load#10
TroyHernandez merged 4 commits into
mainfrom
skip-vocoder-voice-io

Conversation

@TroyHernandez

Copy link
Copy Markdown
Contributor

Two small 0.1.7-parity conveniences:

  • generate(skip_vocoder = TRUE) stops after flow matching and returns mel (batch x 80 x frames, 50 frames/s) instead of audio — Python 0.1.7's skip_vocoder escape hatch. GPU-validated: mel frame count matches the vocoded run's duration exactly (1.72 s both ways on the same seed).
  • save_voice_embedding() / load_voice_embedding() (new R/voice_io.R): torch_save-based voice presets, the analogue of Python's Conditionals.save()/load(). Round trip through cuda is exact (max diff 0). Format is R torch serialization, documented as not compatible with Python's .pt.

New tinytest round-trip coverage (test_voice_io.R); full suite passes. Version bumped to 0.1.0.6 with NEWS entries.

- generate(skip_vocoder = TRUE) stops after flow matching and returns
  the mel (batch x 80 x frames) instead of audio (Python 0.1.7 parity);
  validated on GPU: mel frame count matches the vocoded run exactly.
- New R/voice_io.R: save_voice_embedding()/load_voice_embedding(),
  torch_save-based presets (analogue of Python's Conditionals.save/
  load). Round trip through cuda is exact.
@TroyHernandez TroyHernandez merged commit 3a21d4a into main Jun 12, 2026
4 checks passed
@TroyHernandez TroyHernandez deleted the skip-vocoder-voice-io branch June 12, 2026 18:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant