SparkVox

SparkVox is a training framework focused on speech generation. It also supports a range of related speech tasks, including speaker attribute recognition, emotion recognition, audio codecs, and speech synthesis.

Supported Tasks

Speaker Attribute Recognition
- Age prediction
- Gender prediction
Codec
- BiCodec
- BigCodec
Speech Synthesis
- SparkTTS

Project Structure

bins:
- train_pl: The main training entry point for all tasks.
egs:
- task (e.g. codec, speech_synthesis): Example training scripts for each task.
sparkvox:
- models: Model implementations for different tasks.
- tools: Utilities for data processing, model inference, and feature extraction.
- utils: Common utilities, such as reading and processing audio files, and general training tools.
