Feat: add stream inference (Inference Overview of Voice Cloning and Inference Overview of Controlled Generation) #118

weedge · 2025-03-17T09:43:23Z

feat:

add stream inference (Inference Overview of Voice Cloning and Inference Overview of Controlled Generation)

# Inference Overview of Controlled Generation
PYTHONPATH=./ python cli/inference_stream.py \
    --text "身临其境，换新体验。塑造开源语音合成新范式，让智能语音更自然。" \
    --save_dir "example/results" \
    --model_dir ../../pretrained_models/SparkTTS-0.5B \
    --gender female --pitch  moderate --speed high

# Inference Overview of Voice Cloning
# default use static batch is ok
PYTHONPATH=./ python cli/inference_stream.py \
    --text "身临其境，换新体验。塑造开源语音合成新范式，让智能语音更自然。" \
    --save_dir "example/results" \
    --model_dir ../../pretrained_models/SparkTTS-0.5B \
    --prompt_text "吃燕窝就选燕之屋，本节目由26年专注高品质燕窝的燕之屋冠名播出。豆奶牛奶换着喝，营养更均衡，本节目由豆本豆豆奶特约播出。" \
    --prompt_speech_path "example/prompt_audio.wav"

Signed-off-by: weedge <[email protected]>

…nference Overview of Controlled Generation) Signed-off-by: weedge <[email protected]>

Signed-off-by: weedge <[email protected]>

whaozl · 2025-04-10T05:24:16Z

Hello, could you clarify whether this 'merge' refers to real-time streaming (processed and played simultaneously, rather than being played after full synthesis)?"
@weedge

weedge added 8 commits March 12, 2025 16:27

fix: test bicodec to print model and params

9cff5eb

Signed-off-by: weedge <[email protected]>

fix: add cli/__init__.py

f127b85

Signed-off-by: weedge <[email protected]>

add TOKENIZER_PATH env for tokenizer paser

58c8d10

Signed-off-by: weedge <[email protected]>

assert tokens==inputs

27e0648

Signed-off-by: weedge <[email protected]>

Merge branch 'SparkAudio:main' into main

dfae8d7

feat: add stream inference (Inference Overview of Voice Cloning and I…

26b547d

…nference Overview of Controlled Generation) Signed-off-by: weedge <[email protected]>

rm triton_python_backend_utils.py

8063ce0

Signed-off-by: weedge <[email protected]>

add case

b7aa7b1

Signed-off-by: weedge <[email protected]>

weedge mentioned this pull request Mar 17, 2025

feat: add spark tts BiCodec and spark tts support batch stream ai-bot-pro/achatbot#130

Merged

fix typo

24b6327

Signed-off-by: weedge <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Feat: add stream inference (Inference Overview of Voice Cloning and Inference Overview of Controlled Generation) #118

Feat: add stream inference (Inference Overview of Voice Cloning and Inference Overview of Controlled Generation) #118

Uh oh!

weedge commented Mar 17, 2025

Uh oh!

whaozl commented Apr 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Feat: add stream inference (Inference Overview of Voice Cloning and Inference Overview of Controlled Generation) #118

Are you sure you want to change the base?

Feat: add stream inference (Inference Overview of Voice Cloning and Inference Overview of Controlled Generation) #118

Uh oh!

Conversation

weedge commented Mar 17, 2025

Uh oh!

whaozl commented Apr 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants