You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm running tts on spanish and hindi language. The audio generated are better by previous repo. New repo seems to add female mixture to generated audio.
Here is the sample audio and the output generated by both repo.
I'm using the same code for both.
importtorchfromTTS.apiimportTTS# Get devicedevice="cuda"iftorch.cuda.is_available() else"cpu"# Init TTStts=TTS("tts_models/multilingual/multi-dataset/xtts_v2").to(device)
tts.tts_to_file(text="After this, we will talk about a player who will give a lot of respect to Indian fans in this world cup. I will just tell you about this player who announced his entry in International Cricket and did a great performance. From today, we will meet you every day at EAM Cricket World Cup 2007.", speaker_wav="../audio/urdu.wav", language="en", file_path="output.wav")
Install old TTS and new TTS separately and test given code.
importtorchfromTTS.apiimportTTS# Get devicedevice="cuda"iftorch.cuda.is_available() else"cpu"# Init TTStts=TTS("tts_models/multilingual/multi-dataset/xtts_v2").to(device)
tts.tts_to_file(text="After this, we will talk about a player who will give a lot of respect to Indian fans in this world cup. I will just tell you about this player who announced his entry in International Cricket and did a great performance. From today, we will meet you every day at EAM Cricket World Cup 2007.", speaker_wav="../audio/urdu.wav", language="en", file_path="output.wav")
Thank you for the report! There is indeed an issue with voice cloning and I've identified the source. I should be able to release a fix in ~2 days. In the meantime you can use version 0.24.3 of this fork.
Describe the bug
I'm running tts on spanish and hindi language. The audio generated are better by previous repo. New repo seems to add female mixture to generated audio.
Here is the sample audio and the output generated by both repo.
I'm using the same code for both.
data.zip
To Reproduce
Install old TTS and new TTS separately and test given code.
Expected behavior
voice quality not good.
Logs
No response
Environment
Additional context
No response
The text was updated successfully, but these errors were encountered: