Two audio files.
Example
input1: example/00001_spk1.wav
input2: example/00024_spk1.wav
(Wav file from https://huggingface.co/spaces/wenet/wespeaker_demo)
Degree of similarity.
The speakers are 84.8% similar
Welcome, human!
This model recommends additional module.
pip3 install torch torchaudio
Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.
For the sample audio,
$ python3 wespeaker.py --input1 examples/00001_spk1.wav --input2 examples/00024_spk1.wav
Specify two audio files with the --input1
and --input2
options.
By specifying the --english
option, it uses the English model
and the --chinese
option for the Chinese.
$ python3 wespeaker.py --input1 AUDIO_FILE1 --input2 AUDIO_FILE2 --english
Onnxruntime
ONNX opset=14
voxceleb_resnet34.onnx.prototxt
cnceleb_resnet34.onnx.prototxt