voice_assistant

What is this?

Homemade vocal assistant, to practice using wav2vec2 from huggingface and class oriented programming.

After initialisation, the assistant can enter standby, waiting for some sound. If sound is detected, recording start until sound becomes sufficiently quiet for some time. The sound array is passed to a french-optimized wav2vec2 model, producing text. The text is then converted to a phonetic representation (for better robustness against false detection, such as "dis", "dit" or "dix") and comparted against a known database of orders.

How does it work?

prerequisites

todo

installation

todo

basic operation

va = assistant()

va.standby()

To-do list

Code missing functions (such as warmup of the model)
allow for activation + order in the same speech
properly reference source material (models, sound recording code, ...)

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitignore		.gitignore
README.md		README.md
S2T_tests.py		S2T_tests.py
ST2_class.py		ST2_class.py
audio_capture.py		audio_capture.py
dict_phonems.json		dict_phonems.json
known_orders.csv		known_orders.csv
logger.py		logger.py
loop.py		loop.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

voice_assistant

What is this?

How does it work?

prerequisites

installation

basic operation

To-do list

About

Releases

Packages

Languages

vldv/voice_assistant

Folders and files

Latest commit

History

Repository files navigation

voice_assistant

What is this?

How does it work?

prerequisites

installation

basic operation

To-do list

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages