Real-Time Speech-to-Text Translation Support #58

hu-ke · 2025-01-24T03:39:41Z

Description of the feature request:

Instead of waiting for a turn of speech to complete (VAD mode), would it be possible to stream the generated results in real time, while the speaker is still talking?

What problem are you trying to solve with this feature?

Suppose I am in a Japanese-language interview, but my Japanese is not very strong. I would like to build an app with the Gemini Multimodal API to assist me with real-time speech-to-text translation.

Any other information you'd like to share?

No response
