Hi! Impressive AI meeting copilot with echo cancellation.
For real-time meeting transcription, SenseVoice could significantly reduce ASR latency:
Key advantages for a meeting copilot
- Non-autoregressive — single forward pass gives full transcription (no sequential token generation)
- ~50ms latency on GPU — ideal for real-time copilot scenarios
- 234M params / ~1GB VRAM — leaves room for your LLM on the same GPU
- Built-in features: VAD, speaker diarization (cam++), emotion detection
- OpenAI-compatible API — if you already use Whisper API, it's a drop-in
Quick start
from funasr import AutoModel
model = AutoModel(
model="iic/SenseVoiceSmall",
vad_model="fsmn-vad",
spk_model="cam++",
)
result = model.generate(input=audio_chunk)
Or start a server:
pip install funasr
funasr-server --device cuda # localhost:8000, OpenAI-compatible
Links
Hi! Impressive AI meeting copilot with echo cancellation.
For real-time meeting transcription, SenseVoice could significantly reduce ASR latency:
Key advantages for a meeting copilot
Quick start
Or start a server:
pip install funasr funasr-server --device cuda # localhost:8000, OpenAI-compatibleLinks