autotalk — Hands-Free Voice Interface for Claude Code

Talk to your terminal. Claude Code hears you.

What It Does

Continuous mic capture → voice activity detection → local speech-to-text → inject into Claude Code's terminal input. Fully local, no cloud STT.

Stack

Mic capture: sounddevice (PortAudio)
VAD: webrtcvad (Google WebRTC, C extension)
STT: faster-whisper (CTranslate2, Whisper base.en)
Injection: AppleScript keystroke/clipboard into active terminal
TTS (output): voxtral-mcp speak tool (Kokoro-82M) — Claude Code calls it directly

Usage

# Start listening (Open Mic mode, paste delivery)
./run.sh

# Use specific mic
./run.sh --device 3

# Dry run — transcribe but don't inject
./run.sh --mode dry-run

# Better accuracy (slower)
./run.sh --model small.en

# Target specific app
./run.sh --target Terminal

Full Duplex Setup

Terminal A: ./run.sh (autotalk listens)
Terminal B: claude (Claude Code running)
Talk → autotalk transcribes → pastes into Claude Code
Claude Code responds → uses voxtral-mcp speak tool to read aloud

Files

autotalk.py — main script (mic → VAD → STT → inject)
run.sh — launcher (activates venv)
test_pipeline.py — component validation
.venv/ — Python 3.13 virtual environment

Requirements

macOS (AppleScript injection)
Python 3.11+
Microphone access (grant in System Settings > Privacy > Microphone)
Accessibility permission for AppleScript keystroke injection

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

autotalk — Hands-Free Voice Interface for Claude Code

What It Does

Stack

Usage

Full Duplex Setup

Files

Requirements

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

autotalk — Hands-Free Voice Interface for Claude Code

What It Does

Stack

Usage

Full Duplex Setup

Files

Requirements