Phone Agent (macOS Voice Agent)

On-device macOS voice agent that captures FaceTime/Continuity call audio, transcribes with Whisper, generates replies with Phi-3 Mini via llama.cpp, and responds with a cloned user voice.

Features

  • Privacy-first: All AI inference runs on-device - no cloud APIs
  • Zero-config models: AI models download automatically on first launch
  • Auto-managed LLM server: The app starts/stops llama.cpp automatically
  • Guided setup: In-app wizard walks you through all setup steps
  • Voice cloning: Train the agent to speak in your voice

Quick Start

1. Build Dependencies (One-Time)

cd /path/to/phone-agent
bash Scripts/build-frameworks.sh

2. Build & Run

Open PhoneAgentApp/PhoneAgentApp.xcodeproj in Xcode, build, and run.

3. Follow the Setup Wizard

The app will guide you through the following steps (a minimal permission-request sketch follows the list):

  1. Microphone access - For voice recording
  2. Screen recording - For capturing FaceTime audio
  3. BlackHole setup - For routing agent voice to calls
  4. Voice training (optional) - To clone your voice
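
Steps 1 and 2 correspond to standard macOS permission prompts. A minimal sketch of how an app can request them in Swift; this is illustrative only, PhoneAgent's actual flow lives in its setup wizard:

import AVFoundation
import CoreGraphics

// Sketch only: request microphone access, then check and request screen-recording access.
func requestCallPermissions() {
    // 1. Microphone - needed for voice recording and voice training
    AVCaptureDevice.requestAccess(for: .audio) { granted in
        print("Microphone access granted:", granted)
    }

    // 2. Screen recording - needed to capture FaceTime's call audio
    if !CGPreflightScreenCaptureAccess() {
        // Triggers the system prompt and adds the app to the Screen Recording list
        _ = CGRequestScreenCaptureAccess()
    }
}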

The setup wizard includes:

  • Auto-detection of BlackHole installation (see the detection sketch below)
  • One-click copy of install command
  • Direct links to System Settings and FaceTime
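
Under the hood, auto-detection comes down to scanning CoreAudio's device list for a device whose name contains "BlackHole". A rough sketch of that check; the function name is illustrative, not the app's API:

import CoreAudio
import Foundation

// Sketch: report BlackHole as installed if any CoreAudio device name contains "BlackHole".
func isBlackHoleInstalled() -> Bool {
    var address = AudioObjectPropertyAddress(
        mSelector: kAudioHardwarePropertyDevices,
        mScope: kAudioObjectPropertyScopeGlobal,
        mElement: kAudioObjectPropertyElementMain)

    // Ask the system how many audio devices it reports.
    var dataSize: UInt32 = 0
    guard AudioObjectGetPropertyDataSize(AudioObjectID(kAudioObjectSystemObject),
                                         &address, 0, nil, &dataSize) == noErr else { return false }

    var deviceIDs = [AudioDeviceID](repeating: 0, count: Int(dataSize) / MemoryLayout<AudioDeviceID>.size)
    guard AudioObjectGetPropertyData(AudioObjectID(kAudioObjectSystemObject),
                                     &address, 0, nil, &dataSize, &deviceIDs) == noErr else { return false }

    // Check each device's human-readable name.
    for id in deviceIDs {
        var nameAddress = AudioObjectPropertyAddress(
            mSelector: kAudioObjectPropertyName,
            mScope: kAudioObjectPropertyScopeGlobal,
            mElement: kAudioObjectPropertyElementMain)
        var name: CFString? = nil
        var nameSize = UInt32(MemoryLayout<CFString?>.size)
        if AudioObjectGetPropertyData(id, &nameAddress, 0, nil, &nameSize, &name) == noErr,
           let name, (name as String).contains("BlackHole") {
            return true
        }
    }
    return false
}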

How It Works

FaceTime Call
      |
      v
[Screen Capture] --> [ASR/Whisper] --> [LLM/Phi-3] --> [TTS] --> [BlackHole] --> FaceTime Mic
      |                     |                |                        |
   Caller audio       Transcription    AI Response              Agent speaks
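
The hop from transcription to reply goes through the local llama.cpp server. A minimal sketch of that request, assuming the server listens on localhost port 8080 and exposes the OpenAI-compatible chat endpoint; the port and prompt are assumptions, not the app's configuration:

import Foundation

// Sketch: send a transcribed utterance to a local llama.cpp server and return the generated reply.
func generateReply(to transcript: String) async throws -> String {
    var request = URLRequest(url: URL(string: "http://127.0.0.1:8080/v1/chat/completions")!)
    request.httpMethod = "POST"
    request.setValue("application/json", forHTTPHeaderField: "Content-Type")
    request.httpBody = try JSONSerialization.data(withJSONObject: [
        "messages": [
            ["role": "system", "content": "You are answering a phone call on the user's behalf. Be brief."],
            ["role": "user", "content": transcript]
        ]
    ])

    let (data, _) = try await URLSession.shared.data(for: request)
    // Pull choices[0].message.content out of the JSON response.
    let json = try JSONSerialization.jsonObject(with: data) as? [String: Any]
    let choices = json?["choices"] as? [[String: Any]]
    let message = (choices?.first?["message"] as? [String: Any])?["content"] as? String
    return message ?? ""
}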

Automatic Features

  • Model Download - The Phi-3 model auto-downloads to ~/Library/Application Support/PhoneAgent/Models/
  • LLM Server - The llama.cpp server auto-starts, is health-monitored, and its status is shown in the UI (see the sketch after this list)
  • BlackHole Detection - The app detects if BlackHole is installed and guides you through setup
  • Call Detection - Automatically activates when FaceTime is running
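
The LLM Server item amounts to spawning the bundled llama.cpp server as a child process and polling it until it answers. A rough sketch, assuming known binary/model paths and the server's /health endpoint; the paths, port, and flags here are illustrative, not PhoneAgent's actual configuration:

import Foundation

// Sketch: launch a llama.cpp server as a child process, then wait until it reports healthy.
func startLLMServer(serverBinary: URL, modelPath: URL) throws -> Process {
    let process = Process()
    process.executableURL = serverBinary
    process.arguments = ["-m", modelPath.path, "--port", "8080"]
    try process.run()
    return process
}

func waitUntilHealthy(port: Int = 8080, attempts: Int = 30) async -> Bool {
    let url = URL(string: "http://127.0.0.1:\(port)/health")!
    for _ in 0..<attempts {
        if let (_, response) = try? await URLSession.shared.data(from: url),
           (response as? HTTPURLResponse)?.statusCode == 200 {
            return true   // server is up and the model is loaded
        }
        try? await Task.sleep(nanoseconds: 1_000_000_000)  // retry once per second
    }
    return false
}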

Why BlackHole?

macOS doesn't provide an API to inject audio into another app's microphone. BlackHole creates a virtual audio device that acts as a bridge:

Your App (TTS Output) --> BlackHole --> FaceTime (Microphone Input)
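
In code, the bridge is a matter of pointing the audio output at the BlackHole device instead of the speakers. A sketch assuming an AVAudioEngine-based playback path and an already-resolved BlackHole AudioDeviceID; both are assumptions about the implementation:

import AVFoundation
import AudioToolbox

// Sketch: make an AVAudioEngine render to a specific output device (e.g. BlackHole 2ch),
// so whatever the engine plays shows up as FaceTime's "microphone" input.
func route(_ engine: AVAudioEngine, toOutputDevice deviceID: AudioDeviceID) -> Bool {
    guard let outputUnit = engine.outputNode.audioUnit else { return false }
    var device = deviceID
    let status = AudioUnitSetProperty(outputUnit,
                                      kAudioOutputUnitProperty_CurrentDevice,
                                      kAudioUnitScope_Global,
                                      0,                       // output element
                                      &device,
                                      UInt32(MemoryLayout<AudioDeviceID>.size))
    return status == noErr
}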

The setup wizard makes this easy:

  1. Detects if BlackHole is already installed
  2. Provides the install command with one-click copy
  3. Links directly to FaceTime settings for configuration

Project Structure

PhoneAgent/           # Swift package (main library)
  App/                # App state, services, permissions
  Audio/              # Audio capture, injection, VAD, BlackHole detection
  AI/                 # LLM, ASR, TTS, orchestration
  UI/                 # SwiftUI views
  Models/             # Data models
PhoneAgentApp/        # Xcode project wrapper
Scripts/              # Build helpers
Frameworks/           # Built dependencies (llama.cpp, sherpa-onnx)

Troubleshooting

  • BlackHole not detected - Run brew install blackhole-2ch, then click "Check Installation"
  • Models not downloading - Check internet; retry from the download screen
  • LLM server won't start - Run Scripts/build-frameworks.sh again
  • No audio to caller - In FaceTime > Settings > Audio, set the microphone to "BlackHole 2ch"

Requirements

  • macOS 13.0+
  • Xcode 15+
  • ~3 GB disk space for models
  • Homebrew (for BlackHole installation)
