Field Cut

From field tape to rough cut in one place.

A local web app for audio journalism production. Upload a field interview, get a timestamped transcript with speaker detection, mark clips, cut them, add narration, and assemble a rough cut — all in the browser.

Built for radio journalists and podcast producers who work with field recordings and want to go from raw interview to rough cut without jumping between five different tools.

What it does

Transcribe — Upload a WAV/MP3 interview. Whisper transcribes it with word-level timestamps. Supports Hebrew, English, Arabic, and more.
Speaker detection (optional) — Automatically identifies who's talking and color-codes speakers throughout the UI.
Mark clips — Click words in the transcript to mark clip boundaries. Clips are numbered automatically.
Cut — One click cuts the source audio into separate WAV files using ffmpeg.
Narration (optional) — Upload your narration recording, transcribe it, and mark narration clips the same way.
Assemble — Drag interview clips and narration into order, hit assemble, get a single rough-cut WAV.
Export — Download clips as a ZIP, transcript as a Word doc (with speaker colors), or the final rough cut.

Features

Waveform visualization with zoom, pan, and clip regions
Speaker diarization with color-coded badges (rename or reassign speakers)
Clip boundary trimming (fine-tune start/end times)
Paper edit export (Word doc matching your assembly order)
Multi-project support (save, load, duplicate projects)
Export folder — auto-copy outputs to Google Drive, Dropbox, or any local folder
Bilingual UI (English / Hebrew), easy to add more languages
Built-in demo interview to try the full pipeline without your own audio
First-run setup wizard for API keys
Dark and light themes
Works on macOS, Windows, and Linux

Requirements

Python 3.9+
ffmpeg installed and in your PATH
An OpenAI API key (for Whisper transcription)
Optional: HuggingFace token (for speaker detection)

Quick start

# Clone the repo
git clone https://github.com/shaulams/FieldCut.git
cd FieldCut

# Create a virtual environment
python3 -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Optional: speaker detection (pulls ~2GB of PyTorch models)
pip install -r requirements-speaker.txt

# Run
python app.py

Open http://localhost:5555 in your browser. On first run, the app will ask for your API keys.

Or try the built-in demo — click "Try a demo interview" to experience the full pipeline without uploading your own audio.

How much does it cost?

The only cost is OpenAI's Whisper API:

$0.006 per minute of audio
A 60-minute interview costs about $0.36

Speaker detection runs locally on your machine (free). Everything else is local too.

Speaker detection setup

Speaker detection uses pyannote and requires a free HuggingFace account:

Create a token at huggingface.co/settings/tokens (Read permission)
Accept the terms for these models:
Enter the token in the setup wizard, or add HUGGINGFACE_TOKEN=hf_... to your .env file

On Apple Silicon Macs, speaker detection uses the GPU automatically for faster processing.

Tech stack

Backend: Python / Flask
Frontend: Single HTML file, vanilla JS (no build step, no framework)
Audio processing: ffmpeg
Transcription: OpenAI Whisper API
Speaker detection: pyannote.audio (runs locally)
State: JSON file (no database needed)

Adding a new language

Field Cut ships with English and Hebrew. Adding a new language takes ~10 minutes:

Open static/lang.js
Copy the en block and paste it as a new key (e.g. fr for French, ar for Arabic)
Translate every string value — keep the keys unchanged
Set _meta.name to the language's own name (e.g. "Francais") and _meta.dir to "ltr" or "rtl"
The language picker in the top bar will automatically include it

Contributing

Contributions are welcome! Feel free to open issues or submit pull requests.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
.github		.github
demo		demo
docs/plans		docs/plans
static		static
templates		templates
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements-speaker.txt		requirements-speaker.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Field Cut

What it does

Features

Requirements

Quick start

How much does it cost?

Speaker detection setup

Tech stack

Adding a new language

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Field Cut

What it does

Features

Requirements

Quick start

How much does it cost?

Speaker detection setup

Tech stack

Adding a new language

Contributing

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages