Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions docs/multimedia-processing.md
Original file line number Diff line number Diff line change
Expand Up @@ -204,6 +204,7 @@ Servers focused on generating or manipulating images, processing video, audio tr
- [neugence/whipscribe-mcp](https://github.com/neugence/whipscribe-mcp): MCP server for audio/video transcription via the Whipscribe API. Transcribes from a URL (YouTube, podcasts, direct media links) or local file; returns txt/json/srt/vtt/docx with optional speaker diarization and word timestamps. Hosted remote endpoint at `https://whipscribe.com/mcp` (Bearer auth) or local install via `uvx whipscribe-mcp`.
- [rendobar/mcp](https://github.com/rendobar/mcp): Official local stdio MCP server for Rendobar's media-processing API. Lets AI agents submit FFmpeg, transcode, caption, and watermark jobs and upload local files to the cloud backend. TypeScript, MIT licensed.

- [getcutpro/mcp](https://github.com/getcutpro/mcp): MCP server that exposes the full CutPro v1 API as 34 tools for AI clients — analyze videos, submit clipping jobs, manage clips, render, publish posts, and more. Supports stdio (local), Streamable HTTP (remote), and OAuth 2.1 (browser clients).
- [transloadit/node-sdk](https://github.com/transloadit/node-sdk/tree/main/packages/mcp-server): Agent-native media processing via Transloadit's 86+ Robots, supporting video encoding, image manipulation, document conversion, OCR, and speech transcription. Hosted or self-hosted via npx.
- [fasuizu-br/speech-ai-examples](https://github.com/fasuizu-br/speech-ai-examples): Speech AI MCP server with pronunciation assessment (phoneme-level scoring, 17MB model, <300ms), text-to-speech, and speech-to-text. 8 tools for AI agents building language learning, accessibility, and voice applications.
- [ssembleinc/ssemble-mcp-server](https://github.com/ssembleinc/ssemble-mcp-server): Create AI-generated short-form video clips from YouTube with captions, music, gameplay overlays, meme hooks, and viral scoring.