Skip to content
View Cinemachina's full-sized avatar

Block or report Cinemachina

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

MusicGen

28 repositories

so-vits-svc fork with realtime support, improved interface and more features.

Python 8,914 1,181 Updated Feb 26, 2025

The PyTorch-based audio source separation toolkit for researchers

Python 2,330 428 Updated Jan 11, 2025

A PyTorch-based Speech Toolkit

Python 9,409 1,439 Updated Feb 25, 2025

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,444 763 Updated Feb 24, 2025

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,479 92 Updated Oct 31, 2024

A simple GUI application that slices audio with silence detection

Python 1,304 172 Updated Jul 29, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,554 2,250 Updated Jan 15, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,920 835 Updated Feb 24, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 27,328 3,885 Updated Nov 24, 2024

The official Python API for ElevenLabs Text to Speech.

Python 2,402 288 Updated Feb 25, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,065 4,370 Updated Aug 19, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,192 379 Updated Dec 18, 2024

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Python 2,792 294 Updated Feb 17, 2025

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 2,025 253 Updated Feb 12, 2025

An easy to understand TTS / SVS / SVC framework

Python 683 90 Updated Feb 3, 2025

Text-to-Audio/Music Generation

Python 2,375 187 Updated Sep 29, 2024

Collaborative Programmable Music

Clojure 5,971 448 Updated Jan 22, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,990 4,745 Updated Aug 16, 2024
Python 137 13 Updated Jul 12, 2024

A webui for different audio related Neural Networks

Python 1,128 106 Updated Aug 16, 2024

A collection of pre-trained, state-of-the-art models in the ONNX format

Jupyter Notebook 8,295 1,434 Updated Apr 30, 2024

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Python 1,206 291 Updated Feb 15, 2025

DiffSinger dataset processing tools, including audio processing, labeling.

C++ 53 7 Updated Nov 13, 2024

A collection of neural vocoders suitable for singing voice synthesis tasks.

Python 107 10 Updated Jan 19, 2025

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python 2,139 340 Updated Feb 23, 2025

Versatile AI-driven audio upscaler to enhance the quality of any audio.

Python 95 9 Updated Jan 10, 2025

Collection of the best Applio plugins.

Python 29 7 Updated Sep 12, 2024