Overview

This project focuses on audio similarity detection using machine learning and signal processing techniques. Using the Python libraries Librosa and Resemblyzer, the system extracts voice embeddings and computes the cosine similarity between audio files. It addresses the challenge of distinguishing between similar voices while accounting for noise variations. MFCC and chroma features are also extracted to improve accuracy. The model is fine-tuned to identify the same speaker's voice across noisy and clean audio samples.
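As a rough illustration of the comparison step, cosine similarity between two embedding vectors can be computed with NumPy alone. This is a minimal sketch, not the code from audiomatching.py; the function name `cosine_similarity` and the choice to return 0.0 for a zero-length vector are assumptions made for this example.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine of the angle between two 1-D vectors, in the range [-1, 1]."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0.0:
        # Assumption for this sketch: treat an all-zero vector as "no similarity".
        return 0.0
    return float(np.dot(a, b) / denom)
```

A score close to 1 means the two vectors point in nearly the same direction, which for voice embeddings suggests the same speaker. Where to place the decision threshold depends on the data and is not specified here.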

Features

Audio preprocessing and feature extraction using Librosa
Voice embedding generation with Resemblyzer
Cosine similarity calculation for audio comparison
MFCC and chroma feature extraction to enhance comparison accuracy
Robustness to noisy audio samples
Visualization of waveforms, spectrograms, and similarity scores
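The MFCC and chroma extraction listed above might look roughly like the following. This is a sketch under assumptions, not the project's actual implementation: the helper names `summarize_features` and `extract_features`, the 16 kHz sample rate, the 13 MFCC coefficients, and the mean/standard-deviation pooling are all illustrative choices. The Librosa calls used (`librosa.load`, `librosa.feature.mfcc`, `librosa.feature.chroma_stft`) are part of the library's public API.

```python
import numpy as np


def summarize_features(mfcc, chroma):
    """Pool frame-level features (n_features x n_frames) into one fixed-length vector.

    Hypothetical helper: concatenates the per-coefficient mean and standard
    deviation over time for both feature matrices.
    """
    parts = [
        mfcc.mean(axis=1),
        mfcc.std(axis=1),
        chroma.mean(axis=1),
        chroma.std(axis=1),
    ]
    return np.concatenate(parts)


def extract_features(path, sr=16000, n_mfcc=13):
    """Load a .wav file and return its pooled MFCC + chroma vector.

    The sample rate and n_mfcc defaults are assumptions for this sketch.
    """
    import librosa  # imported here so summarize_features works without Librosa

    y, sr = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)  # shape (n_mfcc, frames)
    chroma = librosa.feature.chroma_stft(y=y, sr=sr)        # shape (12, frames)
    return summarize_features(mfcc, chroma)
```

Pooling over time gives every file a vector of the same length regardless of duration, so two files can be compared directly with cosine similarity.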

Requirements

Python 3.7 or later is recommended.

Installation

Install the required libraries by running:

  pip install numpy

  pip install librosa

  pip install seaborn

  pip install matplotlib

  pip install resemblyzer

Download or clone the repository from GitHub.

  git clone https://github.com/5W4PN1L/Audio-Matching-Tool.git

Usage

Save the audio files in .wav format.
Place your target audio file in the project directory and name it aud1.wav, or modify the script to use your own file name.
Place reference audio files (e.g., audio1.wav, audio4.wav) in the same directory, or modify the script to use your own file names.
Run the audiomatching.py script to compute the similarity between the target and reference audio files.

To execute the script:

  python audiomatching.py
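For orientation, the comparison the script performs could be sketched as below. This is an assumption about the general shape of the workflow, not a copy of audiomatching.py: `rank_by_similarity` and `compare_files` are hypothetical names. `VoiceEncoder`, `preprocess_wav`, and `embed_utterance` are the Resemblyzer entry points for producing a speaker embedding from a file.

```python
from pathlib import Path

import numpy as np


def rank_by_similarity(target_embed, reference_embeds):
    """Return (name, score) pairs sorted from most to least similar to the target."""
    t = np.asarray(target_embed, dtype=float)
    scores = {}
    for name, emb in reference_embeds.items():
        e = np.asarray(emb, dtype=float)
        denom = np.linalg.norm(t) * np.linalg.norm(e)
        # Cosine similarity; 0.0 if either vector has zero length (sketch assumption).
        scores[name] = float(np.dot(t, e) / denom) if denom else 0.0
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


def compare_files(target_path, reference_paths):
    """Embed the target and each reference file, then rank the references."""
    # Imported lazily: constructing VoiceEncoder loads a pretrained model.
    from resemblyzer import VoiceEncoder, preprocess_wav

    encoder = VoiceEncoder()
    target = encoder.embed_utterance(preprocess_wav(Path(target_path)))
    refs = {
        str(p): encoder.embed_utterance(preprocess_wav(Path(p)))
        for p in reference_paths
    }
    return rank_by_similarity(target, refs)
```

With the file names from the steps above, `compare_files("aud1.wav", ["audio1.wav", "audio4.wav"])` would return the reference files ordered by their similarity to the target.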

Authors

B.Swapnil
