A full-fledged web application that records audio and classifies whether it's funny or not using a fine-tuned HuggingFace model. The app features real-time audio recording, file upload capabilities, and a modern, responsive UI.
- Real-time Audio Recording: Record audio directly in the browser
- File Upload Support: Upload audio files (.wav, .mp3, .m4a, etc.)
- AI Classification: Uses a fine-tuned HuBERT model for humor detection
- Modern UI: Beautiful, responsive design with real-time feedback
- Confidence Scores: Shows detailed confidence scores for predictions
- Audio Validation: Ensures minimum/maximum audio length requirements
The app uses a fine-tuned HuggingFace model (rishiA/humor_model_v4) that achieves 86% accuracy on humor detection tasks.
- Python 3.8 or higher
- pip (Python package installer)
- Modern web browser with microphone access
- Clone the repository

  ```bash
  git clone <your-repo-url>
  cd humorMe
  ```

- Install dependencies

  ```bash
  pip install -r requirements.txt
  ```

- Run the application

  ```bash
  python app.py
  ```

- Open your browser and navigate to `http://localhost:5000`
For production deployment, you can use Gunicorn:
```bash
gunicorn -w 4 -b 0.0.0.0:5000 app:app
```

- Click "🎤 Start Recording"
- Speak something funny or serious
- Click "⏹️ Stop Recording" when done
- Wait for the AI analysis
- Click "📁 Choose Audio File"
- Select an audio file from your device
- Wait for the AI analysis
- 😂 FUNNY: The AI detected humor in your audio
- 😐 NOT FUNNY: The AI detected serious/non-humorous content
- Confidence Score: How certain the AI is about its prediction
- Detailed Scores: Breakdown of funny vs not-funny probabilities
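The confidence score and detailed breakdown above come straight from the classifier's per-label scores. A minimal sketch of the post-processing, assuming the list-of-`{"label", "score"}`-dicts shape returned by a HuggingFace audio-classification pipeline (the `summarize` helper and the label names are illustrative, not the app's literal code):

```python
def summarize(scores):
    """Turn raw pipeline scores into the verdict shown in the UI.

    `scores` is a list of {"label": str, "score": float} dicts, the shape
    returned by a HuggingFace audio-classification pipeline.
    """
    best = max(scores, key=lambda s: s["score"])
    return {
        "prediction": best["label"],                                # FUNNY / NOT FUNNY
        "confidence": round(best["score"], 3),                      # top score
        "detail": {s["label"]: round(s["score"], 3) for s in scores},
    }
```

For example, scores of 0.82 (funny) vs. 0.18 (not funny) produce the 😂 FUNNY verdict with 82% confidence.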
- Framework: Flask with CORS support
- Audio Processing: librosa for audio preprocessing
- Model Integration: HuggingFace Transformers pipeline
- File Handling: Temporary file management for audio processing
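The temporary-file handling mentioned above can be sketched with the standard library; `with_temp_audio` and its signature are illustrative, not the app's actual helper:

```python
import os
import tempfile

def with_temp_audio(file_bytes, process, suffix=".wav"):
    """Write uploaded bytes to a temp file, run process(path), and always
    delete the file afterwards, even if processing raises."""
    fd, path = tempfile.mkstemp(suffix=suffix)
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(file_bytes)
        return process(path)
    finally:
        os.remove(path)
```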
- Audio Recording: Web Audio API with MediaRecorder
- File Upload: Drag-and-drop file input
- Responsive Design: Mobile-friendly interface
- Real-time Feedback: Status updates and progress indicators
- Input Validation: Check file format and duration
- Preprocessing: Resample to 16kHz, normalize length
- Model Inference: Pass through the fine-tuned HuBERT model
- Post-processing: Extract confidence scores and predictions
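The four steps above can be sketched as a single flow. The resampler and classifier are injected as plain callables standing in for librosa and the HuggingFace pipeline (which are not imported here), so this shows only the structure, not the app's literal code:

```python
TARGET_SR = 16_000  # the app resamples everything to 16 kHz

def classify_audio(raw_audio, src_sr, resample, classifier):
    """Run one clip through the app's processing flow.

    `resample` and `classifier` are illustrative stand-ins for librosa's
    resampler and the HuggingFace pipeline.
    """
    # 1. Input validation (file format/extension) happens before this point.
    # 2. Preprocessing: resample to 16 kHz if needed.
    audio = raw_audio if src_sr == TARGET_SR else resample(raw_audio, src_sr, TARGET_SR)
    # Enforce the 0.5 s minimum / 30 s maximum length window.
    if not (TARGET_SR // 2 <= len(audio) <= TARGET_SR * 30):
        raise ValueError("audio length out of range")
    # 3. Model inference: returns a list of {"label": ..., "score": ...} dicts.
    scores = classifier(audio)
    # 4. Post-processing: top label and its confidence.
    best = max(scores, key=lambda s: s["score"])
    return best["label"], best["score"]
```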
The app uses a fine-tuned version of Facebook's HuBERT model:
- Base Model: `facebook/hubert-large-ls960-ft`
- Task: Binary classification (funny vs. not funny)
- Training: Custom dataset with class weighting
- Performance: 86% accuracy on test set
The UI uses CSS custom properties and can be easily customized by modifying the styles in templates/index.html.
To use a different model, update the model name in app.py:
```python
classifier = pipeline("audio-classification", model="your-model-name")
```

Modify audio length constraints in app.py:
```python
# Minimum length (0.5 seconds = 8,000 samples at 16 kHz)
if len(audio_data) < 8000:
    ...
# Maximum length (30 seconds = 480,000 samples at 16 kHz)
if len(audio_data) > 480000:
    ...
```

- Model Caching: Model is loaded once at startup
- Temporary Files: Automatic cleanup of processed audio files
- Audio Compression: Optimized audio preprocessing pipeline
- Async Processing: Non-blocking audio classification
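The model-caching point above amounts to a load-once wrapper: the expensive `pipeline(...)` call runs a single time and every later request reuses the cached object. A minimal dependency-injected sketch (`make_cached_loader` is illustrative; in app.py the same effect comes from creating the pipeline once at startup):

```python
def make_cached_loader(load):
    """Wrap an expensive loader so it runs at most once; later calls
    return the cached object. `load` would be the pipeline(...) call."""
    cached = None
    def get():
        nonlocal cached
        if cached is None:
            cached = load()
        return cached
    return get
```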