Swahili Spam Detection

A Flask-based web application for detecting spam messages in Swahili communications using machine learning. Maintains clean communication channels with real-time analysis and user feedback capabilities.

Features

Real-time Swahili message spam detection
User authentication system
Feedback submission and storage
Message history tracking
Machine learning model integration
Responsive web interface

Technical Stack

Backend: Python/Flask
Frontend: HTML5, CSS3, JavaScript
Machine Learning: scikit-learn (Pickle model)
Data Storage: JSON (users & feedback)
Styling: Custom CSS with Flexbox layout
Deployment: WSGI compatible

Project Structure

├── ssd/
│   ├── db/                 # JSON databases
│   │   ├── feedback.json
│   │   └── users.json
│   ├── logs/               # Application logs
│   ├── model/              # ML models
│   │   ├── newlyTrainedModel_27_jan_25/
│   │   └── swahiliSpamDetectionModel.pkl
│   ├── static/             # Static assets
│   │   ├── assets/         # Images
│   │   ├── js/             # JavaScript modules
│   │   ├── styles/         # CSS files
│   │   └── sweetalert/     # Alert library
│   ├── templates/          # Flask templates
│   │   ├── 404.html
│   │   ├── index.html
│   │   └── login.html
│   ├── app.py              # Main application
│   ├── wsgi.py            # WSGI entry point
│   └── requirements.txt    # Dependencies

Getting Started

Prerequisites

Python 3.8+
pip package manager
Modern web browser

Installation

Clone the repository

git clone https://github.com/patrick-paul/ssd.git
cd ssd

Create virtual environment

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies

pip install -r requirements.txt

Configure Environment

Create .env file in project root:

SECRET_KEY=your-secure-key-here

Generate a strong key using:

import secrets
print(secrets.token_hex(24))

Initialize databases

touch db/users.json db/feedback.json
echo "{}" > db/users.json
echo "{}" > db/feedback.json

Running the Application

python wsgi.py

Access the application at http://localhost:2001

Model Training

The spam detection model was trained using custom Swahili datasets. Training code and datasets are available in the separate repository: ssd-training Repository

Key training features:

Custom Swahili spam corpus
TF-IDF vectorization
Naive Bayes classifier
Model versioning system

Usage

Register a new account:
Enter Swahili text in the message input field
Get instant spam classification results
Provide feedback on detection accuracy using the feedback system

Configuration

Environment Variables

FLASK_ENV=development
FLASK_DEBUG=0
PORT=2001

Model Selection

Replace model/swahiliSpamDetectionModel.pkl with updated models

Styling

Modify CSS files in static/styles/

Contributing

Set up development environment:

pip install -r requirements.txt

Contribution guidelines:

Write tests for new features
Maintain JSON schema consistency
Update documentation accordingly
Follow PEP-8 standards

Known Issues

Limited concurrent user support
Model accuracy variance with regional dialects
Session management improvements needed

Future Improvements

📊 Real-time analytics dashboard
🔄 Model auto-update system
📱 Progressive Web App implementation

License

MIT License - See LICENSE.md for details

Contact

Development Team: [email protected]
Maintainer: @patrick-paul
Project Link: https://github.com/patrick-paul/ssd

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
db		db
model		model
static		static
templates		templates
.gitignore		.gitignore
LICENSE		LICENSE
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
app.py		app.py
ecosystem.config.js		ecosystem.config.js
folder-structure-1.md		folder-structure-1.md
requirements.txt		requirements.txt
wsgi.py		wsgi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Swahili Spam Detection

Features

Technical Stack

Project Structure

Getting Started

Prerequisites

Installation

Configure Environment

Running the Application

Model Training

Usage

Configuration

Environment Variables

Model Selection

Styling

Contributing

Known Issues

Future Improvements

License

Contact

About

Releases

Packages

Languages

License

patrick-paul/ssd

Folders and files

Latest commit

History

Repository files navigation

Swahili Spam Detection

Features

Technical Stack

Project Structure

Getting Started

Prerequisites

Installation

Configure Environment

Running the Application

Model Training

Usage

Configuration

Environment Variables

Model Selection

Styling

Contributing

Known Issues

Future Improvements

License

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages