Multiple Disease Prediction is an advanced Data Science project that leverages machine learning to predict the likelihood of multiple diseases, including Kidney Disease, Liver Disease, and Parkinsonβs Disease. The system aims to support early diagnosis, assist healthcare providers in decision-making, and reduce the time and cost of traditional diagnostics. The project integrates data preprocessing, machine learning model training, evaluation, and visualization using Streamlit and Power BI.
π multiple-disease-prediction
|
βββ π app/ # Streamlit application code
β βββ streamlit_app.py
|
βββ π data/
β βββ π raw/ # Original/raw datasets
β β βββ kidney_disease.csv
β β βββ liver_disease.csv
β β βββ parkinsons_disease.csv
β β
β βββ π cleaned/ # Cleaned/preprocessed datasets
β β βββ kidney_disease_cleaned.csv
β β βββ liver_disease_cleaned.csv
β β βββ parkinsons_disease_cleaned.csv
β
βββ π models/ # Trained ML models (saved as pickle files)
β βββ kidney_model.pkl
β βββ liver_model.pkl
β βββ parkinsons_model.pkl
|
βββ π notebooks/ # google colab notebooks for EDA & modeling
β βββ Kidney_Disease_Prediction.ipynb
β βββ Liver_Disease_Prediction.ipynb
β βββ Parkinsons_Prediction.ipynb
β
βββ π video/ # Project demo video
β βββ project_demo.mp4
β
βββ requirements.txt # Python dependencies
βββ README.md # Project documentation
βββ .gitignore # Ignore unnecessary files in git
βββ LICENSE # Open-source license for project
- Clone the repository
git clone https://github.com/Infant-Joshva/Project_4-Multiple-Disease-Prediction.git
cd Project_4-Multiple-Disease-Prediction- Install dependencies
pip install -r requirements.txt- Run the Streamlit app
streamlit run app/main.py- Multi-Disease Prediction: Predicts Kidney, Liver, and Parkinsonβs disease probability based on user-provided symptoms, demographics, and test results.
- Data Preprocessing: Handles missing values, encodes categorical features, and scales numerical data to improve model accuracy.
- Machine Learning Models: Trained using Logistic Regression, Random Forest, and XGBoost for robust predictions.
- Interactive Streamlit App: Users can input personal health data and instantly receive disease probability and risk level.
- Model Evaluation Metrics: Accuracy, Precision, Recall, F1-Score, ROC-AUC, MAE, RMSE.
- Scalable & Secure: Designed to handle multiple users and ensure privacy of sensitive health data.
- Visual Insights: Graphs and charts showing feature importance, probability distributions, and high-risk patient identification.
- Patients with abnormal test results have higher predicted disease probabilities.
- Multi-disease prediction enables prioritizing early diagnosis and treatment.
- Visualizations improve interpretability of model results and risk analysis.
- Healthcare providers can monitor trends across patient populations and identify high-risk groups.
Your Name
π§ [email protected]
π GitHub
π LinkedIn
If you find this project useful, please give it a β on GitHub!
This project is licensed under the MIT License - see the LICENSE file for details.



