This project analyzes and predicts student performance in Mathematics based on socio-economic and academic factors using Machine Learning techniques and deploys it using Flask, AWS, and GitHub Actions for CI/CD.
β
Understanding the Problem
π Data Collection & Checks
π Exploratory Data Analysis (EDA)
π Data Pre-Processing
π€ Model Training & Evaluation
π Choosing the Best Model
π Deployment with Flask & AWS
Understanding how student performance (test scores) is influenced by:
π§βπ Gender
π Ethnicity
π Parental Education
π½ Lunch Type
π Test Preparation Course
π Categorical Features:
- Gender, Ethnicity, Parental Education, Lunch Type, Test Prep Course
π Numerical Features:
- Math Score, Reading Score, Writing Score
1οΈβ£ Data Collection & Preprocessing β Handling missing values, duplicates, outliers
2οΈβ£ EDA β Visualizing trends & correlations π
3οΈβ£ Feature Engineering β Encoding categorical data, scaling, splitting datasets
4οΈβ£ Model Training β Experimenting with multiple ML models
5οΈβ£ Model Selection β Comparing models using evaluation metrics
6οΈβ£ Web Interface Development β Flask-based UI for easy predictions
7οΈβ£ Deployment β Hosted on AWS EC2 with ECR & GitHub Actions for CI/CD
π Python (Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn)
π Jupyter Notebook β Model training & analysis
π€ Machine Learning (LinearRegression, RandomForestRegressor, GradientBoostingRegressor, KNeighborsRegressor, XGBRegressor, CatBoostRegressor, AdaBoostRegressor etc.)
π Flask β Web interface for user-friendly predictions
β AWS (EC2, ECR) β Cloud deployment & containerized infrastructure
β‘ GitHub Actions β CI/CD automation for deployment
π Key insights into factors affecting student performance
π Accurate ML model for predicting student math scores
π Web app for real-time score prediction
π Cloud-deployed solution for accessibility
π§ Shiva Prasad Naroju - [email protected]
Let me know if you need any further improvements! ππ₯