This project provides a complete workflow for cleaning, imputing, evaluating, sorting, and visualizing datasets. It is designed to help data analysts explore and prepare data for deeper insights and modeling.
- β Imputation β Handling missing values using statistical and model-based methods.
- π Evaluation β Comparing imputation results and measuring their effectiveness.
- π§½ Cleaning β Removing duplicates, fixing inconsistencies, and filtering out noisy data.
- π Sorting & Filtering β Organizing data for clearer analysis.
- π Visualization β Generating informative charts using Matplotlib/Seaborn to identify trends.
- π§βπ» Streamlit UI (optional) β Interactive interface to explore data visually.
if an error occur for the bokeh library:
pip install --force-reinstall --no-deps bokeh==2.4.3
do not forget:
pip install -r requirements.txt
- python
- streamlit
- pandas
- numpy
- sklearn
- matplotlib
- bokeh
- altair
- plotly
- io
This project is ideal for practicing:
Real-world data preprocessing
Missing value treatment
Evaluation of imputation strategies
Clear and clean visualization for storytelling