A beginner-friendly introduction to summary statistics with Python.
This repo contains two Jupyter notebooks: one for exercises and one with full solutions.
Demonstrations
β guided notebook with explanations, prompts, and exercises.Solutions and Answers
β full solutions notebook with worked-out answers.requirements.txt
β minimal dependencies to run the notebooks.
- Descriptive vs. Inferential statistics
- Data types: numeric, categorical, nominal, ordinal
- Measures of center: mean, median, mode
- Measures of spread: standard deviation, mean absolute deviation (MAD)
- Quartiles and interquartile range (IQR)
- Boxplots for visualization
- Mini-project: compare salaries and orders by department
-
Clone the repo
git clone https://github.com/yourusername/StatsCourse.git cd StatsCOurse
-
Install dependencies
pip install -r requirements.txt
-
Launch Jupyter Notebook
jupyter notebook
(But you can use any IDE you want)
-
Open notebooks
- Start with
stats_basics_exercises.ipynb
- Check your work with
stats_basics_solutions.ipynb
- Start with
The notebooks include:
- Hands-on Python with NumPy and Pandas
- Visualizations with Matplotlib
- Short reflections to reinforce learning
Suggestions and improvements are welcome!
Feel free to open an issue or submit a pull request.
Created as part of my learning journey through the
DataCamp Associate Data Scientist Certificate.