Skip to content

A Repo of Data Analysis projects including data exploration, visualization, statistical insights, and PySpark workflows. Demonstrating real-world problem solving with Python, Pandas, Matplotlib, and advanced analytical techniques.

Notifications You must be signed in to change notification settings

Hamzi275/data-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

32 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“Š Data Analysis Projects – General Repository

Welcome to my Data Analysis Hub πŸš€ – a curated collection of my analytical projects where I explore, clean, visualize, and interpret diverse datasets using Python, Pandas, Matplotlib, Seaborn, and PySpark.

This repository combines multiple standalone projects into one central hub, showcasing my journey in data wrangling, statistical exploration, feature engineering, and storytelling with data.


πŸ”₯ Projects Included

  1. Data Exploration
    Basic data cleaning, descriptive statistics, and pattern discovery.

  2. COVID Data Exploration
    Analyzing global COVID-19 datasets with trend insights and visualizations.

  3. World Happiness Analysis
    Investigating happiness scores, socio-economic factors, and correlations.

  4. PySpark Data Analysis
    Big data processing and exploration using Apache Spark with Python.


πŸ› οΈ Tech Stack

  • Languages: Python, SQL
  • Libraries: Pandas, NumPy, Matplotlib, Seaborn, Plotly
  • Big Data: PySpark, Hadoop Ecosystem
  • Tools: Jupyter Notebook, Google Colab

πŸ“ˆ Key Highlights

  • End-to-end exploratory data analysis (EDA)
  • Data cleaning and preprocessing pipelines
  • Advanced visualization techniques for storytelling
  • Real-world datasets with practical insights
  • PySpark workflows for large-scale analysis

🀝 Contribution

Want to collaborate? Fork the repo, create a branch, and submit a PR. Suggestions are always welcome!


πŸ“¬ Contact


✨ Exploring data, uncovering stories, and making sense of the numbers.

About

A Repo of Data Analysis projects including data exploration, visualization, statistical insights, and PySpark workflows. Demonstrating real-world problem solving with Python, Pandas, Matplotlib, and advanced analytical techniques.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •