Skip to content

Tarunkumar0601/customer_behavior_analysis

Repository files navigation

📊 Customer Shopping Behavior Analysis

An end-to-end Data Analytics project that analyzes customer shopping patterns using Python, PostgreSQL, and Power BI to generate actionable business insights.


🔍 Project Overview

This project analyzes transactional retail data to understand customer behavior, spending patterns, product performance, and subscription impact.
The goal is to demonstrate a complete analytics workflow — from raw data to dashboard, report, and presentation.


📁 Dataset

  • ~3,900 customer purchase records
  • 18 features including demographics, purchase details, and shopping behavior

Key attributes

  • Age, Gender, Location, Subscription Status
  • Item Purchased, Category, Purchase Amount
  • Discount Applied, Review Rating, Shipping Type

Missing values were identified and handled during preprocessing.


🛠 Tools & Technologies

  • Python: pandas, numpy, matplotlib, seaborn
  • Database: PostgreSQL
  • Visualization: Power BI
  • Reporting: Analytical report (PDF)
  • Presentation: Gamma (AI-powered PPT)

🔄 Project Workflow

  1. Data Loading & EDA (Python)
    • Data inspection, summary statistics, visualization
  2. Data Cleaning & Feature Engineering
    • Missing value treatment
    • Column standardization
    • Customer segmentation & age grouping
  3. SQL Analysis (PostgreSQL)
    • Revenue and spending analysis
    • Product and customer behavior queries
  4. Dashboard Development (Power BI)
    • Interactive KPIs and filters
  5. Reporting & Presentation
    • Business insights report
    • Executive presentation using Gamma

🧮 SQL Analysis Highlights

  • Revenue by gender
  • High-spending discount users
  • Top-rated and top-selling products
  • Subscribers vs non-subscribers comparison
  • Revenue by age group
  • Shipping type impact on spending

📊 Power BI Dashboard

The interactive dashboard includes:

  • Customer count & average purchase KPIs
  • Revenue by category and age group
  • Subscription distribution
  • Sales and behavior trends

Designed for quick insight discovery and business decision support.


📈 Key Insights

  • Loyal customers form the largest segment
  • Certain products are highly discount-dependent
  • Subscribers and non-subscribers show similar average spend but different volumes
  • Young and middle-aged customers contribute the highest revenue

▶️ How to Run

Prerequisites

  • Python 3.x
  • PostgreSQL
  • Power BI Desktop

Steps

# Install dependencies
pip install pandas numpy matplotlib seaborn psycopg2
  • Run Python scripts for EDA and data cleaning
  • Load cleaned data into PostgreSQL
  • Execute SQL queries for analysis
  • Open Power BI file to explore the dashboard

📌 Outcome

  • This project showcases:
  • Strong data cleaning & EDA skills
  • Practical SQL for business analytics
  • Dashboard storytelling with Power BI
  • End-to-end data-to-decision workflow

👤 Author

Tarun Kumar S A Data Analytics | Python | SQL | Power BI

About

data analytics project to showcase customer behavior analysis using python, sql and power BI.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors