Skip to content

πŸ“šThis project is a complete end-to-end data pipeline that involves scraping book data from an online bookstore and uncovering insights through SQL and Python analysis. Using tools such as BeautifulSoup, Pandas, and SQL, I collected information on book titles, prices, availability, and categories.

Notifications You must be signed in to change notification settings

AMANPATEL-1234/Book_Store_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“š Book Data Scraper & Insights Dashboards

A project where I scraped data from an online book website and analyzed it using Python, SQL, and data visualization tools to uncover key trends and insights.


πŸš€ Project Overview

This project demonstrates the complete data pipeline β€” from web scraping raw book data to extracting insights using SQL and Python. It includes:

  • Scraping book titles, prices, availability, ratings, and categories
  • Cleaning and structuring data using Python (Pandas)
  • Loading data into a SQLite/PostgreSQL database
  • Performing SQL queries to extract meaningful insights
  • Visualizing data patterns with Matplotlib/Seaborn/Plotly

πŸ› οΈ Tools & Technologies

Tool Purpose
Python Core scripting and data analysis
BeautifulSoup / Requests Web scraping
Pandas Data cleaning & manipulation
SQLite or PostgreSQL Data storage & SQL queries
Matplotlib / Seaborn Data visualization
Jupyter Notebook Project documentation

πŸ“ˆ Key Insights

Here are a few insights extracted:

  • πŸ’Έ Average book price across all categories
  • πŸ“Š Most common book categories
  • ⭐ Distribution of ratings
  • 🚫 Out-of-stock vs In-stock books
  • πŸ” Category-wise pricing trends

(More insights are available in the analysis notebook.)

About

πŸ“šThis project is a complete end-to-end data pipeline that involves scraping book data from an online bookstore and uncovering insights through SQL and Python analysis. Using tools such as BeautifulSoup, Pandas, and SQL, I collected information on book titles, prices, availability, and categories.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published