Formula-1DM2 is an advanced data analytics and visualization project focused on the world of Formula 1 racing. The project provides comprehensive historical analysis, deep dives into driver and constructor dominance, and a unique real-time data streaming dashboard built using Apache Kafka and Tableau.
This repository began with the goal to answer one of the most exciting questions in modern motorsport:
How did Max Verstappen manage to beat Lewis Hamilton, who stood on the brink of a record 8th world title, in the controversial 2021 F1 season?
But we didn't stop there. Formula-1DM2 explores:
- The rise and rivalry of F1's greatest drivers and constructors.
- The evolution of dominance in Formula 1 across decades.
- The power of real-time analytics, demonstrated by streaming F1 data pipelines to Tableau dashboards.
- Historical Data Analysis: Compare the most dominant drivers and constructors since the inception of Formula 1.
- 2021 Title Showdown: Special focus on the dramatic final race between Lewis Hamilton and Max Verstappen.
- Streaming Data Pipeline: Real-time data flow from Kafka to Tableau, enabling up-to-the-minute race analytics and dashboards.
- Rich Visualizations: Interactive Tableau dashboards, insightful plots, and storytelling with data.
- Reproducible Analysis: All code is provided in Jupyter notebooks for transparency and learning.
Formula-1DM2/
│
├── README.md # Project overview, instructions, credits
│
├── data/ # Datasets (raw and processed)
│ ├── Node-Link Tree Data_Main Dataset.csv
│ ├── Node-Link Tree Data_Main Dataset111.csv
│ ├── Node-Link.csv
│ ├── results (F1 Results 2021)_results.csv
│ └── results (F1 Results 2021)_results.xlsx
│
├── images/ # All visual assets and screenshots used in analysis
│ ├── F1new_logo.jpeg
│ ├── LewisvsMax1.png
│ ├── Screenshot 2023-11-14 at 11.45.10.png
│ ├── Screenshot 2023-11-14 at 12.13.02.png
│ ├── Screenshot 2023-11-14 at 22.13.36.png
│ ├── Screenshot 2023-11-14 at 22.16.49.png
│ ├── Screenshot 2023-11-14 at 23.47.05.png
│ ├── Screenshot 2023-11-14 at 23.52.33.png
│ ├── Screenshot 2023-11-14 at 23.59.13.png
│ ├── constructors.webp
│ ├── f1_Logo.jpeg
│ └── f1new.avif
│
├── notebooks/ # Jupyter notebooks for analysis & EDA
│ └── Exam.ipynb
│
├── presentations/ # Presentation slides for sharing findings
│ └── Dataviz 2 Day 1.pptx
│
├── scripts/ # Utility scripts (e.g., Kafka command reference)
│ └── KafkaCommands
│
└── visualizations/ # Tableau workbooks and dashboards
├── Formula1_StaticDashboard.twb
├── Formula1_StreamingDashboard.twbx
└── Superstore.twbx
git clone https://github.com/turkeshp/Formula-1DM2.git
cd Formula-1DM2pip install -r requirements.txt- Open and run the notebooks in the
notebooks/directory for detailed data exploration and visualizations. - Find Tableau dashboards and workbooks in
visualizations/for interactive data stories. - Investigate the data streaming and Kafka pipeline in the
scripts/folder.
- Set up Apache Kafka on your machine or cloud.
- Use scripts from the
scripts/folder to push live or simulated F1 data streams. - Connect Tableau to your Kafka data source and open dashboards in the
visualizations/folder for real-time analytics.
- Official Formula 1 race results (CSV & Excel)
- Onyx Data - DataDNA Dataset Challenge: F1 Results
- Custom, hand-assembled datasets for driver and constructor comparisons
Note: Some datasets are large. For privacy or licensing, you may need to download them from official sources.
- Driver vs. Driver showdowns (e.g., Lewis Hamilton vs. Max Verstappen)
- Constructor dominance over time
- Real-time race dashboards (using Tableau + Kafka)
- Annotated screenshots and images in
images/
- Python (Pandas, NumPy, Matplotlib, etc.)
- Jupyter Notebook for analysis & rapid prototyping
- Apache Kafka for real-time/streaming data
- Tableau for professional dashboards
- PowerPoint for presentations
Pull requests, suggestions, and collaborations are welcome. Please open an issue if you have ideas or want to report bugs.
This repo is licensed under the MIT License (see LICENSE for full text).
Created by turkeshp
Feel free to connect for F1 data discussions or analytics collaborations!