BeanMeApp SuperCafe Project

Elevator Pitch

As BeanMeApp, we’re tasked with building a scalable, automated ETL pipeline for our client, SuperCafe, a rapidly growing café chain with hundreds of outlets. Each night, transaction data in CSV files will be uploaded from every branch, and our system will extract, transform, and load this data into a centralized cloud-based data warehouse. From there, we’ll integrate with Grafana to provide real-time dashboards and reports, giving SuperCafe critical insights across all branches. We’ll utilize AWS services like S3 and Redshift to ensure everything is scalable and efficient. Our goal is to streamline SuperCafe's data processes, empower their decision-making, and support their rapid growth without relying on manual reporting.

Team Members

Iman Howard
Mazin Ibrahim
Atalay Erkul
Nathan Grant
Dario Li Causi
Avi Bercovich

Smart Goals

Goal 1

By the end of the first sprint (week 1), the BeanMeApp team will complete detailed UML diagrams, including system architecture, data flow, and the ETL pipeline components, to clearly outline the structure and flow of the data processing system.

Goal 2

By the end of the second sprint (week 2), the team will design and implement the initial data ingestion process to read CSV files from SuperCafe branches and store them in a local or temporary storage system, ensuring error-free data capture for at least 3 branches.

Goal 3

By the end of the second sprint (week 2), the team will define and implement basic data transformation rules to clean and structure the raw data, ensuring it is in a usable format for further processing in future sprints.

Way of Working

Team name:

BeanMeApp

Development process:

Agile Methodology (Scrum)

Definition of done:

As per the Product Owner’s request, ‘done’ is no more than a working ETL pipeline with a Grafana data visualisation of some sort.

Design process:

Initial meetings
Add/review/split tickets in GitHub Projects
Review the CSV data
Data modelling
Research
Wireframing and initial design concepts
Prototyping and Iteration
Feedback and Refinement

Stand up time, location, structure:

Time:

When: Stand-ups will be held every morning at 9:00 AM (or another agreed-upon time, depending on your timezone and team preferences).
Why: This gives everyone a clear start to the day, ensuring we’re all on the same page before diving into our tasks.
Duration: The stand-up should take around 15 minutes max — just a quick sync to check in with each other.

Where:

We’ll conduct the stand-up virtually via Teams

Structure:

Each team member will answer the three main questions:

What did you accomplish yesterday?
What are you working on today?
Are there any blockers or challenges?

Retro time and cadence:

Friday afternoons

Team Principles:

Transparency
Collaboration
Proactive Problem-Solving
Respect for Time
Ownership and Accountability
Continuous Learning and Feedback
Empathy and Respect
Eat as many Apricot Danish as we can!

Tools:

AWS S3 Bucket
AWS Redshift
AWS EC2
AWS Lamda
Python
Github
VS Code

Notes on Code:

Tabs are 4 spaces
A 5-line Python comment like below should be added to the top of each source file:

# my_source_file.py  
#  
# WizzbangFeatureSuperMajorImportantCoolness  
#  
# DE Final Project week XX, April 2025, Guido van Rossum

Import the typing module and use type-hinting
When submitting a pull request make sure you've removed any and all 'testing' code you've commented-out.
Break long function argument lists into a vertical list, like so:

def the_function(first_arg:  str,  
                 second_arg: int,  
                 third_arg:  list,  
                 fourth_arg: bool)

Defenition of Done (Tickets)

All tasks at a given step of the project need to be expressed as a GitHub project ticket and each ticket processed as ‘done’. In the (likely) case of expressed tickets still being marked as ‘in progress’, a short comment as the reason of not doneness must be present.

Directory structure

./doc - Project documantation
./data - Data source files
./src - Applicaation source code
./tooling - Various helpers for project tooling

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
doc		doc
src		src
tests		tests
tooling		tooling
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

BeanMeApp SuperCafe Project

Contents

Elevator Pitch

Team Members

Smart Goals

Goal 1

Goal 2

Goal 3

Way of Working

Team name:

Development process:

Definition of done:

Design process:

Stand up time, location, structure:

Retro time and cadence:

Team Principles:

Tools:

Notes on Code:

Defenition of Done (Tickets)

Directory structure

High-Level Component Diagram

MoSCoW Diagram

Importance (Flex)

ER Diagram (3NF)

ETL Pipeline Architecture

AWS Architecture

Data in Postgres Database

Grafana Demo

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages