Freelance Data Engineer — Turning messy data into clarity for projects with real impact.
I work with purpose-driven teams to build data systems they can trust.
- 📊 Centralise scattered data into a single source of truth
- ⚙️ Automate cleaning & validation for always-ready data
- 🚀 Design efficient ETL/ELT pipelines (Airflow, dbt, Spark…)
- 📈 Build solid foundations for BI, ML & GenAI
- ⏱ Create real-time dataflows when speed matters
Python SQL Bash Git GitHub Poetry Pylint Pandas NumPy
Apache Airflow Cloud Composer (GCP) MWAA (AWS) dbt Fivetran Airbyte Prefect Apache Spark PySpark Apache Beam Dataflow (GCP) Dataproc (GCP) Spark Structured Streaming Apache Kafka Google Pub/Sub Apache NiFi Web scraping
Amazon S3 Google Cloud Storage Parquet BigQuery Snowflake Amazon Redshift Amazon Athena PostgreSQL MongoDB Cassandra ClickHouse
Amazon EC2 Google Compute Engine Terraform (IaC) Docker Docker Compose GitHub Actions (CI/CD) IAM / RBAC
Generative AI Large Language Models OpenAI API LangChain (RAG) Hugging Face Transformers NLTK spaCy scikit-learn PyTorch TensorFlow SPARQL AWS SageMaker
Matplotlib Seaborn Plotly Amazon QuickSight Apache Superset
Since 2021, I've worked in data across tech, banking, and large-scale systems (Amazon, Slido/Cisco).
In 2025, I went freelance to focus on projects with real impact — from healthtech and edtech to any sector that values purpose as much as results.
I also donate 10% of my earnings to the GiveWell Top Charities Fund.
💼 Portfolio request → LinkedIn
📩 Let’s connect and discuss how to make your data work better.



