End‑to‑end data pipelines built with modern data engineering tools
I’m transitioning into Data Engineering with a strong focus on building real-world, production-style data systems.
Instead of following tutorials, I designed pipelines that reflect how modern companies operate:
- Batch ETL & ELT workflows
- Data warehouse modeling
- Real-time event streaming
- Cloud-native ingestion
- Fully Dockerized, reproducible environments
Each project is structured like a real engineering repository.
📚 Table of Contents
Below are the four projects that make up my portfolio.
Each link takes you directly to the project’s README.
API ingestion → Raw → Staging → Mart → Dashboard
- Daily API extraction (Python)
- BigQuery raw + mart layers
- dbt transformations
- Airflow orchestration (Dockerized)
📂 View Project: Marketing ETL Pipeline
Star schema → Fact tables → Dimensions → Cohort analysis
- Dimensional modeling
- dbt staging + marts
- LTV, retention, and cohort metrics
📂 View Project: E-Commerce Data Warehouse
Simulated events → Kafka producer → Consumer → Warehouse
- Kafka streaming ingestion
- Python consumer
- Near real-time analytics
📂 View Project: Real-Time Event Pipeline
Serverless ingestion → Cloud Storage → BigQuery
- Cloud Functions
- Scheduled ingestion
- Serverless transformations
📂 View Project: Cloud-Native Pipeline
- Workflow orchestration (Airflow)
- ELT pipelines (Python → BigQuery → dbt)
- Data modeling (staging, marts, star schema)
- Streaming ingestion (Kafka)
- Cloud-native design (GCP)
- Dockerized development
- Version control (Git)
- Dependency pinning
- Modular code structure
- Logging & monitoring
- dbt tests (unique, not null, relationships)
- Incremental models
- Idempotent loads
- Retry logic in Airflow
- Metric definitions (CTR, CPC, ROAS, etc.)
- Dashboard-ready tables
- Partitioning & clustering
- Cost optimization in BigQuery
This repository is the landing page for all my data engineering work.
Each project is fully documented and reproducible.
Data & Marketing professional transitioning into Data Engineering.