I’m a Data Engineer focused on designing and optimizing ETL/ELT pipelines, data models, and analytics platforms that make data reliable, accessible, and actionable. I enjoy turning messy, multi-source data into clean datasets that power products and decisions.
- 🔭 Current focus: Modern data stack (Python, SQL, dbt, Airflow), cloud data (Azure/AWS), and analytics enablement (Power BI)
- 📦 Background: SQL Server admin & pipeline automation at scale (26+ branches), ERP↔web API integrations, and BI dashboards
- 🌍 Open to: Data Engineer roles (remote/relocation)
- 💬 Ask me about: Data modeling, dbt, Airflow, SQL performance, Dockerized pipelines, Power BI
- 📫 Reach me:
bolajiemmanuel01official@gmail.com
Data Engineering
- Python • SQL • dbt • Apache Airflow • Data Modeling (Star/Snowflake) • ETL/ELT • Git • CI/CD • Docker
Databases / Warehousing
- SQL Server • PostgreSQL • BigQuery
Cloud & Infra
- Azure (Fabric, ADF, Synapse) • AWS (S3, Lambda, EMR) • Linux
BI & Analytics
- Power BI (DAX, data modeling) • Excel (advanced)
Exposure / Learning
- PySpark • Kafka • Kubernetes • Data Governance & Quality
Tech: Python, Airflow, dbt, PostgreSQL, Power BI, Docker
What it does: End-to-end pipeline from raw transactions → modeled warehouse → BI dashboards.
Highlights: Modular transformations in dbt; scheduled orchestration with Airflow; containerized & version-controlled; metrics for sales, AOV, cohorts.
🔗 Repo: https://github.com/Bolajiemmanuel01/Data-Engineering-projects/tree/main/online-retail-etl
Tech: Python (OOP/TDD), Django, PostgreSQL/PostGIS, Leaflet, Sentinel Hub API
What it does: Query, process, and visualize Sentinel-2 imagery with NDVI overlays & raster stacking.
Highlights: Clean, maintainable architecture; reproducible geospatial workflows; secure API interactions.
🔗 Repo: https://github.com/Bolajiemmanuel01/Python_projects/tree/main/gis_project/geopipeline_project
Tech: Python, Airflow, dbt, PostgreSQL, Grafana, GitHub Actions, Docker What it does: Ingests CoinGecko prices on a schedule → stores raw JSON in bronze → cleans/stages in silver → builds gold facts/views for analysis → publishes dbt docs → visualizes daily trends in Grafana. Highlights: Medallion architecture; incremental dbt models; data tests (freshness, not null, uniques); Airflow DAG with retries; CI that runs dbt build and publishes docs; provisioned Grafana dashboard; fully containerized. 🔗 Repo: https://github.com/Bolajiemmanuel01/Data-Engineering-projects/tree/main/crypto-price-pipeline 📚 Docs: https://bolajiemmanuel01.github.io/Data-Engineering-projects/crypto-price-pipeline/#!/overview
Tech: SQL Server, Python, REST APIs, Power BI, Cloud backup
What it does: ERP↔e-commerce sync for real-time price/inventory; automated SQL backups; BI dashboards.
Highlights: Reduced downtime; improved data reliability; enabled self-service analytics across 26+ branches.
🔗 (Private/commercial—summary only)
- Built production-style ELT with dbt + Airflow and shipped self-service dashboards
- Maintained SQL Server Enterprise with automated backups & integrity checks
- Enabled cross-team analytics with Power BI (sales, refunds, behavior)
- Roles where I can design and own reliable pipelines, warehouse models, and analytics layers that power products and decisions.
- Teams using (or moving toward) dbt + Airflow, modern cloud warehousing, and data as a product practices.
- Email: bolajiemmanuel01official@gmail.com
- LinkedIn: /in/emmanuel-bolaji-7a3466155
