I'm Leo, a data engineer at P&G working on the customer data pipeline.
You can view my personal projects on my site, https://leobocci.pages.dev/, or in my GitHub repos.
- 🔨 Currently working on:
  - Real-time lakehouse from Kafka to Databricks UC with Spark streaming
  - Python CI/CD (GitHub Actions, Ruff, uv packaging, Databricks Asset Bundles)
  - Benchmarking innovative projects (Polars/DuckDB vs. Spark for cloud infrastructure cost savings)
  - Metadata & data quality (Great Expectations, Soda Core, data contracts, configuration as code)
- ⚡ Currently learning:
  - Rust



