Enterprise Data Pipeline An end-to-end data engineering project simulating enterprise-scale customer transaction processing using SQL, PySpark, and AWS-style architecture. Focused on data modeling, ETL pipelines, performance optimization, and scalability.
Focus areas:
- Data modeling
- ETL pipelines
- SQL & PySpark transformations
- Scalable and reliable data processing