Skip to content
View Elsayed91's full-sized avatar
  • Cairo

Block or report Elsayed91

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Elsayed91/README.md

Hello there! I'm Islam ✨✨

Islam's LinkedIn Outlook

🔧 Technologies & Tools

Python Apache Airflow DBT Great Expectations Kubernetes Terraform Spark Kafka Google Cloud AWS Azure PowerBi Looker Data Studio Streamlit Grafana Prometheus

🌱 About Me

Data Engineer, but I mainly work with/as Data/Dev/MLOps, so am I really a data engineer? Idk.

🚀 Featured Projects

  • NY Taxi Data & MLOps Pipeline: Automated data & MLOps pipeline leveraging Kubernetes and Apache Airflow. Integrates Spark, Kafka, and DBT with a focus on data quality. Tailors solutions for diverse user needs.
  • Xbox Data Scraping & Analysis Pipeline: Automated data-driven project leveraging Python, Airflow, and GKE. Scrapes diverse data sources, providing insights into Xbox hardware and game data.

📦 Packages

  • Easy Expectations: A python package that abstracts away the complexity of Great Expectations and allow for easy no-knowledge-required implementation for basic use cases.
  • SchemaDiff: A python package that efficiently detects files with inconsistent schemas amidst thousands of files by reading the parquet files metadata.
  • Order of The Template: A Python toolkit for parsing and processing YAML templates, capable of resolving Bash syntax environment variables and Jinja templating. It also offers schema validation functionality.

Pinned Loading

  1. taxi-data-pipeline taxi-data-pipeline Public

    Automated data & MLOps pipeline leveraging Kubernetes and Apache Airflow. Integrates Spark, Kafka, and DBT with a focus on data quality. Tailors solutions for diverse user needs.

    Python 5

  2. easy_ge easy_ge Public archive

    Simplified Data Validation and Quality Testing with Great Expectations

    Python 3

  3. oot oot Public

    Order Of The Template - a Python toolkit to parse YAML files and Jinja templates, resolve environment variables, and validate JSON schemas

    Python 1

  4. gcp-lite-rs gcp-lite-rs Public

    Lightweight HTTP client for Google Cloud Platform APIs

    Rust

  5. infracost-rs infracost-rs Public

    Library and CLI tool to get cloud asset pricing from Infracost's API

    Rust