Skip to content
View abdulrahman-khan's full-sized avatar
📊
📊

Block or report abdulrahman-khan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
abdulrahman-khan/README.md

Hello!

website · LinkedIn · dev profile · Kaggle

Previously:

  • Data Analyst @ Lululemon Athletica
  • Data Engineer @ Interalo
  • Data Programming Analyst @ S&C Electric Canada
  • Systems Analyst (ETL Testing) @ Ontario Public Service (Ministry of Natural Resources)
  • Backend Engineer @ IServeCorp (TradeXPod)

Projects:

  • FFXIV Party Finder GCP Data Warehouse - Production-grade medallion data warehouse on GCP featuring automated web scraping with Cloud Run, Terraform IaC, Dataform incremental transformations, and hourly analytics on duty popularity across regions.
  • Web Events Ingestion & Analytics - Data streaming pipeline ingesting web traffic from Kafka producers and processing with Apache Flink
  • ElectroNova - MapReduce, Spark analytics, and Sqoop MySQL integration for housing smart meter data analysis
  • Spark Analysis & Functional Testing - Spark data pipelines with SCD transformations, Iceberg storage, and unit testing via Chispa
  • DataPipelinesSQL - Advanced SQL pipeline implementations covering modern data engineering design patterns
  • Commerce Solutions - End-to-end data pipeline and workflows across the Microsoft stack (SQL Server, SSRS, PowerBI, C#)
  • Parsio - Job application tracker + AI job parsing
  • Video Game REST API - C# REST API with JWT auth, Entity Framework, and SQL Server database
  • Mississauga Library Booker - Activity (room) booking web application for the Mississauga library system

Pinned Loading

  1. ff14-partyfinder-analytics ff14-partyfinder-analytics Public

    GCP FFXIV Party Finder real-time data warehouse. Scrapes live listings every 15 minutes from xivpf.com, loading 1M+ records into a medallion (bronze/silver/gold) BigQuery warehouse using Cloud Run,…

    HCL

  2. Spark-Analysis-and-Functional-Testing Spark-Analysis-and-Functional-Testing Public

    Apache Spark data processing pipeline with Jupyter notebook analysis and unit testing for Slowly Changing Dimension (SCD) transformations. Utilzing Apache Iceberg for storage and Chispa for testing

    Jupyter Notebook

  3. ImplementingBigData_ElectroNova ImplementingBigData_ElectroNova Public

    Big Data storage and analytics solution using Hadoop MapReduce, Apache Spark analytics, HiveQL queries, Flume data pipelining, and Sqoop MySQL integration. Demonstrates scalable data processing wor…

  4. Real-Time-Web-Events-Pipeline Real-Time-Web-Events-Pipeline Public

    Real time web data streaming pipeline. Ingests web traffic data from Kafka producers, processing them using Apache Flink.

    Python