Skip to content
View dannykhant's full-sized avatar

Block or report dannykhant

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dannykhant/README.md

Hi, I’m Danny

I am an Engineer specializing in Data and Platform Engineering. With over 12 years of experience, I design and scale production-grade systems across Banking, FMCG, and SME sectors. I specialize in developing scalable data architectures, machine learning workflows, and AI-driven solutions using cutting-edge technologies.

Technical Toolkit

  • Languages: SQL, Python, Java, Rust
  • Data Engineering: Medallion Architecture, Data Lakehouse, Distributed Processing (MPP), Data Modeling, Workflow Orchestration (DAGs), Change Data Capture (CDC), Open-Table Format
  • LLM Engineering: Retrieval-Augmented Generation (RAG), Large Language Models (LLM), Vector Databases, Agentic Workflows
  • MLOps: Model Training & Deployment, Hyperparameter Tuning, Feature Engineering, Model Monitoring
  • Infrastructure & Systems: Containerization, CI/CD, Infrastructure as Code (IaC), Cloud-Native Architectures, Linux Systems

Featured Projects

  • Open-source Data Platform
    Cloud-native Lakehouse architecture featuring Iceberg/Nessie for Git-like data versioning and Trino with KEDA/Prometheus for event-driven horizontal autoscaling of distributed query workers.

  • E-commerce Data Pipeline
    End-to-end analytics pipeline implementing a Medallion Architecture (Bronze/Silver/Gold) with declarative data pipelines and versioned data storage.

  • Weather Data Pipeline
    Automated ingestion engine for external API data into a cloud data warehouse, orchestrated using Directed Acyclic Graphs (DAGs) for workflow management.

  • Employee Churn Prediction
    Machine learning system featuring multiple classification models and a high-concurrency FastAPI inference service, fully containerized for deployment.

  • Face Recognition
    Deep learning project focused on model training, hyperparameter tuning, and embedding-based inference using neural networks.

  • Small Language Model
    Transformer-based model featuring a Rust-based inference service, ONNX quantization, and orchestration via local container clusters.

  • 5K Running Coach
    A production-ready RAG system utilizing hybrid search techniques and rigorous evaluation to provide expert running guidance.


Connect with me on LinkedIn
Or reach me at dannypmkhant@gmail.com

Pinned Loading

  1. data-platform-oss data-platform-oss Public

    Open-source Data Platform

    Jupyter Notebook

  2. dbt-jaffleshop dbt-jaffleshop Public

    Analytics Engineering

  3. ecomm-data-pipeline ecomm-data-pipeline Public

    Ecommerce End to End Data Pipeline with Databricks

    Python

  4. mlz-small-language-model mlz-small-language-model Public

    Small Language Model with Keras/Tensorflow

    Jupyter Notebook

  5. weather-data-pipeline weather-data-pipeline Public

    Weather Data Pipeline with Snowflake

    Python

  6. mlz-employee-churn-prediction mlz-employee-churn-prediction Public

    Employee Churn Prediction with XGBoost

    Jupyter Notebook