Nikhil-Doye/README.md

👋 Hi, I'm Nikhil Doye

🚀 AI Engineer | Data Engineer | Software Engineer

🎓 MS in Artificial Intelligence, Northeastern University
💼 AI Engineer @ IpserLab
🌎 Open to AI Engineer, Data Engineer, and Software Engineer roles (Visa Sponsorship)

I build production AI systems that combine LLMs, data pipelines, and scalable backend infrastructure.

My interests include:

  • AI Agents & LLM Systems
  • Retrieval Augmented Generation (RAG)
  • Data Engineering Pipelines
  • Backend Infrastructure for AI systems
  • Applied Machine Learning

🧠 Tech Stack

AI / Machine Learning

Python • PyTorch • Scikit-learn • MLflow • LangChain • LangGraph • RAG

LLM Systems

Prompt Engineering • Agent Orchestration • Function Calling • Vector Databases

Backend Engineering

FastAPI • REST APIs • Docker • Kubernetes • CI/CD

Data Engineering

Kafka • Apache Airflow • SQL • ETL / ELT Pipelines • Data Modeling

Databases

PostgreSQL • Redis • Pinecone • Vector Databases


🚀 Featured Projects

🤖 Auto-ML Agent

AI system that automatically builds machine learning models from datasets.

Capabilities

  • Dataset analysis
  • Feature engineering
  • Model training
  • Evaluation and comparison

Technologies
Python • MLflow • LLM orchestration


⚙️ AI Workflow Builder

Visual platform for building AI workflows using node-based pipelines.

Example pipeline

Web Scraper → Embeddings → LLM → Database

Technologies
React Flow • Python • API orchestration
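
The example pipeline above can be read as a chain of nodes, each passing its output to the next. The snippet below is a minimal Python sketch of that idea, not the project's actual code: `run_pipeline` and the four stub nodes (`scrape`, `embed`, `summarize`, `save`) are hypothetical stand-ins that fake the scraper, embedding model, LLM, and database so the example runs with the standard library alone.

```python
from typing import Any, Callable, List

# A node is any function that takes one input and returns one output.
Node = Callable[[Any], Any]


def run_pipeline(nodes: List[Node], payload: Any) -> Any:
    """Run the nodes in order, feeding each node's output to the next."""
    for node in nodes:
        payload = node(payload)
    return payload


# Hypothetical stub nodes (assumptions for illustration only).
def scrape(url: str) -> str:
    # Stand-in for a real web scraper: returns placeholder text.
    return f"text fetched from {url}"


def embed(text: str) -> dict:
    # Stand-in for an embedding model: uses the text length as a 1-d vector.
    return {"text": text, "vector": [float(len(text))]}


def summarize(doc: dict) -> dict:
    # Stand-in for an LLM call: truncates the text instead of summarizing it.
    return {**doc, "summary": doc["text"][:20]}


store: List[dict] = []  # Stand-in for a database table.


def save(doc: dict) -> dict:
    # Appends the document to the in-memory store and passes it through.
    store.append(doc)
    return doc


result = run_pipeline([scrape, embed, summarize, save], "https://example.com")
```

Under these assumptions, replacing a stub with a real integration only requires keeping the same one-input, one-output shape for that node.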


📊 Spotify Data Engineering Pipeline

End-to-end data pipeline using AWS services.

Architecture

Spotify API
↓
AWS Lambda
↓
S3 Data Lake
↓
Transformation
↓
Analytics Dashboard

Article
https://medium.com/@nikhil-datasolutions/building-a-spotify-etl-pipeline-with-aws-from-api-to-dashboard-81a647ae5bcd


💼 Professional Experience

AI Engineer @ IpserLab

  • Developed Python backend APIs integrating LLM agents with external data sources
  • Built RAG pipelines over structured and unstructured datasets, achieving 95% deterministic responses
  • Designed document ingestion pipelines that reduced manual research time by 45%
  • Implemented ML evaluation and monitoring using MLflow and Airflow

Software Engineer Intern @ Fix-It 24/7

  • Built backend APIs supporting 10K+ monthly users
  • Implemented Kafka streaming pipelines that reduced analytics latency by 60%
  • Developed CI/CD pipelines using Docker and Kubernetes

Software Engineer @ LTIMindtree

  • Developed scalable enterprise applications using Python and SQL for large-scale data processing
  • Built data integration workflows improving data processing efficiency across internal systems
  • Collaborated with cross-functional engineering teams to design and deploy production-grade backend services
  • Optimized database queries and batch processing pipelines to improve system performance and reliability

📊 GitHub Stats

[Images: GitHub stats card and top languages card]


✍️ Writing

Spotify Data Engineering Pipeline
https://medium.com/@nikhil-datasolutions/building-a-spotify-etl-pipeline-with-aws-from-api-to-dashboard-81a647ae5bcd


🔗 Let's Connect

LinkedIn
https://linkedin.com/in/nikhil-doye

GitHub
https://github.com/Nikhil-Doye

Email
nikhil.doye@gmail.com


⭐ Always excited to collaborate on AI agents, LLM systems, and scalable data platforms.

🏆 Achievements

  • 🥇 Ranked World No. 1 SQL Developer (Practice) on HackerRank.
  • 🥈 Won a silver medal at NeuroHack by building a BERT model to interpret service ticket descriptions, then using HDBSCAN for automated ticket categorization.
  • 🥉 Earned a bronze medal in a Kaggle competition on predicting student dropout rates, using machine learning models such as logistic regression and random forest with hyperparameter tuning to improve performance.

📈 GitHub Highlights

🛠 Projects


🌟 Fun Fact

When I’m not analyzing data, you’ll likely find me experimenting with recipes in the kitchen or exploring the hidden gems of Boston. Data and good food are my favorite combination!

Let's collaborate and create something amazing together!

Pinned

  1. workflow-builder (Public)

    Turn ideas into workflows instantly - AI copilot + visual editor = automation magic in seconds

    TypeScript

  2. voice-enabled-browser-automation (Public)

    Voice-controlled browser automation with real-time DOM analysis. Speak commands, get actions executed via Deepgram transcription, LLM intent parsing, and Playwright automation. Built with TypeScrip…

    TypeScript · 2 stars

  3. Hotel-Reservation-Prediction (Public)

    End-to-end MLOps pipeline automating data ingestion, model training, experiment tracking, versioning, and CI/CD deployment. Built with MLflow, Flask, GCP, and Docker to deliver a scalable, producti…

    Python

  4. auto-ml-agent (Public)

    Autonomous machine learning pipeline orchestrated by LLMs that automates the complete ML lifecycle from data preprocessing to model deployment without manual intervention. Features multi-agent arch…

    Python · 3 stars · 1 fork

  5. insurance_purchase (Public)

    Predicting vehicle insurance purchases just got smarter! This repository combines the power of machine learning with seamless deployment, featuring an advanced preprocessing pipeline, top-notch mod…

    Jupyter Notebook

  6. terminal-coding-agent (Public)

    A powerful CLI tool that brings AI-powered code generation and file manipulation directly to your terminal.

    TypeScript · 2 stars