- Interested in Machine Learning, Generative AI, and Data Science
- Currently learning LLMs, RAG, and AI Agents
- Building AI-powered applications and data-driven solutions
- Always exploring new technologies
Programming Languages: Python, SQL
Data Analysis: Data Cleaning, EDA, Hypothesis Testing, Central Limit Theorem, Statistical Modelling, A/B Testing
Data Visualization: Tableau, Matplotlib, Seaborn, Power BI, Excel
Data Wrangling: Pandas, Numpy, Scikit-Learn, SciPy, Excel
AI & LLMs: LangChain, Prompt Engineering, AI Agents, RAG, LLM Evaluation, OpenAI API, Multi-Agent Systems
Cloud Computing: AWS
Tools & Technologies: Git, GitHub, Jupyter Notebook
YouTube Podcast AI Q&A System is a Retrieval-Augmented Generation (RAG) application that allows users to interact with YouTube videos through natural language questions. The application processes video transcripts, builds semantic search indexes, retrieves relevant content, and generates context-aware answers using Google's Gemini model.
Tech: Python, Google Gemini 2.5 Flash, Sentence Trasformers, Fatser Whisper, LangChain
🔗 Repository: https://github.com/PiyushKumar74110/Podcast-Q-A-Bot
An end-to-end AI system that converts YouTube videos into structured knowledge and enables conversational interaction using Retrieval-Augmented Generation (RAG). The system performs audio extraction, transcription, summarization, insight extraction, and semantic question answering over video content.
Tech: Python, LangChain, RAG pipeline, LLM Orchestration, Mistral AI, OpenAI Whisper, Saravm AI STT model, ChromaDB, HuggingFace (all-MiniLM-L6-v2), yt-dlp, FFmpeg, soundfile, NumPy, NLP and Text Processing
🔗 Repository: https://github.com/PiyushKumar74110/Saaransh-AI
This project is an end-to-end MLOps pipeline designed for time series data analytics and forecasting. It automates data ingestion, preprocessing, model training, evaluation, and batch prediction workflows. The system enables scalable and reproducible analysis of time-dependent data while following MLOps best practices for deployment and monitoring.
Tech: Python, Pandas, Scikit-Learn, XGBoost, Docker
🔗 Repository: https://github.com/PiyushKumar74110/mlops-batch-job