Skip to content

Shubham2376G/AI_Projects

Repository files navigation

AI & LLM Projects โ€“ Shubham Aggarwal ๐Ÿ‘‹

Welcome to my AI & LLM project portfolio! Iโ€™m passionate about cutting-edge machine learning research and real-world AI applications. This repository showcases projects across Large Language Models (LLMs), data science competitions, and classic ML/NLP tasks, highlighting the impact, tools, and methods behind each.

๐Ÿ’ก Explore my achievements, skills, and certificationsโ€”updated regularly to reflect my AI journey.

๐ŸŒ Check out all my projects on GitHub


๐Ÿ”— Profiles


๐Ÿ“œ Certifications

  • IBM Data Science Professional Certificate (10-course specialization)
  • TensorFlow 2.0: Deep Learning and Artificial Intelligence
  • Machine Learning Specialization (3 courses)
  • Transformers for Natural Language Processing
  • Machine Learning: Natural Language Processing in Python (V2)
  • Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024!
  • Generative AI, from GANs to CLIP, with Python and Pytorch

๐Ÿ”ฌ LLM Research Study

  • Implemented a teacherโ€“student learning pipeline to compress large LLMs into smaller ones, reducing inference cost while retaining high performance.
  • Key Insight: A well-designed student model can achieve near-teacher accuracy with far fewer parameters, making deployment more efficient and fast.
  • Implemented speculative decoding to accelerate LLM inference by letting a smaller draft model propose multiple tokens, which are then selectively verified by a larger target model.
  • Key Insight: Parallelizing token generation using a lightweight draft model significantly speeds up decoding while maintaining the accuracy of the main model.
  • Reimplemented Microsoftโ€™s 1-BitNet, which leverages 1-bit quantization for training large-scale LLMs, drastically reducing memory and communication cost.
  • Key Insight: 1-bit quantization, combined with error compensation, enables near full-precision accuracy while significantly lowering training overhead.
  • Developed a prompt-engineering framework that guides LLMs through sequential reasoning steps.
  • Improves accuracy and explainability in reasoning tasks.
  • Created Python scripts to learn LangGraph and build a local chatbot with structured memory and human-in-the-loop messaging.
  • Implemented multiple components including ReAct reasoning, parallelization, state management, and long-term memory to understand LangGraph workflows.
  • Fine-tuned Qwen-2 VL multimodal model using LoRA and LlamaFactory for structured data extraction from product images.

๐Ÿ† Competition Work

  • Developed a model to classify patent papers into EPO categories, including text preprocessing with NLTK (stop words removal, stemming, lemmatization) and TF-IDF vectorization..
  • Used an ensemble of CNN and RNN models to improve accuracy to 68%, ranked Top 10 among 1000+ competitors, and presented the solution to a jury.
  • Built an interactive Tableau dashboard to analyze world cuisine trends with filters for country, cuisine type, and ratings.
  • Performed a SWOT analysis on global cuisine patterns to highlight strengths, weaknesses, opportunities, and threats.
  • Performed advanced feature engineering on cricket data.
  • Trained stacked ensembles (XGBoost, LightGBM, CatBoost) with Optuna hyperparameter tuning to maximize predictive accuracy.

๐Ÿค– Machine Learning & NLP Projects

  • Created a group chat system using Microsoft AutoGen with specialized AI agents (Fitness & Nutritionist).
  • Built agents to deliver personalized workout and diet recommendations, simulating an interactive health advisory team.
  • Built an agentic workflow to automatically generate FAQ sections for websites, including relevant hyperlinks.
  • Leveraged SLMs to analyze website content, extract key topics, and create concise, well-structured FAQ entries.
  • Organized and led a workshop on Q-learning and advanced RL methods (OPRO, GRPO).
  • Prepared the dataset by fetching data via RESTful APIs and scraping sources like Wikipedia using BeautifulSoup.
  • Explored and visualized data with Folium and Seaborn heatmaps, and trained an SVM model achieving 88.8% accuracy.
  • Implemented a text classifier for positive/negative/neutral sentiment categorization.
  • Trained a Naive Bayes spam classifier with engineered features (keywords, sender info, formatting patterns).

๐Ÿ“š Research Engagement

In addition to projects and competitions, I also contribute to research.
I maintain a Preprints folder containing my own works:
๐Ÿ‘‰ My Research Preprints

These reflect my initial explorations into AI research directions and demonstrate my ability to translate ideas into written scholarly form.

Alongside this, I have studied and annotated 25+ research papers to deepen my theoretical understanding of AI. These span:

  • Scaling & Efficiency โ†’ Scaling Laws, Chinchilla, Broken Scaling Laws, BitNet, Test-Time-Training
  • Reasoning in LLMs โ†’ Chain-of-Thought Prompting, CoT Decoding, Logic of Thought, Scheming LLMs
  • Model Compression & Optimization โ†’ Knowledge Distillation, Batch Normalization, Differential Methods, Ramanujanโ€™s Randomly Weighted Networks
  • Generative Models โ†’ Diffusion vs. Autoregressive Models, Qwen3 Technical Report
  • Cross-disciplinary Methods โ†’ Betti Numbers in Topology, AI for Data Analysis

A dedicated folder with my notes and summaries is available here:
๐Ÿ‘‰ Research Papers โ€“ Literature Notes

Reading and annotating research papers not only strengthens my theoretical foundation but also sparks creative new ideas for projects, experiments, and applications.

About

This space showcases my explorations in the realm of AI, I have done for academic, self-learning and hobby purposes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors