smollm2

Active-inference Training with Learned Adaptive Stigmergy — Pure Rust AGI framework, 21 crates, 565 tests, 28 MCP tools, BF16 GPU inference (15.4 tok/s OLMo-3-7B on A100), OpenAI-compatible API, ZK proofs. Zero external dependencies.

Updated Apr 20, 2026
Rust

AlfaPankaj / Neural_Memory_Operating_system

NMOS (Neural Memory OS) is a predictive partial execution engine enabling 70B-level reasoning on 4GB VRAM. It uses the “Zero-Lag” hypothesis, leveraging typing latency as a compute window to mask memory limits via async layer prefetching and speculative decoding.

python machine cuda pytorch memory-management hnsw edge-ai llm generative-ai local-llm llm-inference speculative-decoding smollm2 vram-optimization anticipatory-inference layer-offloading prefeching 70b-model

Updated Apr 6, 2026
Python

IsmaelMousa / automatic-essay-grading

Systematically train and benchmark Mistral, Qwen2.5, and SmolLM2 on essay grading across 39 experiments through data analysis and engineering, structured preprocessing, instruction tuning, postprocessing, and leakage-aware evaluation for robust score and rationale generation.

modeling evaluation transformers data-engineering data-analysis lora preprocessing postprocessing automatic-essay-scoring instruction-tuning supervised-finetuning flash-attention-2 mistral-7b unsloth qwen2-5 smollm2

Updated Aug 14, 2025
Jupyter Notebook

nishantb06 / smolLM

Reverse Engineering SmolLM2 model and training it from scratch

llama llm smollm2

Updated Dec 28, 2025
Python

mikeesto / smollm2-browser

Run SmolLM2 in a web browser with transformers.js

reactjs llm transformersjs smollm2

Updated Dec 1, 2024
TypeScript

VolkanSah / SmolLM2-customs

The point is not the model; the point is the pattern. Fork it, swap SmolLM2 for any model you want, and you have your own private LLM API running for free.

torch train-model huggingface-transformers llm smollm2 train-llm

Updated Apr 9, 2026
Python

amjadAwad95 / smart-resume-analyzer

Build a system that analyzes resumes using NLP techniques, including preprocessing, skill extraction, and similarity scoring.

Updated Jul 9, 2025
Jupyter Notebook

nmdra / Assignment-Metadata-Extractor

A lightweight, locally hosted LLM pipeline that extracts and normalizes unstructured student assignment text into a strict JSON schema using a fine-tuned SmolLM2-360M model served via Ollama.

huggingface json-extractor fine-tuning-llm ollama unsloth small-language-model smollm2

Updated Apr 19, 2026
Jupyter Notebook

Arya920 / A.R.I.A

ARIA is an AI-powered voice assistant that provides intelligent, web-enhanced answers to user queries. Built using a lightweight HuggingFace model, it integrates real-time web search and responds in a professional tone.

dockerfile flask-application voice-assistant tailwindcss agentic-ai smollm2

Updated Apr 28, 2025
HTML

opcastil11 / planckbot

Adaptive tiny-model layer between LLMs and their tools — observes MCP traffic, trains per-tool LoRA compressors, synthesizes new tools from patterns.

agent mcp lora peft tool-use llm prompt-compression smollm2 model-context-protocol claude-code token-optimization

Updated Apr 24, 2026
Python

zhuyie / smollm2.go

Minimal Go implementation for running inference on Hugging Face SmolLM2 Instruct models.

llm-inference smollm2

Updated Apr 27, 2026
Go

AparnaRoy76 / LLM-finetuning

A comprehensive toolkit for fine-tuning Large Language Models (LLMs) using cutting-edge techniques like LoRA, QLoRA, and Unsloth. Includes notebooks and scripts for customizing models such as SmolLM2, Llama-3.2, and Embedding Gemma for tasks like tool calling, reasoning, and embeddings. Perfect for researchers and developers looking to adapt LLMs

lora fine-tuning llm qlora unsloth llama3 smollm2

Updated Jan 15, 2026
Jupyter Notebook

natebabyak / smollm2-chatbot

Free, open-source, privacy-focused, customizable AI chatbot.

nextjs transformersjs smollm2

Updated Dec 13, 2025
TypeScript

xtoazt / maple

A new AI model based on SmolLM2 135M that can be fine tuned in real time and locally hosted on the web using WebContainers.

fast free maple localai webllm smollm2 lowkeykindacool

Updated Oct 12, 2025
TypeScript

Here are 22 public repositories matching this topic...

monade / smol-but-mighty

ethicalabs-ai / Kurtis-E1-MLX-Voice-Agent

ccozad / ml-reference-designs

stlin256 / VLM_Live

ethicalabs-ai / kurtis

pramodkoujalagi / SmolLM2-360M-Instruct-Text-2-JSON

web3guru888 / ATLAS

AlfaPankaj / Neural_Memory_Operating_system

IsmaelMousa / automatic-essay-grading

nishantb06 / smolLM

mikeesto / smollm2-browser

VolkanSah / SmolLM2-customs

amjadAwad95 / smart-resume-analyzer

nmdra / Assignment-Metadata-Extractor

Arya920 / A.R.I.A

opcastil11 / planckbot

zhuyie / smollm2.go

AparnaRoy76 / LLM-finetuning

natebabyak / smollm2-chatbot

xtoazt / maple
