GitHub - ronakrajput8882/CNN-Image-Classifier: CNN CIFAR-10 Image Classifier 🖼️ — Custom PyTorch CNN (75.1% acc) served via FastAPI with a fully custom UI. Deployed with Docker on HuggingFace Spaces. No Gradio, no Streamlit — built from scratch.

📌 Project Overview

A custom CNN image classifier trained from scratch on CIFAR-10, served via a FastAPI backend with a fully custom HTML/JS frontend — no Gradio, no Streamlit. Deployed as a Docker container on HuggingFace Spaces.

Upload any image → get an instant prediction with top-3 class probabilities and confidence scores across all 10 classes.

🎯 ~75.1% test accuracy on CIFAR-10 using a custom 3-layer CNN built entirely in PyTorch.

📂 Dataset

Property	Details
Dataset	CIFAR-10
Total Images	60,000 (50K train / 10K test)
Image Size	32 × 32 × 3 (RGB)
Classes	10 balanced classes
Source	`torchvision.datasets.CIFAR10`

Classes: airplane · automobile · bird · cat · deer · dog · frog · horse · ship · truck

🔄 Pipeline Workflow

Image Upload → Resize (32×32) → Normalize → CNN Forward Pass → Softmax → Top-3 Predictions → JSON Response

1️⃣ Preprocessing

Input image resized to 32×32 using transforms.Resize
Normalized with mean (0.5, 0.5, 0.5) and std (0.5, 0.5, 0.5) → pixel values mapped to [-1, 1]
Converted to PyTorch tensor and batched with unsqueeze(0)

2️⃣ Model Inference

Single forward pass through the CNN
torch.softmax applied to logits → probability distribution
torch.topk(probs, 3) extracts top-3 predictions

3️⃣ API Response

FastAPI /predict endpoint returns: top class, emoji, confidence %, top-3 predictions, and full probability distribution for all 10 classes

🤖 Model Architecture ⭐ Best Model

Custom CNN — 3 Conv Layers

class CNN(nn.Module):
    def __init__(self):
        super(CNN, self).__init__()
        self.conv_layers = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2, 2),   # 32×32 → 16×16
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2, 2),  # 16×16 → 8×8
            nn.Conv2d(64, 128, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2, 2), # 8×8 → 4×4
        )
        self.fc_layers = nn.Sequential(
            nn.Linear(4 * 4 * 128, 256), nn.ReLU(),
            nn.Linear(256, 10),
        )

Feature maps: 32 → 64 → 128 filters (progressive depth)
Spatial reduction: 32×32 → 16×16 → 8×8 → 4×4 via MaxPooling
FC layers: 2048 → 256 → 10 (output logits)
Inference: CPU-compatible, single forward pass, no TTA

📊 Results

Metric	Value
Test Accuracy	~75.1%
Architecture	Custom CNN (3 Conv + 2 FC)
Parameters	~2.1M
Input Size	32 × 32 × 3
Output	10-class softmax
Inference Mode	CPU (no GPU required)

🔍 Key Insights

🧠 Progressive filter doubling (32 → 64 → 128) consistently improves feature extraction on CIFAR-10 without overfitting at this scale
📉 Resolution bottleneck is the primary accuracy ceiling — CIFAR-10's 32×32 images lose fine-grained detail, making classes like cat vs dog genuinely hard even for CNNs
⚠️ Softmax overconfidence is real — the model outputs high confidence even on out-of-distribution images; temperature scaling would help
🚀 A ResNet-18 backbone on the same dataset would push accuracy to ~90–93%, confirming the custom CNN is strong for its parameter count
🐸 frog, ship, and airplane are typically the easiest classes due to distinct color distributions; cat and dog are the hardest

🗂️ Repository Structure

cifar10-classifier/
│
├── app.py                  # FastAPI backend — model loading + /predict endpoint
├── index.html              # Custom frontend UI (drag & drop + results display)
├── cnn_cifar10.pth         # Trained model weights
├── requirements.txt        # Python dependencies
├── Dockerfile              # Docker container config for HF Spaces
├── limitations.txt         # Known model limitations & future improvements
└── README.md               # This file

🚀 Quick Start

Run Locally

# Clone the repo
git clone https://github.com/ronakrajput8882/CNN-Image-Classifier.git
cd CNN-Image-Classifier

# Install dependencies
pip install -r requirements.txt

# Start the server
python app.py
# → Open http://localhost:7860

Run with Docker

docker build -t cifar10-classifier .
docker run -p 7860:7860 cifar10-classifier

Use the Live Demo

🌐 https://ronakrajput8882-cifar10-classifier.hf.space/

🧠 Key Learnings

Serving a PyTorch model with FastAPI is more flexible and production-ready than Gradio/Streamlit for custom UIs
Docker on HuggingFace Spaces gives full control over the runtime environment — no SDK lock-in
CIFAR-10's 32×32 resolution is a hard accuracy ceiling for custom CNNs; modern architectures use data augmentation (RandomCrop, HorizontalFlip, Cutout) to push past 90%
Softmax probabilities are not calibrated — a 95% confidence score ≠ 95% correct; always mention this to end users
Building the frontend from scratch (vs Gradio) teaches you exactly what the model API contract looks like in production

🛠️ Tech Stack

Tool	Use
PyTorch	Model definition, training, inference
torchvision	CIFAR-10 dataset, image transforms
FastAPI	REST API backend (`/predict` endpoint)
uvicorn	ASGI server
Pillow	Image loading and RGB conversion
Docker	Containerization for HF Spaces deployment
HTML/CSS/JS	Custom frontend UI

🌐 Connect with me

If you found this useful, please ⭐ the repo!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📌 Project Overview

📂 Dataset

🔄 Pipeline Workflow

1️⃣ Preprocessing

2️⃣ Model Inference

3️⃣ API Response

🤖 Model Architecture ⭐ Best Model

Custom CNN — 3 Conv Layers

📊 Results

🔍 Key Insights

🗂️ Repository Structure

🚀 Quick Start

Run Locally

Run with Docker

Use the Live Demo

🧠 Key Learnings

🛠️ Tech Stack

🌐 Connect with me

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
cnn_cifar10.pth		cnn_cifar10.pth
index.html		index.html
limitations.txt		limitations.txt
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

📌 Project Overview

📂 Dataset

🔄 Pipeline Workflow

1️⃣ Preprocessing

2️⃣ Model Inference

3️⃣ API Response

🤖 Model Architecture ⭐ Best Model

Custom CNN — 3 Conv Layers

📊 Results

🔍 Key Insights

🗂️ Repository Structure

🚀 Quick Start

Run Locally

Run with Docker

Use the Live Demo

🧠 Key Learnings

🛠️ Tech Stack

🌐 Connect with me

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages