A minimal GPT implementation in 83 lines of Python + PyTorch. A rewrite of Andrej Karpathy's microgpt.py, replacing scalar-level autograd with PyTorch tensor ops + CUDA.
Trains on ~32K English names and generates new ones.
- Python >= 3.10
- PyTorch >= 2.4 (for `F.rms_norm`, `F.scaled_dot_product_attention`)
Run:

```
python microgpt-torch.py
```

The script will:
- Download the dataset (if not present)
- Train for 40 epochs (auto-detects GPU)
- Generate 20 new "hallucinated" names
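Training on a names dataset implies a character-level tokenizer; a minimal sketch of how such a dataset might be encoded (the in-memory name list and the `.` end-of-sequence marker are assumptions for illustration, not taken from the script):

```python
# Character-level tokenizer sketch; assumes one name per line in the dataset.
names = ["emma", "olivia", "ava"]  # stand-in for the downloaded ~32K names

chars = sorted(set("".join(names)))               # unique characters
stoi = {ch: i + 1 for i, ch in enumerate(chars)}  # token 0 reserved for <eos>
itos = {i: ch for ch, i in stoi.items()}
itos[0] = "."                                     # end-of-sequence marker

def encode(name):
    return [stoi[c] for c in name] + [0]          # append <eos> token

def decode(tokens):
    return "".join(itos[t] for t in tokens if t != 0)

print(encode("emma"))          # [2, 5, 5, 1, 0]
print(decode(encode("emma")))  # emma
```

Generation then reduces to sampling tokens from the model until the `<eos>` token appears and decoding the result.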
- `microgpt-torch.py` — compact version (83 lines)
- `microgpt-torch-comments.py` — same code with bilingual comments (EN/CN)