Tiny Transformer

An experimental PyTorch implementation of the decoder from the Transformer architecture introduced in "Attention Is All You Need" (Vaswani et al., 2017).
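For orientation, here is a minimal sketch of what a single decoder layer can look like in PyTorch. It is not taken from this repository. It assumes a decoder-only setup (masked self-attention plus a feed-forward network, with no encoder cross-attention) and the post-layer-norm arrangement from the paper. The class name `DecoderBlock` and the sizes `d_model`, `n_heads`, and `d_ff` are hypothetical.

```python
import math

import torch
import torch.nn as nn
import torch.nn.functional as F


class DecoderBlock(nn.Module):
    """Hypothetical decoder layer: causal self-attention, then a feed-forward network."""

    def __init__(self, d_model: int = 64, n_heads: int = 4, d_ff: int = 256):
        super().__init__()
        assert d_model % n_heads == 0, "d_model must be divisible by n_heads"
        self.n_heads = n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)  # joint query/key/value projection
        self.proj = nn.Linear(d_model, d_model)     # output projection after attention
        self.ff = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model)
        )
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        B, T, C = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Split channels into heads: (B, T, C) -> (B, n_heads, T, head_dim)
        q, k, v = (
            t.view(B, T, self.n_heads, C // self.n_heads).transpose(1, 2)
            for t in (q, k, v)
        )
        # Scaled dot-product attention scores: (B, n_heads, T, T)
        scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
        # Causal mask: position i may not attend to positions j > i
        mask = torch.triu(
            torch.ones(T, T, dtype=torch.bool, device=x.device), diagonal=1
        )
        scores = scores.masked_fill(mask, float("-inf"))
        attn = F.softmax(scores, dim=-1)
        # Merge heads back: (B, n_heads, T, head_dim) -> (B, T, C)
        out = (attn @ v).transpose(1, 2).reshape(B, T, C)
        x = self.norm1(x + self.proj(out))  # residual + layer norm (post-LN)
        x = self.norm2(x + self.ff(x))      # residual + layer norm (post-LN)
        return x
```

A block like this maps a `(batch, sequence, d_model)` tensor to a tensor of the same shape. Because of the causal mask, the output at each position depends only on that position and the ones before it, which is what allows next-token training on text.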

The current model was trained on the TinyStories dataset.

Demo

This project is a work in progress.