Recurser

A new model and implementation to reduce VRAM usage on transformer models.

Online demos

Reduce the VRAM usage of GPT2-XL by 25%. We can run GPT2-XL(float32) with Pytorch on the colab or with our gpu.

Installation

Always install the library from PyPI:

  pip install recursers

Todos

Re-implement recurser for other models
Enable MPS acceleration on Mac
Retraining: The model training of the recurser is a little different from the usual.

(back to top)

Reference

Karpathy's elegant GPT implementation
https://github.com/karpathy/nanoGPT

Hugging Face's library
https://github.com/huggingface/transformers

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
recursers		recursers
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Recurser

Online demos

Installation

Todos

Reference

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Recurser

Online demos

Installation

Todos

Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages