Curated collection of research on the limitations of next-token prediction and methods that go beyond it.
-
Updated
Dec 21, 2025
Curated collection of research on the limitations of next-token prediction and methods that go beyond it.
A lightweight experimental generative model for chemistry, with mini Qwen2-like architecture and horizon loss and biologically-aware RL fine-tuning on SELFIES molecular representations.
ChemMiniQ3-SAbRLo is a lightweight experimental generative model for chemistry, built on mini Qwen2-like arch, designed for rapid prototyping of HuggingFace AutoModel and AutoTokenizer compatibility, and fast iteration of Multi-Token Prediction (MTP) and RL fine-tuning algorithms/rewards.
Add a description, image, and links to the multi-token-prediction topic page so that developers can more easily learn about it.
To associate your repository with the multi-token-prediction topic, visit your repo's landing page and select "manage topics."