Awsome-Large-Language-Diffusion-Models

A comprehensive list of papers on large language diffusion models.

Important

Contributions welcome:

If a relevant paper is missing from this list, please contact us or submit a pull request directly.
If you think your paper belongs in a different category, please contact us or submit a pull request.
If your paper has since been accepted, please consider updating its venue information.
Thank you!

💥 News 💥

🔥🔥🔥 Awsome-Large-LDM is now open!

Framework

Large Diffusion Language Models
Multi-Modal Large Diffusion Language Models
Backgrounds
- Seminal Diffusion Papers
- Diffusion Language Models (<7B)

Large Diffusion Language Models

Scaling

Paper Title	Year	Conference/Journal	Remark
David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs	2023	NAACL
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning	2023	arXiv
TESS 2: A Large-Scale Generalist Diffusion Language Model	2025	ACL	Adapted from Mistral-7B-v0.1
Scaling Diffusion Language Models via Adaptation from Autoregressive Models	2025	ICLR	127M~7B (GPT2, LLaMA2)
Large Language Diffusion Models	2025	arXiv	LLaDA-8B
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models	2025	arXiv
Large Language Models to Diffusion Finetuning	2025	arXiv

Caching

Paper Title	Year	Conference/Journal
Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion	2025	arXiv
dKV-Cache: The Cache for Diffusion Language Models	2025	arXiv
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	2025	arXiv

Reasoning

Paper Title	Year	Conference/Journal
Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models	2025	arXiv
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning	2025	arXiv
Diffusion of Thought: Chain-of-Thought Reasoning in Diffusion Language Models	2024	NeurIPS

Multi-Modal Large Diffusion Language Models

Paper Title	Year	Conference/Journal	Remark
MMaDA: Multimodal Large Diffusion Language Models	2025	arXiv
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning	2025	arXiv

Backgrounds

Seminal Diffusion Papers

Paper Title	Year	Conference/Journal	Remark
Deep Unsupervised Learning using Nonequilibrium Thermodynamics	2015	ICML	Diffusion Formulation
Denoising Diffusion Probabilistic Models	2020	NeurIPS
Denoising Diffusion Implicit Models	2021	ICLR
Score-Based Generative Modeling through Stochastic Differential Equations	2021	ICLR
DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps	2022	NeurIPS
High-Resolution Image Synthesis with Latent Diffusion Models	2022	CVPR
Scalable Diffusion Models with Transformers	2023	ICCV
Score-based Generative Modeling in Latent Space	2021	NeurIPS	Latent
Structured Denoising Diffusion Models in Discrete State-Spaces	2021	NeurIPS	Discrete
Vector Quantized Diffusion Model for Text-to-Image Synthesis	2022	CVPR	VQ
Diffusion Models Beat GANs on Image Synthesis	2021	NeurIPS	CG
Classifier-Free Diffusion Guidance	2021	NeurIPS	CFG
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning	2023	ICLR	Self-conditioning
Progressive Distillation for Fast Sampling of Diffusion Models	2022	ICLR	Distillation
Consistency Models	2023	ICML

Diffusion Language Models (<7B)

Paper Title	Year	Conference/Journal	Remark
Diffusion-LM Improves Controllable Text Generation	2022	NeurIPS	Embedding
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models	2023	ICLR	Embedding
DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models	2023	ACL	Masked
Latent Diffusion for Language Generation	2023	NeurIPS	Latent
Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution	2024	ICML	Masked
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control	2023	ACL	Simplex, Blockwise
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation	2023	NeurIPS	AR-like noise
Likelihood-Based Diffusion Language Models	2023	NeurIPS	Plaid1B
Scaling up Masked Diffusion Models on Text	2024	ICLR	1.1B
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models	2025	ICLR

Contact

We welcome all researchers to contribute to this repository.

If you have a related paper that is not yet listed here, please contact us.

Email: jake630@snu.ac.kr / wjk9904@snu.ac.kr
