Efficient Long-Context Modeling in Diffusion Language Models
CVPR 2026 (Findings)
- Up to 6.95× speedup over FlashAttention
- Training-free sparse attention (no finetuning required)
- Maintains near-full-attention performance at 50% sparsity
- Strong generalization across language, multimodal, and video generation
We propose Block Approximate Sparse Attention (BA-Att), a training-free block-sparse attention framework for Diffusion Language Models (DLMs).
Unlike prior approaches that rely on fixed sparsity patterns, BA-Att:
- Performs block selection in a downsampled space
- Uses norm-based ranking to reduce approximation error
- Applies covariance compensation to recover accuracy
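
The block-selection idea can be illustrated with a minimal PyTorch sketch. This is not the released implementation: the function and parameter names (`block_sparse_attention`, `block_size`, `keep_ratio`) are illustrative, the mean-pooling and norm-weighting details are assumptions based on the description above, and the covariance-compensation step is omitted.

```python
# Illustrative sketch only; not the official BA-Att code.
import torch


def block_sparse_attention(q, k, v, block_size=64, keep_ratio=0.5):
    """Select key/value blocks per query block using pooled (downsampled)
    scores weighted by key-block norms, then attend only to kept blocks.
    Shapes: q, k, v are (batch, heads, seq, dim); seq divisible by block_size."""
    B, H, S, D = q.shape
    nb = S // block_size

    # 1) Downsample: mean-pool queries and keys within each block.
    q_blk = q.view(B, H, nb, block_size, D).mean(dim=3)  # (B, H, nb, D)
    k_blk = k.view(B, H, nb, block_size, D).mean(dim=3)  # (B, H, nb, D)

    # 2) Norm-based ranking (assumed form): score key blocks by the pooled
    #    dot product, scaled by each key block's average token norm.
    k_norm = k.view(B, H, nb, block_size, D).norm(dim=-1).mean(dim=-1)  # (B, H, nb)
    scores = torch.einsum("bhqd,bhkd->bhqk", q_blk, k_blk) * k_norm.unsqueeze(2)

    # Keep the top-scoring key blocks for every query block.
    k_keep = max(1, int(nb * keep_ratio))
    top_idx = scores.topk(k_keep, dim=-1).indices  # (B, H, nb, k_keep)

    # Build a block-level mask and expand it to token resolution.
    blk_mask = torch.zeros(B, H, nb, nb, dtype=torch.bool, device=q.device)
    blk_mask.scatter_(-1, top_idx, True)
    mask = blk_mask.repeat_interleave(block_size, dim=2) \
                   .repeat_interleave(block_size, dim=3)  # (B, H, S, S)

    # 3) Masked attention over the selected blocks only (covariance
    #    compensation for dropped blocks is not shown here).
    attn = torch.einsum("bhqd,bhkd->bhqk", q, k) / D ** 0.5
    attn = attn.masked_fill(~mask, float("-inf"))
    return torch.einsum("bhqk,bhkd->bhqd", attn.softmax(dim=-1), v)


if __name__ == "__main__":
    q = torch.randn(1, 4, 256, 64)
    k = torch.randn(1, 4, 256, 64)
    v = torch.randn(1, 4, 256, 64)
    print(block_sparse_attention(q, k, v).shape)  # torch.Size([1, 4, 256, 64])
```

In this sketch, ranking happens over pooled block representations, so its cost grows with the number of blocks rather than with sequence length, which is what keeps the selection step cheap relative to dense attention.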
Code is coming soon!
We are currently cleaning and organizing the codebase.
