Pull requests: fla-org/flash-linear-attention

Fix dk in normalized linear attention
#875 opened May 3, 2026 by ayghri

[Common] Switch chunk paths to exp2
#867 opened Apr 29, 2026 by yzhangcs (Member)

[GDN] Fix WY backward when gating is disabled
#863 opened Apr 26, 2026 by yzhangcs (Member)

[Model] Add YOCO model implementation
#857 opened Apr 22, 2026 by Shomvel (Contributor, draft)

[GDN] Tricked kernels: ungated KKT + fused inference via similarity transform
#797 opened Mar 28, 2026 by hypnopump (Contributor)

[Layernorm] Fix autotuner crash and OOB writes in layer_norm_bwd on high-SM GPUs
#796 opened Mar 28, 2026 by mpurland (Contributor)

Add LinOSS model (ICLR 2025 oral)
#749 opened Feb 17, 2026 by Phoenix8215 (Contributor)

Add fused short convolution kernel with L2 norm
#661 opened Nov 24, 2025 by sustcsonglin (Collaborator)

[kda] add recursive block intra implementation
#656 opened Nov 22, 2025 by sustcsonglin (Collaborator)

Update README.md of ops delta_rule
#595 opened Sep 17, 2025 by SeepingFragranceLock (Contributor)

Cached inference for NSA
#574 opened Aug 22, 2025 by mutiann (Contributor)

Modify output shape in nsa for decoding
#565 opened Aug 14, 2025 by Espere-1119-Song

Updated the Technical Note for WY of DPLR
#562 opened Aug 12, 2025 by phnazari

Delta Product Rule Backwards Kernel
#526 opened Jul 14, 2025 by phi-jkim