Skip to content

Modular GDN forward / backward kernels (compatible with Kimi CP) #13

@icavan

Description

@icavan

Description

Implement modular forward and backward CUDA kernels for the GDN (Gated Delta Network) algorithm, compatible with Kimi CP.

Context

GDN is a linear attention variant that improves expressiveness with gating and delta-style updates. Implementing modular forward/backward kernels compatible with Kimi CP would enable efficient distributed training via context parallelism.

Tasks

  • Implement GDN forward kernels (modular, similar to KDA structure)
  • Implement GDN backward kernels
  • Ensure compatibility with Kimi CP's context parallelism protocol
  • Add correctness tests against FLA reference implementation
  • Add benchmarks

References

Metadata

Metadata

Assignees

Labels

enhancementNew feature or requesthelp wantedExtra attention is needed

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions