Skip to content

Optimize MLA NKI kernel: hoist DMA loads, reduce masking ops, simplif…

20182de
Select commit
Loading
Failed to load commit list.
Open

Add GLM-5 (754B MoE) contrib model for trn2.48xlarge #143

Optimize MLA NKI kernel: hoist DMA loads, reduce masking ops, simplif…
20182de
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs