Pull requests: sgl-project/sgl-kernel-npu
Adaptation of the Deepep A5 normal and low-latency operators.
#464
opened May 8, 2026 by
1329009851
Contributor
fix of current build.sh cannot handle multiple CANN installation
#460
opened May 4, 2026 by
Sawyer117
Adaptation of the Deepep A5 normal and low-latency operators.
#458
opened Apr 30, 2026 by
oagniqgnat
Contributor
Add prebuilt metadata support and tests for chunk operations
#454
opened Apr 29, 2026 by
AndyLi429
Contributor
improve performance for fused gdn gating and solve tril
#450
opened Apr 27, 2026 by
zhaozx-cn
support ssd chunk scan triton & ssd chunk state triton on npu
#448
opened Apr 27, 2026 by
sigama-w
add dispatch_ffn_combine_bf16 kernel for deepep
#410
opened Mar 27, 2026 by
zuje123
Collaborator
[WIP] add fuse_deep_moe_no_buffer for enable-torch-compile
#409
opened Mar 27, 2026 by
jiaming1130
add fused_deep_moe test for dispatch_ffn_combine
#400
opened Mar 18, 2026 by
zuje123
Collaborator
MMLU benchmark for different inverse implementations
#374
opened Feb 11, 2026 by
gioelegott
(tri_inv) (pto-isa) implement AIV triangular inverse using pto-isa
#369
opened Feb 6, 2026 by
zouzias
Contributor
wrap triton_kernels into callable that can be traced into a graph
#368
opened Feb 5, 2026 by
lawtherWu
deepep normal support shmem with asymmetric tensor
#328
opened Jan 19, 2026 by
zuje123
Collaborator