[Prototype] Add scalable sparse map kernel for large MoE models (128+ experts)#375
Draft
tscholak wants to merge 4 commits intoadd-gpt-oss-converterfrom
Draft
[Prototype] Add scalable sparse map kernel for large MoE models (128+ experts)#375tscholak wants to merge 4 commits intoadd-gpt-oss-converterfrom
tscholak wants to merge 4 commits intoadd-gpt-oss-converterfrom