Skip to content
#

gdn

Here are 9 public repositories matching this topic...

vLLM patcher for Qwen3.6 on consumer NVIDIA — Qwen3.6-35B-A3B-FP8 (192 tok/s, +68% over stock) + Qwen3.6-27B-int4-AutoRound + 256K context. 126 patches: TurboQuant k8v4 KV, MTP/DFlash spec-decode, FULL cudagraph, hybrid GDN streaming, structured boot summary, one-command installer, 1958 tests. v7.72.2.

  • Updated May 12, 2026
  • Python

Improve this page

Add a description, image, and links to the gdn topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gdn topic, visit your repo's landing page and select "manage topics."

Learn more