Technical Showcase: 22B True-MoE Engine running on 6GB VRAM (GTX 1060). Demonstrates "Surgical" NF4 quantization, dynamic expert swapping, and the custom "Grace Hopper" pipeline.
-
Updated
Jan 8, 2026
Technical Showcase: 22B True-MoE Engine running on 6GB VRAM (GTX 1060). Demonstrates "Surgical" NF4 quantization, dynamic expert swapping, and the custom "Grace Hopper" pipeline.
Add a description, image, and links to the gtx1060 topic page so that developers can more easily learn about it.
To associate your repository with the gtx1060 topic, visit your repo's landing page and select "manage topics."