Windows NVIDIA-only Triton 3.7.0 build pipeline for RTX 5090 / Blackwell sm_120a, with FP8 tl.dot validation and peak benchmark results.
windows benchmark gpu cuda pytorch nvidia triton windows-build tensor-cores blackwell fp8 triton-lang rtx-5090 triton-windows cuda-13 sm120a fp8-matmul
-
Updated
May 26, 2026 - Batchfile