brandonmmusic-max

Brandon M. Music brandonmmusic-max

I am practicing lawyer from Kentucky who took an interest in ai systems engineering. I've been coding in some form for 30 years .

Achievements

sm120-moe-bench sm120-moe-bench Public

SM120 MoE Inference Benchmark: Qwen3.5-397B on RTX PRO 6000 Blackwell — K=64 CUTLASS kernel fix + real-world legal prompt benchmarks

Cuda 3
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 1
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 1
cutlass cutlass Public

Forked from NVIDIA/cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 1
free-code free-code Public

Forked from paoloanzn/free-code

The free build of Claude Code. All telemetry removed, security-prompt guardrails stripped, all experimental features enabled.

TypeScript 1
verdict-warp-decode verdict-warp-decode Public

Neuron-centric fused MoE kernel for SM120 NVFP4 — 17.5μs/layer, 1.02x faster than VerdictMoE, 5.6x faster than CUTLASS

Cuda 1