perf(liger/cutile): tune cross_entropy BLOCK_SIZE by GPU arch & Modularize transformer HF benchmark & other updates#153
Open
hannahli-nv wants to merge 4 commits into
Open
perf(liger/cutile): tune cross_entropy BLOCK_SIZE by GPU arch & Modularize transformer HF benchmark & other updates#153hannahli-nv wants to merge 4 commits into
hannahli-nv wants to merge 4 commits into