Skip to content

Commit 86e3eec

Browse files
committed
remove allreduce fusion arg
1 parent 33bb71a commit 86e3eec

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

benchmarks/single_node/qwen3.5_fp4_b200.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,7 @@ PYTHONNOUSERSITE=1 python3 -m sglang.launch_server --model-path=$MODEL --host=0.
4242
--mem-fraction-static $MEM_FRAC_STATIC --chunked-prefill-size 32768 --max-prefill-tokens 32768 \
4343
--context-length $CONTEXT_LENGTH --disable-radix-cache \
4444
--attention-backend trtllm_mha --moe-runner-backend flashinfer_trtllm \
45-
--enable-flashinfer-allreduce-fusion --scheduler-recv-interval 30 \
45+
--scheduler-recv-interval 30 \
4646
--stream-interval 30 --quantization modelopt_fp4 \
4747
--kv-cache-dtype fp8_e4m3 --fp4-gemm-backend flashinfer_cutlass > $SERVER_LOG 2>&1 &
4848

0 commit comments

Comments
 (0)