Commit 6d9ff42

update sgl image
fix PR number
1 parent ed1b0ba commit 6d9ff42

2 files changed: 2 additions, 2 deletions


.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
@@ -1805,7 +1805,7 @@ qwen3.5-bf16-b200-sglang:
     - { tp: 8, ep: 1, conc-start: 4, conc-end: 64 }
 
 qwen3.5-fp8-b200-sglang:
-  image: lmsysorg/sglang:v0.5.9-cu129-amd64
+  image: lmsysorg/sglang:v0.5.9-cu130-amd64
   model: Qwen/Qwen3.5-397B-A17B-FP8
   model-prefix: qwen3.5
   runner: b200

perf-changelog.yaml

Lines changed: 1 addition & 1 deletion
@@ -976,4 +976,4 @@
   description:
     - "Replace FP8 TP4/EP4 with TP8 config (conc 4-128) for all ISL/OSL combos"
     - "Add --enable-flashinfer-allreduce-fusion to FP8 benchmark script"
-  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/918
