Skip to content

Commit 663cf8c

Browse files
fix typo
1 parent 8c57e1b commit 663cf8c

File tree

2 files changed

+18
-20
lines changed

2 files changed

+18
-20
lines changed

.github/scripts/profiler_rocprofv2.py

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -21,8 +21,6 @@
2121
else:
2222
kernel_dict[name] = [time]
2323

24-
kernel_list.sort(key = lambda row: row[0])
25-
2624
data = [["name", "mean", "stdev", "count"]]
2725
for name, time_list in kernel_dict.items():
2826
count = len(time_list)

.github/workflows/standalone-benchmark.yml

Lines changed: 18 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -16,24 +16,24 @@ jobs:
1616
matrix:
1717
name: [cpu, nvidia-h100, nvidia-l40s, amd-mi300x, amd-w7900]
1818
include:
19-
- name: cpu
20-
runner: cern-nextgen-h100
21-
cmake_args: -DENABLE_CUDA=0 -DENABLE_HIP=0
22-
profiler_runs: 42
23-
standalone_runs: 42
24-
cpu_gpu: "-c"
25-
- name: nvidia-h100
26-
runner: cern-nextgen-h100
27-
cmake_args: -DENABLE_CUDA=1 -DENABLE_HIP=0 -DCUDA_COMPUTETARGET=90
28-
profiler_runs: 21
29-
standalone_runs: 42
30-
cpu_gpu: "-g --memSize 20000000000"
31-
- name: nvidia-l40s
32-
runner: cern-nextgen-l40s
33-
cmake_args: -DENABLE_CUDA=1 -DENABLE_HIP=0 -DCUDA_COMPUTETARGET=89
34-
profiler_runs: 42
35-
standalone_runs: 42
36-
cpu_gpu: "-g --memSize 20000000000"
19+
# - name: cpu
20+
# runner: cern-nextgen-h100
21+
# cmake_args: -DENABLE_CUDA=0 -DENABLE_HIP=0
22+
# profiler_runs: 42
23+
# standalone_runs: 42
24+
# cpu_gpu: "-c"
25+
# - name: nvidia-h100
26+
# runner: cern-nextgen-h100
27+
# cmake_args: -DENABLE_CUDA=1 -DENABLE_HIP=0 -DCUDA_COMPUTETARGET=90
28+
# profiler_runs: 21
29+
# standalone_runs: 42
30+
# cpu_gpu: "-g --memSize 20000000000"
31+
# - name: nvidia-l40s
32+
# runner: cern-nextgen-l40s
33+
# cmake_args: -DENABLE_CUDA=1 -DENABLE_HIP=0 -DCUDA_COMPUTETARGET=89
34+
# profiler_runs: 42
35+
# standalone_runs: 42
36+
# cpu_gpu: "-g --memSize 20000000000"
3737
- name: amd-mi300x
3838
runner: cern-nextgen-mi300x
3939
cmake_args: -DENABLE_CUDA=0 -DENABLE_HIP=1 -DHIP_AMDGPUTARGET=gfx942

0 commit comments

Comments
 (0)