Commit b4d149c

full test
1 parent 7418781 commit b4d149c

1 file changed: 3 additions, 3 deletions

eval_protocol/benchmarks/test_aime25.py

Lines changed: 3 additions & 3 deletions
@@ -79,9 +79,9 @@ def aime2025_dataset_adapter(rows: List[Dict[str, Any]]) -> List[EvaluationRow]:
 
 @evaluation_test(
     input_dataset=[
-        _get_aime_dataset_path(),
-        # "https://huggingface.co/datasets/opencompass/AIME2025/raw/main/aime2025-I.jsonl",
-        # "https://huggingface.co/datasets/opencompass/AIME2025/raw/main/aime2025-II.jsonl",
+        # _get_aime_dataset_path(),
+        "https://huggingface.co/datasets/opencompass/AIME2025/raw/main/aime2025-I.jsonl",
+        "https://huggingface.co/datasets/opencompass/AIME2025/raw/main/aime2025-II.jsonl",
     ],
     dataset_adapter=aime2025_dataset_adapter,
     completion_params=[