Supports explicit pytest parametrization by dphuang2 · Pull Request #190 · eval-protocol/python-sdk

dphuang2 · 2025-09-18T21:32:45Z

@pytest.mark.parametrize(
    "completion_params",
    [
        {
            "model": "fireworks_ai/accounts/fireworks/models/deepseek-v3p1",
        },
        {
            "model": "fireworks_ai/accounts/fireworks/models/kimi-k2-instruct-0905",
        },
    ],
    ids=DefaultParameterIdGenerator.generate_id_from_dict,
)
@evaluation_test(
    input_rows=[input_rows],
    rollout_processor=SingleTurnRolloutProcessor(),
    preprocess_fn=split_multi_turn_rows,
    mode="all",
)
async def test_llm_judge_openai_responses(rows: List[EvaluationRow]) -> List[EvaluationRow]:
    return await aha_judge(rows)

Dylan Huang added 3 commits September 17, 2025 15:42

v2 proposal

aa6077c

allow for manual parametrization using pytest

8e406ee

delete proposal

ab6d761

dphuang2 changed the title ~~Allow for manual pytest parametrization~~ Supports explicit pytest parametrization Sep 18, 2025

benjibc approved these changes Sep 18, 2025

View reviewed changes

Dylan Huang added 2 commits September 18, 2025 16:01

test_import_logs works

94ae1b3

add ids

d2d5d95

dphuang2 merged commit 057e132 into main Sep 18, 2025
3 of 6 checks passed

dphuang2 deleted the eval-protocol-v2-interface branch September 18, 2025 23:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Supports explicit pytest parametrization#190

Supports explicit pytest parametrization#190
dphuang2 merged 5 commits intomainfrom
eval-protocol-v2-interface

dphuang2 commented Sep 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dphuang2 commented Sep 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants