[SPARK-56244][PYTHON] Refine benchmark class layout in bench_eval_type.py by Yicong-Huang · Pull Request #55040 · apache/spark

Yicong-Huang · 2026-03-26T21:14:59Z

What changes were proposed in this pull request?

Refines the benchmark class layout in bench_eval_type.py:

Move scenarios and UDF definitions into EvalType mixin classes - each mixin now owns its _scenarios, _udfs, params, and param_names, so Time/Peakmem benchmark classes are zero-copy (pass).
Extract MockProtocolWriter and MockDataFactory utility classes - consolidates 17 scattered module-level helper functions into two organized classes with @staticmethod methods.

Shared scenarios between related eval types (e.g., ScalarArrow/ScalarArrowIter) use inheritance rather than cross-mixin references.

Why are the changes needed?

The current layout has significant repetition: every Time/Peakmem class duplicates _scenarios, _udfs, params, and param_names. Helper functions are scattered across the file with no clear organization.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Verified all 18 benchmark classes import correctly, have correct params/param_names, and pass smoke tests (setup + time_worker) for every eval type.

Was this patch authored or co-authored using generative AI tooling?

No

…ault, and struct type support

zhengruifeng · 2026-03-27T04:36:29Z

merged to master

Yicong-Huang added 4 commits March 26, 2026 18:39

refactor: refine benchmark class layout in bench_eval_type.py

ac57427

refactor: extract MockProtocolWriter and MockDataFactory util classes

6709d3d

refactor: unify MockDataFactory API with keyword args, batch_size def…

49a9771

…ault, and struct type support

refactor: unify scenario tuples, write_udf_payload, and TYPE_REGISTRY

81a1a67

zhengruifeng approved these changes Mar 27, 2026

View reviewed changes

zhengruifeng closed this in 80d9d47 Mar 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-56244][PYTHON] Refine benchmark class layout in bench_eval_type.py#55040

[SPARK-56244][PYTHON] Refine benchmark class layout in bench_eval_type.py#55040
Yicong-Huang wants to merge 4 commits intoapache:masterfrom
Yicong-Huang:SPARK-56244/refine-bench-layout

Yicong-Huang commented Mar 26, 2026

Uh oh!

zhengruifeng commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Yicong-Huang commented Mar 26, 2026

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

zhengruifeng commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants