[SPARK-56241][SQL] Derive `outputOrdering` from `KeyedPartitioning` key expressions by peter-toth · Pull Request #55036 · apache/spark

peter-toth · 2026-03-26T16:41:36Z

What changes were proposed in this pull request?

Within a KeyedPartitioning partition, all rows share the same key value, so the key expressions are trivially sorted (ascending) within each partition.
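A minimal sketch of that observation, using a hypothetical `Row` type rather than Spark's internal row representation: when every row in a partition carries the same key, a sortedness check on the key passes regardless of the order the rows arrive in.

```scala
object ConstantKeyOrdering {
  // Hypothetical row type: `key` stands in for the partition key value.
  final case class Row(key: Int, payload: String)

  // True when `rows` is in non-decreasing order of `key`.
  def isSortedByKey(rows: Seq[Row]): Boolean =
    rows.zip(rows.drop(1)).forall { case (a, b) => a.key <= b.key }

  // Every row shares key 7; the payloads are deliberately unordered.
  val partition: Seq[Row] = Seq(Row(7, "c"), Row(7, "a"), Row(7, "b"))

  // A partition that mixes keys carries no such guarantee.
  val mixed: Seq[Row] = Seq(Row(7, "c"), Row(3, "a"))
}
```

Here `isSortedByKey(partition)` holds while `isSortedByKey(mixed)` does not, which is the structural guarantee the rest of the description relies on.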

This PR makes two plan nodes expose that structural guarantee via outputOrdering:

DataSourceV2ScanExecBase: when outputPartitioning is a KeyedPartitioning, prepend one ascending SortOrder per key expression to whatever SupportsReportOrdering reports, merging overlapping sameOrderExpressions in a single pass.
GroupPartitionsExec:
- Non-coalescing (every group has ≤ 1 input partition): pass through child.outputOrdering unchanged.
- Coalescing without reducers: re-derive ordering from the output KeyedPartitioning key expressions; a join may embed multiple KeyedPartitionings with different expressions — expose equivalences via sameOrderExpressions.
- Coalescing with reducers: fall back to super.outputOrdering (empty), because merged partitions share only the reduced key.
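The rules above can be sketched with a toy model. This is not the code in the PR: `SortOrder`, `Grouping`, `scanOrdering` and `groupOrdering` below are simplified stand-ins in which expressions are plain strings and every order is ascending. The sketch also assumes that key position `i` in each embedded partitioning refers to the same logical key, which is how the join equivalences are modelled here.

```scala
object KeyDerivedOrdering {
  // Toy stand-in, NOT Spark's SortOrder: ascending only, expressions as strings.
  final case class SortOrder(child: String, sameOrderExpressions: Seq[String] = Nil)

  // Scan: one SortOrder per key expression first, then the reported ordering.
  // A reported order on a key expression is folded into the key's SortOrder.
  def scanOrdering(keys: Seq[String], reported: Seq[SortOrder]): Seq[SortOrder] = {
    val keyOrders = keys.map { k =>
      val extras = reported.filter(_.child == k).flatMap(_.sameOrderExpressions).distinct
      SortOrder(k, extras)
    }
    keyOrders ++ reported.filterNot(r => keys.contains(r.child))
  }

  // Hypothetical description of what a grouping node does to its input.
  sealed trait Grouping
  case object NonCoalescing extends Grouping
  final case class Coalescing(partitionings: Seq[Seq[String]], hasReducers: Boolean)
    extends Grouping

  def groupOrdering(childOrdering: Seq[SortOrder], grouping: Grouping): Seq[SortOrder] =
    grouping match {
      // Each group has at most one input partition: nothing is merged.
      case NonCoalescing => childOrdering
      // Merged partitions share only the reduced key: report no ordering.
      case Coalescing(_, true) => Nil
      // Re-derive from key expressions; expressions at the same key position
      // across partitionings become sameOrderExpressions of one SortOrder.
      case Coalescing(partitionings, false) =>
        partitionings.transpose.map { exprs =>
          val unique = exprs.distinct
          SortOrder(unique.head, unique.tail)
        }
    }
}
```

For example, `groupOrdering(child, Coalescing(Seq(Seq("a.id"), Seq("b.id")), hasReducers = false))` yields a single `SortOrder("a.id", Seq("b.id"))` in this model, whatever `child` reported.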

Why are the changes needed?

Before this change, outputOrdering on both nodes returned an empty sequence (unless SupportsReportOrdering was implemented), even though the within-partition ordering was structurally guaranteed by the partitioning itself. As a result, EnsureRequirements would insert a redundant SortExec before SortMergeJoin inputs that are already in key order.
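A rough illustration of why an empty `outputOrdering` leads to an extra sort. The names `satisfies` and `planInput` are hypothetical, and the prefix check is only in the spirit of what the planner does, not a copy of it: a sort is added whenever the required ordering is not a prefix of what the child reports.

```scala
object SortInsertion {
  // Toy stand-in, NOT Spark's SortOrder: ascending only, expressions as strings.
  final case class SortOrder(child: String, sameOrderExpressions: Seq[String] = Nil) {
    def matches(required: String): Boolean =
      child == required || sameOrderExpressions.contains(required)
  }

  // The requirement is met when it is a prefix of the reported ordering.
  def satisfies(output: Seq[SortOrder], required: Seq[String]): Boolean =
    required.length <= output.length &&
      required.zip(output).forall { case (req, out) => out.matches(req) }

  // Returns a textual plan: a sort is added only when the requirement is unmet.
  def planInput(output: Seq[SortOrder], required: Seq[String]): String =
    if (satisfies(output, required)) "Scan" else "Sort <- Scan"
}
```

With an empty reported ordering, `planInput(Nil, Seq("id"))` gives `"Sort <- Scan"`; once the scan reports `SortOrder("id")`, the same requirement yields just `"Scan"`.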

Does this PR introduce any user-facing change?

Yes. Queries involving storage-partitioned joins (v2 bucketing) no longer add a redundant SortExec before SortMergeJoin when the join keys match the partition keys, reducing CPU and memory overhead.

How was this patch tested?

New unit test class GroupPartitionsExecSuite covering all four outputOrdering branches (non-coalescing, coalescing without reducers with single and multi-key, join sameOrderExpressions, coalescing with reducers).
New SQL integration tests in KeyGroupedPartitioningSuite:
- Scan with KeyedPartitioning reports key-derived outputOrdering.
- Non-coalescing GroupPartitionsExec (non-identical key sets) passes through child ordering — no pre-join SortExec.
- Coalescing GroupPartitionsExec derives ordering from key expressions — no pre-join SortExec.

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude Sonnet 4.6

dongjoon-hyun

It's a nice improvement. I expected many generated query plan changes in the test case, but there is no change from the existing generated plan. Is there any reason, @peter-toth ?

peter-toth · 2026-03-26T17:13:20Z

> It's a nice improvement. I expected many generated query plan changes in the test case, but there is no change from the existing generated plan. Is there any reason, @peter-toth ?

We don't have any production-ready DSv2 file sources in Spark, so the generated test plans / expected outputs don't cover this feature either.

dongjoon-hyun · 2026-03-26T17:15:56Z

Got it~

dongjoon-hyun

+1, LGTM. Thank you, @peter-toth .

dongjoon-hyun · 2026-03-26T17:17:57Z

cc @cloud-fan , @szehon-ho , @aokolnychyi , @gengliangwang , too

peter-toth · 2026-03-26T17:20:04Z

Iceberg can benefit from the change.
I will add a follow-up improvement in the scope of SPARK-55715 to keep ordering even when we coalesce partitions, and once @anuragmantri's apache/iceberg#14948 is also merged it will be a major improvement.

peter-toth · 2026-03-26T20:23:10Z

Marked as draft for now. Let me double-check a few edge cases, as changing the reported ordering without the concept of a constant order, which would be safe to prepend to any ordering, can be problematic.
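One possible reading of that concern, sketched with a toy prefix check (the `satisfies` helper and string-based orderings are assumptions for illustration, not code from the PR): if a source already reports an ordering on `ts` and the key `id` is prepended, a requirement on `ts` alone stops matching by prefix, even though the data is still sorted by `ts` within each partition.

```scala
object PrependEdgeCase {
  // Toy prefix check over expression names; all orders are ascending.
  def satisfies(output: Seq[String], required: Seq[String]): Boolean =
    output.take(required.length) == required

  // Ordering reported by a hypothetical source before the change.
  val before: Seq[String] = Seq("ts")

  // Ordering after prepending the partition key expression `id`.
  val after: Seq[String] = "id" +: before

  // A downstream operator that needs rows sorted by `ts`.
  val required: Seq[String] = Seq("ts")
}
```

In this model `satisfies(before, required)` holds but `satisfies(after, required)` does not, while a requirement on `Seq("id")` is met only by `after`. An order marked as constant could be skipped during matching, which is presumably why it would be safe to prepend.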

Stale review.

dongjoon-hyun reviewed Mar 26, 2026

dongjoon-hyun previously approved these changes Mar 26, 2026

fix expected test output

4260f53

peter-toth force-pushed the SPARK-56241-outputordering-from-keyedpartitioning branch from 7946dce to 4260f53 Compare March 26, 2026 19:39

peter-toth marked this pull request as draft March 26, 2026 20:21

peter-toth wants to merge 2 commits into apache:master from peter-toth:SPARK-56241-outputordering-from-keyedpartitioning

2 participants
