HIVE-29367: prevent Long overflows in ConvertJoinMapJoin #6237

konstantinb · 2025-12-16T17:08:04Z

What changes were proposed in this pull request?

HIVE-29367: fixing overflows in ConvertJoinMapJoin calculations

Why are the changes needed?

ConvertJoinMapJoin does not use StatsUtils.safeAdd()/saveMult() for all its calculations. There are some real life scenarios when it could perform a catastrophic decision to convert a join to a mapjoin after calculating negative size for the 'small" table, resulting in an OOM during query processing

Does this PR introduce any user-facing change?

No

How was this patch tested?

Via unit testing and with load testing on a custom Hive installation based of 4.0x version

You can see the test output generated by the pre-fix code here:
it clearly confirms the decision of perform a mapjoin despite very large volume of data

sonarqubecloud · 2025-12-18T20:16:37Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

deniskuzZ · 2026-01-15T12:24:15Z

ql/src/java/org/apache/hadoop/hive/ql/optimizer/ConvertJoinMapJoin.java

      }
      Operator<? extends OperatorDesc> parentOp = joinOp.getParentOperators().get(pos);
-      totalSize += computeOnlineDataSize(parentOp.getStatistics());
+      totalSize = StatsUtils.safeAdd(totalSize, computeOnlineDataSize(parentOp.getStatistics()));


I'm not sure it's appropriate to use safeAdd for a table size?
on the other side hashTableDataSizeAdjustment does that as well, so I guess it's fine
cc @zabetak, @thomasrebele

I think it's fine. The total size here does not need to be 100% correct, it's just an estimation that influences the join decision. Might make sense to rename it to estimatedTotalSize.

deniskuzZ · 2026-01-15T12:28:05Z

ql/src/java/org/apache/hadoop/hive/ql/optimizer/ConvertJoinMapJoin.java

      if (cs != null) {
        String colTypeLowerCase = cs.getColumnType().toLowerCase();
-        long nonNullCount = cs.getNumNulls() > 0 ? numRows - cs.getNumNulls() + 1 : numRows;
+        long nonNullCount = cs.getNumNulls() > 0 ? Math.max(1L, numRows - cs.getNumNulls() + 1) : numRows;


maybe Math.max(0L, numRows - cs.getNumNulls()) + 1

deniskuzZ · 2026-01-15T12:40:39Z

@konstantinb, do we need same for
long llapMaxSize = (long) (maxSize + (maxSize * overSubscriptionFactor * slotsPerQuery))

konstantinb · 2026-01-17T15:42:35Z

I'm away until the end of the next week. I will respond to review comments when I'm back. Thank you.

konstantinb · 2026-01-26T18:40:57Z

@konstantinb, do we need same for long llapMaxSize = (long) (maxSize + (maxSize * overSubscriptionFactor * slotsPerQuery))

@deniskuzZ, from my analysis, all elements of this calculation are directly derived from Hive configuration settings. I believe that overflow could only occur with (currently) extremely improbable configuration settings, such as 1TB of RAM for the mapjoin conversion threshold, overSubscriptionFactor of 1000 and slotsPerQuery of 8390

I realize that huge amounts of RAM could become available sooner rather than later. At the same time, since this functionality is not data-driven but purely config-driven, should it be better considered a separate fix?

sonarqubecloud · 2026-01-27T01:52:59Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

HIVE-29367: preventing Long overflows in ConvertJoinMapJoin

fafafe8

asf-ci-hive added tests pending tests unstable and removed tests pending labels Dec 16, 2025

HIVE-29367: attempt a rebuild

b8bb1e9

asf-ci-hive added tests pending tests passed and removed tests unstable tests pending labels Dec 17, 2025

HIVE-29367: quality gate feedback

eee1347

asf-ci-hive added tests pending and removed tests passed labels Dec 18, 2025

konstantinb changed the title ~~HIVE-29367: preventing Long overflows in ConvertJoinMapJoin~~ HIVE-29367: prevent Long overflows in ConvertJoinMapJoin Dec 18, 2025

konstantinb marked this pull request as ready for review December 18, 2025 19:18

asf-ci-hive added tests passed and removed tests pending labels Dec 18, 2025

deniskuzZ reviewed Jan 15, 2026

View reviewed changes

HIVE-29367: addressing PR feedback

72535de

asf-ci-hive added tests pending and removed tests passed labels Jan 26, 2026

konstantinb requested review from deniskuzZ and thomasrebele January 26, 2026 18:41

asf-ci-hive added tests unstable and removed tests pending labels Jan 26, 2026

HIVE-29367: attempt a rebuild

a959a99

asf-ci-hive added tests pending and removed tests unstable labels Jan 27, 2026

asf-ci-hive added tests passed and removed tests pending labels Jan 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HIVE-29367: prevent Long overflows in ConvertJoinMapJoin #6237

HIVE-29367: prevent Long overflows in ConvertJoinMapJoin #6237

konstantinb commented Dec 16, 2025 •

edited

Loading

Uh oh!

sonarqubecloud bot commented Dec 18, 2025

Uh oh!

deniskuzZ Jan 15, 2026 •

edited

Loading

Uh oh!

thomasrebele Jan 15, 2026

Uh oh!

deniskuzZ Jan 15, 2026 •

edited

Loading

Uh oh!

deniskuzZ commented Jan 15, 2026

Uh oh!

konstantinb commented Jan 17, 2026

Uh oh!

konstantinb commented Jan 26, 2026

Uh oh!

sonarqubecloud bot commented Jan 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

HIVE-29367: prevent Long overflows in ConvertJoinMapJoin #6237

Are you sure you want to change the base?

HIVE-29367: prevent Long overflows in ConvertJoinMapJoin #6237

Conversation

konstantinb commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

sonarqubecloud bot commented Dec 18, 2025

Quality Gate passed

Uh oh!

deniskuzZ Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

thomasrebele Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

deniskuzZ Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

deniskuzZ commented Jan 15, 2026

Uh oh!

konstantinb commented Jan 17, 2026

Uh oh!

konstantinb commented Jan 26, 2026

Uh oh!

sonarqubecloud bot commented Jan 27, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

konstantinb commented Dec 16, 2025 •

edited

Loading

deniskuzZ Jan 15, 2026 •

edited

Loading

deniskuzZ Jan 15, 2026 •

edited

Loading