HIVE-29598: Fix vectorized outer join wrong results due to stale scratch column values by ryukobayashi · Pull Request #6486 · apache/hive

ryukobayashi · 2026-05-15T06:58:16Z

What changes were proposed in this pull request?

In vectorized outer join, generateOuterNulls() and generateOuterNullsRepeatedAll() set isNull[i] = true on scratch columns but leave vector[i] untouched. When hive.vectorized.reuse.scratch.columns=true (the default), a scratch column slot
freed after an expression evaluation (e.g. CastStringToLong) can be reused for the outer join's null-marking column. After reset() clears isNull[], the expression overwrites vector[i] with a fresh value (e.g. 2025). Later, generateOuterNulls()
sets isNull[i] = true without clearing vector[i], leaving a stale non-zero value.

Downstream operators such as ColOrCol read vector[i] directly to distinguish "false" (== 0) from "null" (!= 0). The stale value causes null rows to be misinterpreted as "true", producing wrong OR/AND/CASE WHEN results.

The fix adds clearVectorValue(), called whenever isNull[i] is set to true in the outer join null-marking paths, zeroing vector[i] for all supported column vector types (LongColumnVector, DoubleColumnVector, BytesColumnVector,
TimestampColumnVector, IntervalDayTimeColumnVector).

Why are the changes needed?

Without the fix, vectorized outer joins silently return wrong results when scratch column reuse is enabled (the default). The bug is non-obvious because it only triggers when a specific combination of conditions is met: a type-casting expression allocates a scratch column that is later reused for the outer join's null-marking column, and the join result is consumed by a boolean operator that reads the raw vector value for null discrimination. Users have no indication that results are wrong; workarounds require disabling vectorization entirely (hive.vectorization.enabled=false) or disabling scratch column reuse (hive.vectorized.reuse.scratch.columns=false), both of which carry a significant performance cost.

Does this PR introduce any user-facing change?

No

How was this patch tested?

I added qtest.

…tch column values

sonarqubecloud · 2026-05-15T08:32:02Z

Quality Gate passed

Issues
6 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

soumyakanti3578

Vectorization code can be tricky and brittle. To ensure there are no unintended consequences of this change, could you please add some tests? A minimal reproducer as a qtest is essential, and unit tests for the vector clearing operation across different ColumnVector types would also be valuable.

soumyakanti3578 · 2026-05-15T19:14:53Z

+  private static void clearVectorValue(ColumnVector colVector, int index) {
+    if (colVector instanceof LongColumnVector) {
+      ((LongColumnVector) colVector).vector[index] = 0L;
+    } else if (colVector instanceof DoubleColumnVector) {
+      ((DoubleColumnVector) colVector).vector[index] = 0.0;
+    } else if (colVector instanceof BytesColumnVector) {
+      BytesColumnVector bcv = (BytesColumnVector) colVector;
+      bcv.vector[index] = null;
+      bcv.start[index] = 0;
+      bcv.length[index] = 0;
+    } else if (colVector instanceof TimestampColumnVector) {
+      ((TimestampColumnVector) colVector).setNullValue(index);
+    } else if (colVector instanceof IntervalDayTimeColumnVector) {
+      ((IntervalDayTimeColumnVector) colVector).setNullValue(index);


Can you please confirm that these are the only ColumnVectors that can appear in smallTableValueColumnMap and outerSmallTableKeyColumnMap?
Maybe we should do the same for all ColumnVector types, and it would be better to do this in the individual classes instead of handling it here.

Thanks. I added qtest and unit test.
The investigation confirms that DecimalColumnVector can also appear (for DECIMAL columns without DECIMAL_64 physical variation), so we've added handling for it. Regarding moving the logic into individual classes: ColumnVector has 15 concrete subclasses including container types (StructColumnVector, ListColumnVector, MapColumnVector, UnionColumnVector) and VoidColumnVector, for which per-slot clearing has no well-defined semantics. Only two classes currently define setNullValue(). Adding an abstract method to the base class would be a breaking change to storage-api with disproportionate scope. We prefer to keep the dispatch self-contained in clearVectorValue where it directly addresses the bug.

…ch column values

…torValue

HIVE-29598: Fix vectorized outer join wrong results due to stale scra…

b067c9e

…tch column values

asf-ci-hive added the tests pending label May 15, 2026

asf-ci-hive added tests passed and removed tests pending labels May 15, 2026

soumyakanti3578 requested changes May 15, 2026

View reviewed changes

HIVE-29598: Add regression test for vectorized outer join stale scrat…

6b31424

…ch column values

asf-ci-hive added tests pending and removed tests passed labels May 18, 2026

HIVE-29598: Add unit tests and handle DecimalColumnVector in clearVec…

18801de

…torValue

asf-ci-hive added tests failed and removed tests pending labels May 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HIVE-29598: Fix vectorized outer join wrong results due to stale scratch column values#6486

HIVE-29598: Fix vectorized outer join wrong results due to stale scratch column values#6486
ryukobayashi wants to merge 3 commits into
apache:masterfrom
ryukobayashi:HIVE-29598

ryukobayashi commented May 15, 2026 •

edited

Loading

Uh oh!

sonarqubecloud Bot commented May 15, 2026

Uh oh!

soumyakanti3578 left a comment

Uh oh!

soumyakanti3578 May 15, 2026

Uh oh!

ryukobayashi May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ryukobayashi commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

sonarqubecloud Bot commented May 15, 2026

Quality Gate passed

Uh oh!

soumyakanti3578 left a comment

Choose a reason for hiding this comment

Uh oh!

soumyakanti3578 May 15, 2026

Choose a reason for hiding this comment

Uh oh!

ryukobayashi May 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ryukobayashi commented May 15, 2026 •

edited

Loading