Conversation
It looks like this and #18996 are aiming at similar goals but are taking different approaches. A big one is that #18996 only works with MSQ compaction and this one only works with non-MSQ compaction tasks. I am wondering if they can coexist. re: this piece,
#18996 deals with the replace issue by using the "upgrade" system that was introduced for concurrent replace (system from #14407, #15039, #15684). The segments that are not being compacted are carried through without modification ("upgraded"). It deals with the MultipleIntervalSegmentSpec issue by using a new feature in …
Thanks for pointing this out @gianm, I see that #18996 happens to fix compaction on the MSQ side, and that's pretty neat! I do not have much experience with MSQ, given that we are still using Druid v27 (yeah, it's old... but we are upgrading soon). On our production servers, we used this PR by writing a script that selects segments and issues minor compaction specs. There are further plans to incorporate segment selection into automatic compaction. I would like to ask: what is the direction for handling specific segments? I see that there are some discussions about …
Fixes #9712
Motivation
Submitting a compaction task with SpecificSegmentsSpec (a list of segment IDs) would cause Druid to lock, read, and rewrite all segments in the umbrella interval, defeating the purpose of targeting specific segments. This results in very long compaction tasks, since every segment in the interval is considered for compaction. With the changes introduced here, we can select multiple small segments to compact instead of processing all segments in the interval, reducing compaction time from ~3h to ~5min in our case.
Description
This PR adds support for minor compaction: the ability to compact a specific subset of segments within a time chunk rather than all segments in the interval. Previously, specifying a SpecificSegmentsSpec still caused the task to lock, read, and rewrite every segment in the umbrella interval.
The core problem spans multiple layers of the compaction pipeline:
- CompactionTask#findSegmentsToLock and all sub-task findSegmentsToLock() methods retrieve every segment in the umbrella interval via RetrieveUsedSegmentsAction, meaning the task acquires locks far broader than necessary.
- NativeCompactionRunner#createIoConfig always passes null for segmentIds to DruidInputSource, so the input source reads the full interval regardless of the input spec.
- retrieveRelevantTimelineHolders() uses SegmentTimeline.lookup(), which requires ONLY_COMPLETE partitions; a filtered subset of segments appears incomplete and will be silently excluded.
- CompactionTask.SegmentProvider#checkSegments with TIME_CHUNK lock granularity delegates to SpecificSegmentsSpec.validateSegments(), which requires an exact match between the spec's segments and all segments in the interval. This guarantees a failure when we give any proper subset of the interval's segments.

Changes and Explanations
dropExisting conflict guard
A constructor-level validation in CompactionTask now rejects the combination of SpecificSegmentsSpec with dropExisting = true, since dropExisting semantics replace all segments in the interval — directly contradicting minor compaction intent.
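For illustration, here is a minimal sketch of such a guard, assuming CompactionIOConfig exposes getInputSpec() and isDropExisting() getters (the exact exception type and message in the PR may differ):

```java
// Sketch only: reject SpecificSegmentsSpec combined with dropExisting=true at construction time.
if (ioConfig != null
    && ioConfig.getInputSpec() instanceof SpecificSegmentsSpec
    && ioConfig.isDropExisting()) {
  throw new IAE(
      "Cannot combine a 'segments' input spec with dropExisting=true: "
      + "dropExisting replaces every segment in the interval, which defeats minor compaction."
  );
}
```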
Segment filtering in lock acquisition
CompactionTask.findSegmentsToLock() now filters the result of RetrieveUsedSegmentsAction to only the segment IDs present in SpecificSegmentsSpec. The same filtering is applied in IndexTask, ParallelIndexSupervisorTask, and SinglePhaseSubTask via CTX_KEY_SPECIFIC_SEGMENTS_TO_COMPACT, propagated from NativeCompactionRunner#createContextForSubtask(). This follows the existing pattern of passing compaction metadata through CTX_KEY_APPENDERATOR_TRACKING_TASK_ID.
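As a rough illustration (not the exact diff), the filtering amounts to intersecting the used-segment lookup with the IDs from the spec; the RetrieveUsedSegmentsAction constructor shape and the specificSegmentsSpec.getSegments() accessor are assumptions here:

```java
// Sketch: restrict the segments considered for locking to the IDs listed in the spec.
final Collection<DataSegment> usedSegments = taskActionClient.submit(
    new RetrieveUsedSegmentsAction(getDataSource(), intervals)
);
final Set<String> requestedIds = new HashSet<>(specificSegmentsSpec.getSegments());
final List<DataSegment> segmentsToLock =
    usedSegments.stream()
                .filter(segment -> requestedIds.contains(segment.getId().toString()))
                .collect(Collectors.toList());
```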
CompactionTask.SegmentProvider caching for TIME_CHUNK granularity in checkSegments()

The intuition behind this approach is:
- SegmentProvider#findSegments is called first, followed by SegmentProvider#checkSegments.
- When findSegments is called, we do not yet know which lock granularity is being used.
- We therefore cache the result of findSegments as allSegmentsInInterval, then later use this field when we encounter a TIME_CHUNK lock granularity (see the sketch below).

Honestly, I am not too satisfied with how I approached this problem, owing to the fact that developers now need to keep a temporal relationship between findSegments and checkSegments. Would love to hear about any alternatives to this problem!
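A condensed sketch of that temporal coupling; field and method shapes are simplified for illustration, and filterToRequestedSegments() is a hypothetical helper, not code from the PR:

```java
// Inside CompactionTask.SegmentProvider (simplified sketch).
@Nullable
private List<DataSegment> allSegmentsInInterval;

List<DataSegment> findSegments(TaskActionClient actionClient) throws IOException
{
  final Collection<DataSegment> used = actionClient.submit(
      new RetrieveUsedSegmentsAction(dataSource, Collections.singletonList(umbrellaInterval))
  );
  // Lock granularity is not known yet, so remember the full interval for checkSegments().
  this.allSegmentsInInterval = ImmutableList.copyOf(used);
  return filterToRequestedSegments(allSegmentsInInterval); // hypothetical helper
}

void checkSegments(LockGranularity lockGranularity, List<DataSegment> latestSegments)
{
  if (lockGranularity == LockGranularity.TIME_CHUNK && allSegmentsInInterval != null) {
    // Validate the requested subset against the cached interval contents instead of
    // requiring the spec to list every segment in the interval.
  }
}
```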
Segment-ID-based input for DruidInputSource
NativeCompactionRunner#createIoConfig now detects SpecificSegmentsSpec and resolves the segment ID strings into WindowedSegmentId objects, passing them to DruidInputSource instead of the interval. DruidInputSource already supports this code path, but it was never wired up from the compaction side.
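Roughly, the resolution step could look like the sketch below; the variable names are illustrative, and the WindowedSegmentId(segmentId, intervals) constructor shape is assumed from its use by DruidInputSource:

```java
// Sketch: turn segment ID strings from SpecificSegmentsSpec into WindowedSegmentId objects.
final List<WindowedSegmentId> windowedSegmentIds = new ArrayList<>();
for (String idString : specificSegmentsSpec.getSegments()) {
  final SegmentId segmentId = SegmentId.tryParse(dataSource, idString);
  if (segmentId == null) {
    throw new IAE("Cannot parse segment id[%s]", idString);
  }
  windowedSegmentIds.add(
      new WindowedSegmentId(idString, Collections.singletonList(segmentId.getInterval()))
  );
}
```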
Timeline lookup with incomplete partitions

retrieveRelevantTimelineHolders() now calls lookupWithIncompletePartitions() (i.e. Partitions.INCOMPLETE_OK) when the input spec is SpecificSegmentsSpec. Without this, a filtered segment set that doesn't cover all partitions in the interval produces an empty timeline result and the compaction silently does nothing.
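A sketch of the conditional lookup, assuming timeline, inputSpec, and umbrellaInterval are in scope inside retrieveRelevantTimelineHolders():

```java
// Sketch: tolerate incomplete partition sets when only specific segments are selected.
final List<TimelineObjectHolder<String, DataSegment>> holders =
    inputSpec instanceof SpecificSegmentsSpec
    ? timeline.lookupWithIncompletePartitions(umbrellaInterval)   // Partitions.INCOMPLETE_OK
    : timeline.lookup(umbrellaInterval);                          // Partitions.ONLY_COMPLETE
```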
Compaction using MSQ engine
MSQ compaction is fundamentally incompatible with the minor compaction introduced by this change: it forces dropExisting = true, uses REPLACE ingestion mode (which acquires TIME_CHUNK locks covering the full interval), and queries via MultipleIntervalSegmentSpec. A validation check is added in MSQCompactionRunner.validateCompactionTask() to reject SpecificSegmentsSpec with an explicit error message rather than failing in an opaque way downstream.
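As a sketch, the rejection might look like the following; the CompactionConfigValidationResult.failure factory signature is an assumption:

```java
// Sketch: fail MSQ validation early when a SpecificSegmentsSpec is supplied.
if (compactionTask.getIoConfig().getInputSpec() instanceof SpecificSegmentsSpec) {
  return CompactionConfigValidationResult.failure(
      "MSQ: compaction of specific segments (SpecificSegmentsSpec) is not supported"
  );
}
```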
For compaction using MSQ, please see #18996.
Release note
Compaction tasks using SpecificSegmentsSpec (a list of segment IDs) now correctly compact only the specified segments instead of all segments in the umbrella interval. This feature is not supported by the MSQ compaction engine.
Key changed/added classes in this PR
- CompactionTask
- NativeCompactionRunner
- IndexTask
- ParallelIndexSupervisorTask
- SinglePhaseSubTask
- MSQCompactionRunner
- CompactionTaskTest / TaskLockHelperTest

This PR has: