[python][ray] Pin merge source table snapshot#8110

Open
QuakeWang wants to merge 1 commit into apache:master from QuakeWang:ray-merge-snapshot

Purpose

Ray merge-into already pins target reads to the base snapshot, but Paimon source tables were still normalized through read_paimon without an explicit snapshot. Because Ray Dataset execution is lazy, source planning could otherwise observe a later table snapshot than the one seen during merge preparation.

This PR captures the latest snapshot id for string source tables during _prepare and passes it to read_paimon, so the source side uses a stable snapshot throughout merge planning and execution.
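To illustrate why pinning matters under lazy execution, here is a minimal toy sketch. It does not use the pypaimon or Ray APIs: `ToyTable`, `lazy_read`, and every method on them are hypothetical stand-ins invented for this example. The only assumption carried over from the description is that a read without an explicit snapshot id resolves "latest" at execution time rather than at planning time.

```python
from typing import Callable, Dict, List, Optional


class ToyTable:
    """Hypothetical snapshot-versioned table (a stand-in, not the pypaimon API)."""

    def __init__(self) -> None:
        self._snapshots: Dict[int, List[int]] = {}
        self._latest = 0

    def commit(self, rows: List[int]) -> int:
        # Each commit produces a new immutable snapshot with the next id.
        self._latest += 1
        self._snapshots[self._latest] = list(rows)
        return self._latest

    def latest_snapshot_id(self) -> int:
        return self._latest

    def read(self, snapshot_id: Optional[int] = None) -> List[int]:
        # Without an explicit id, "latest" is resolved at the moment of the read.
        sid = self._latest if snapshot_id is None else snapshot_id
        return list(self._snapshots[sid])


def lazy_read(table: ToyTable, snapshot_id: Optional[int] = None) -> Callable[[], List[int]]:
    # Mimics a lazy dataset: nothing is read until the returned thunk is called.
    return lambda: table.read(snapshot_id)


source = ToyTable()
source.commit([1, 2])

# "Preparation" phase: build both plans before any data is read.
unpinned = lazy_read(source)
pinned = lazy_read(source, snapshot_id=source.latest_snapshot_id())

# A concurrent writer commits between preparation and execution.
source.commit([1, 2, 3])

# "Execution" phase: only the pinned plan still sees the prepared snapshot.
unpinned_rows = unpinned()
pinned_rows = pinned()
```

In this toy model the unpinned plan picks up the later commit, while the pinned plan keeps reading the snapshot captured during preparation, which is the behavior this PR aims for on the source side.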

Tests

  • python -m pytest paimon-python/pypaimon/tests/ray_data_evolution_merge_into_test.py -q

Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com>