Skip to content

Resync docs/v1 snapshot to current truth (5 stale entries)#17

Closed
mprammer wants to merge 1 commit into
mp/add-amazon-reviews-2023from
mp/refresh-docs-v1
Closed

Resync docs/v1 snapshot to current truth (5 stale entries)#17
mprammer wants to merge 1 commit into
mp/add-amazon-reviews-2023from
mp/refresh-docs-v1

Conversation

@mprammer

Copy link
Copy Markdown
Contributor

Regenerating docs/v1 for the Amazon dataset (#16) surfaced pre-existing drift between the tracked snapshot and current truth, which #16 deliberately held back to stay scoped. This resyncs it. BI-CommonGovernment picks up the richer description already in the authoritative sources.json. Open Food Facts (4,466,927 β†’ 4,517,492 rows) and OSM Germany Relations (889,712 β†’ 890,059) reflect fresher local rebuilds β€” new row counts, file sizes, and recorded row-group counts β€” and Spambase and uci-iris gain row-group / vortex metadata the committed snapshot was missing. All five verified against the on-disk parquet/vortex.

The Open Food Facts and uci-iris entries also update their pinned parquet/vortex sha256 to this machine's build, which is what publish and the strict-checksum loader path check against.

Stacked on #16 β€” retarget to develop once that lands.

πŸ€– Generated with Claude Code

Promote the drift that the Amazon dataset PR (#16) held back to stay scoped:
BI-CommonGovernment's authoritative sources.json description, fresher
Open Food Facts (4,466,927 -> 4,517,492 rows) and OSM Germany Relations
(889,712 -> 890,059) local builds with their sizes and row-group counts, and
recorded row-group / vortex metadata for Spambase and uci-iris. Verified
against the on-disk parquet/vortex.

Co-Authored-By: Claude <noreply@anthropic.com>
Signed-off-by: mprammer <martin@spiraldb.com>
@mprammer mprammer deleted the branch mp/add-amazon-reviews-2023 June 11, 2026 16:09
@mprammer mprammer closed this Jun 11, 2026
@mprammer mprammer deleted the mp/refresh-docs-v1 branch June 11, 2026 16:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant