Skip to content

Use data.isamples.org for all parquet file URLs#79

Merged
rdhyee merged 1 commit intoisamplesorg:mainfrom
rdhyee:data-isamples-org-urls
Apr 3, 2026
Merged

Use data.isamples.org for all parquet file URLs#79
rdhyee merged 1 commit intoisamplesorg:mainfrom
rdhyee:data-isamples-org-urls

Conversation

@rdhyee
Copy link
Copy Markdown
Contributor

@rdhyee rdhyee commented Apr 3, 2026

Summary

  • Replaces all pub-a18234d962364c22a50c787b7ca09fa5.r2.dev references with data.isamples.org
  • 18 URL replacements across 8 files (tutorials, README, tools, scripts)
  • Cloudflare zone activated 2026-04-03, Worker route verified with range requests + CORS

Test plan

  • curl -I https://data.isamples.org/isamples_202601_wide.parquet returns 206
  • quarto preview and verify Interactive Explorer loads data
  • Check progressive globe tutorial still renders

Closes #75

🤖 Generated with Claude Code

Now that Cloudflare DNS delegation is active and the Worker route is
configured, use the human-readable data.isamples.org domain instead
of the opaque pub-a18234d962364c22a50c787b7ca09fa5.r2.dev URL.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@rdhyee rdhyee merged commit 1f07c6d into isamplesorg:main Apr 3, 2026
1 check passed
@rdhyee
Copy link
Copy Markdown
Contributor Author

rdhyee commented Apr 3, 2026

data.isamples.org File Index

File Size Description
isamples_202601_wide.parquet 278 MB Wide format — one row per entity (20M rows), all sources. Primary file for tutorials and Explorer.
isamples_202601_wide_h3.parquet 292 MB Wide format + H3 hex indices. Used by Cesium globe visualizations.
isamples_202512_narrow.parquet 820 MB Narrow/graph format — explicit nodes + edges (106M rows). For PQG traversals.
isamples_202601_samples_map_lite.parquet 60 MB Lightweight: pid, label, source, lat/lng, h3. For fast map rendering.
isamples_202601_sample_facets.parquet 25 MB Pre-computed facet values per sample. Powers Explorer filter counts.
isamples_202601_facet_summaries.parquet 2 KB Aggregate facet summary stats. Tiny — loads instantly for UI initialization.
isamples_202601_h3_summary_res4.parquet 0.6 MB H3 res4 clusters (~38K). Zoomed-out globe view + homepage animation.
isamples_202601_h3_summary_res6.parquet 1.6 MB H3 res6 clusters. Mid-zoom level.
isamples_202601_h3_summary_res8.parquet 2.4 MB H3 res8 clusters. Close-zoom detail.

Total: ~1.48 GB across 9 parquet files. All serve with HTTP 206 range requests and CORS headers — DuckDB-WASM compatible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Set up data.isamples.org custom domain for R2 parquet files

1 participant