Skip to content

Commit 1f07c6d

Browse files
authored
Replace R2 public bucket URLs with data.isamples.org custom domain (#79)
Codex verified all parquet endpoints return 206 with range requests and CORS headers.
1 parent 9b929b3 commit 1f07c6d

File tree

8 files changed

+18
-18
lines changed

8 files changed

+18
-18
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -58,10 +58,10 @@ All tutorials query parquet files hosted on Cloudflare R2:
5858

5959
```javascript
6060
// Wide format (recommended) - 280 MB, 20M rows
61-
const WIDE_URL = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet";
61+
const WIDE_URL = "https://data.isamples.org/isamples_202601_wide.parquet";
6262

6363
// Narrow format (advanced) - 850 MB, 106M rows
64-
const NARROW_URL = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202512_narrow.parquet";
64+
const NARROW_URL = "https://data.isamples.org/isamples_202512_narrow.parquet";
6565
```
6666

6767
## Related Repositories

scripts/profile_queries.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@
1717
import duckdb
1818

1919
# Data sources
20-
REMOTE_URL = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet"
20+
REMOTE_URL = "https://data.isamples.org/isamples_202601_wide.parquet"
2121
LOCAL_PATH = "/tmp/isamples_202601_wide.parquet"
2222

2323
# Sample geocode PIDs for point selection tests (will be populated from data)

tools/globe_capture.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -57,7 +57,7 @@
5757
viewer.camera.setView({ destination: globalRect });
5858

5959
// Load H3 cluster data from R2
60-
const R2 = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev";
60+
const R2 = "https://data.isamples.org";
6161

6262
async function loadData() {
6363
const JSDELIVR_BUNDLES = duckdb.getJsDelivrBundles();

tutorials/isamples_explorer.qmd

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,10 +26,10 @@ duckdbModule = import("https://cdn.jsdelivr.net/npm/@duckdb/duckdb-wasm@1.28.0/+
2626
```{ojs}
2727
//| code-fold: true
2828
// Data source configuration
29-
parquet_url = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet"
29+
parquet_url = "https://data.isamples.org/isamples_202601_wide.parquet"
3030
3131
// Pre-computed facet summaries (2KB - loads instantly)
32-
facet_summaries_url = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_facet_summaries.parquet"
32+
facet_summaries_url = "https://data.isamples.org/isamples_202601_facet_summaries.parquet"
3333
3434
// Source color scheme (consistent with iSamples conventions)
3535
SOURCE_COLORS = ({

tutorials/narrow_vs_wide_performance.qmd

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -64,10 +64,10 @@ import { DuckDBClient } from "https://cdn.jsdelivr.net/npm/@observablehq/duckdb@
6464
//| echo: false
6565
// Define parquet URLs - iSamples full dataset on Cloudflare R2 (all sources)
6666
// Updated 2026-01-14: Using Zenodo narrow/wide files on Cloudflare R2
67-
narrowUrl = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202512_narrow.parquet"
68-
wideUrl = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet"
69-
wideH3Url = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide_h3.parquet"
70-
summariesUrl = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_facet_summaries.parquet"
67+
narrowUrl = "https://data.isamples.org/isamples_202512_narrow.parquet"
68+
wideUrl = "https://data.isamples.org/isamples_202601_wide.parquet"
69+
wideH3Url = "https://data.isamples.org/isamples_202601_wide_h3.parquet"
70+
summariesUrl = "https://data.isamples.org/isamples_202601_facet_summaries.parquet"
7171
```
7272

7373
### Environment Info

tutorials/parquet_cesium_isamples_wide.qmd

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ Cesium.Ion.defaultAccessToken = 'eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJqdGkiOi
4545
//| echo: false
4646
viewof parquet_path = Inputs.text({
4747
label:"Source (iSamples Wide Format + H3)",
48-
value:"https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide_h3.parquet",
48+
value:"https://data.isamples.org/isamples_202601_wide_h3.parquet",
4949
placeholder: "URL or file:///path/to/file.parquet",
5050
width:"100%",
5151
submit:true
@@ -123,7 +123,7 @@ Download the file locally, then serve it:
123123

124124
```bash
125125
# Download the wide parquet file (~242MB)
126-
curl -O https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet
126+
curl -O https://data.isamples.org/isamples_202601_wide.parquet
127127

128128
# Serve it locally
129129
python3 -m http.server 8000

tutorials/progressive_globe.qmd

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -197,7 +197,7 @@ Cesium.Ion.defaultAccessToken = 'eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJqdGkiOi
197197
//| output: false
198198
199199
// === Constants ===
200-
R2_BASE = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev"
200+
R2_BASE = "https://data.isamples.org"
201201
h3_res4_url = `${R2_BASE}/isamples_202601_h3_summary_res4.parquet`
202202
h3_res6_url = `${R2_BASE}/isamples_202601_h3_summary_res6.parquet`
203203
h3_res8_url = `${R2_BASE}/isamples_202601_h3_summary_res8.parquet`

tutorials/zenodo_isamples_analysis.qmd

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -28,14 +28,14 @@ This tutorial demonstrates how to efficiently analyze large geospatial datasets
2828
## Dataset Information
2929

3030
**Primary dataset** (Jan 2026, H3-indexed):
31-
- **URL**: `https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide_h3.parquet`
31+
- **URL**: `https://data.isamples.org/isamples_202601_wide_h3.parquet`
3232
- **Size**: ~292 MB wide format with H3 indices, 6.7M MaterialSampleRecords (20M total rows)
3333
- **H3 columns**: Pre-computed `h3_res4`, `h3_res6`, `h3_res8` (BIGINT) for spatial grouping
3434
- **Sources**: SESAR (4.6M), OpenContext (1M), GEOME (605K), Smithsonian (322K)
3535
- **Hosting**: Cloudflare R2 with HTTP range request support
3636

3737
**Facet summaries** (2KB, instant):
38-
- **URL**: `https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_facet_summaries.parquet`
38+
- **URL**: `https://data.isamples.org/isamples_202601_facet_summaries.parquet`
3939
- **Schema**: `facet_type`, `facet_value`, `scheme`, `count`
4040

4141
**Note**: *Data was originally archived on Zenodo and is now served from Cloudflare R2 for better performance and reliability.*
@@ -88,18 +88,18 @@ topojson = require("topojson-client@3")
8888
// Dataset URLs - try multiple options for CORS compatibility
8989
// Primary: Cloudflare R2 (Jan 2026 wide format with H3 indices)
9090
parquet_urls = [
91-
'https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide_h3.parquet',
91+
'https://data.isamples.org/isamples_202601_wide_h3.parquet',
9292
9393
// Fallback: original wide format without H3
94-
'https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet',
94+
'https://data.isamples.org/isamples_202601_wide.parquet',
9595
9696
// Fallback: older versions
9797
'https://labs.dataunbound.com/docs/2025/07/isamples_export_2025_04_21_16_23_46_geo.parquet',
9898
'https://zenodo.org/api/records/15278211/files/isamples_export_2025_04_21_16_23_46_geo.parquet/content'
9999
]
100100
101101
// Pre-computed facet summaries (2KB - loads instantly)
102-
facet_summaries_url = 'https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_facet_summaries.parquet'
102+
facet_summaries_url = 'https://data.isamples.org/isamples_202601_facet_summaries.parquet'
103103
104104
// Test CORS and find working URL - with rate limiting protection
105105
working_parquet_url = {

0 commit comments

Comments
 (0)