#!/usr/bin/env node

/**
 * Offline token decoder for Knowledge Mapper response collection.
 *
 * Reads tokens from a CSV or JSON file (exported from the Google Sheet)
 * and decodes each into structured response data.
 *
 * Usage:
 *   node scripts/decode-tokens.js --input tokens.csv --format csv > decoded.csv
 *   node scripts/decode-tokens.js --input tokens.json --format json > decoded.json
 *
 * Input CSV format (from Google Sheet):
 *   Timestamp, Session ID, Token, Response Count, Domain
 *
 * Output includes: session_id, timestamp, question_id, is_correct, is_skipped
 */

import { readFileSync, readdirSync } from 'fs';
import { resolve, dirname } from 'path';
import { fileURLToPath } from 'url';
import { inflate } from 'pako';

const __dirname = dirname(fileURLToPath(import.meta.url));

// ── Inline token decoder (avoids importing browser-only modules) ───────

function base64urlToBytes(str) {
  const b64 = str.replace(/-/g, '+').replace(/_/g, '/');
  const pad = (4 - (b64.length % 4)) % 4;
  const padded = b64 + '='.repeat(pad);
  const binary = atob(padded); // atob is a global in Node 16+
  const bytes = new Uint8Array(binary.length);
  for (let i = 0; i < binary.length; i++) bytes[i] = binary.charCodeAt(i);
  return bytes;
}

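// Token wire format, as inferred from the parser below (not a normative
// spec; the field names here are descriptive only):
//
//   byte 0       version
//   bytes 1..2   entry count, big-endian uint16
//   bytes 3+     one 3-byte record per entry:
//                  uint16 question index (big-endian), then a status byte
//                  (2 = correct, 1 = skipped; anything else reads as incorrect)
//
// The whole buffer is raw-deflate compressed, then base64url-encoded.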
function decodeTokenRaw(base64urlString) {
  try {
    const compressed = base64urlToBytes(base64urlString);
    const bytes = inflate(compressed, { raw: true });
    if (bytes.length < 3) return null; // too short to hold the header

    const version = bytes[0];
    const count = (bytes[1] << 8) | bytes[2]; // big-endian uint16
    const entries = [];

    for (let i = 0; i < count; i++) {
      const offset = 3 + i * 3; // 3-byte header, then 3 bytes per entry
      if (offset + 2 >= bytes.length) break; // truncated token: stop early
      const index = (bytes[offset] << 8) | bytes[offset + 1];
      const value = bytes[offset + 2];
      entries.push({
        index,
        is_correct: value === 2,
        is_skipped: value === 1,
      });
    }

    return { version, entries };
  } catch (err) {
    console.error('[decoder] Failed to decode token:', err.message);
    return null;
  }
}
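
// Example shape (hypothetical token): a successful decode returns
//   { version: 1, entries: [{ index: 42, is_correct: true, is_skipped: false }, ...] }
// and null on any failure, so callers must handle both cases.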

// ── Question index builder ─────────────────────────────────────────────

async function loadQuestionIndex() {
  // Load all domain bundles and merge questions (matching browser boot flow)
  const dataDir = resolve(__dirname, '..', 'data', 'domains');
  const files = readdirSync(dataDir).filter(f => f.endsWith('.json') && f !== 'all.json');

  const allQuestions = new Map();

  // Load all.json first (boot bundle with 50 questions)
  const allBundle = JSON.parse(readFileSync(resolve(dataDir, 'all.json'), 'utf-8'));
  for (const q of allBundle.questions) allQuestions.set(q.id, q);

  // Load all domain bundles to get the full 2500 questions
  for (const file of files) {
    try {
      const bundle = JSON.parse(readFileSync(resolve(dataDir, file), 'utf-8'));
      if (bundle.questions) {
        for (const q of bundle.questions) allQuestions.set(q.id, q);
      }
    } catch { /* skip malformed files */ }
  }

  // Sort deterministically — must match buildIndex() in question-index.js exactly.
  // Uses < / > comparison (code-point order), NOT localeCompare: e.g. 'B' < 'a'
  // by code point, while localeCompare would typically put 'a' first.
  const sorted = [...allQuestions.values()].sort((a, b) => {
    const da = a.domain_ids?.[0] || '';
    const db = b.domain_ids?.[0] || '';
    if (da < db) return -1;
    if (da > db) return 1;
    if (a.id < b.id) return -1;
    if (a.id > b.id) return 1;
    return 0;
  });

  const indexToQuestion = new Map();
  sorted.forEach((q, i) => indexToQuestion.set(i, q));

  return indexToQuestion;
}
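
// Token entries reference questions by position in this sorted order:
// indexToQuestion.get(0) is the first question (by code-point order) of the
// first domain, and so on. If the bundles on disk drift from what the browser
// served when the token was minted, indices may map to the wrong questions.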

// ── Input parsing ──────────────────────────────────────────────────────

function parseCSVInput(content) {
  const lines = content.trim().split('\n');
  // Naive comma split: assumes no field contains a comma (base64url tokens
  // never do; check your sheet's timestamp format before relying on this).
  const records = [];
  for (let i = 1; i < lines.length; i++) { // i = 1 skips the header row
    const parts = lines[i].split(',').map(s => s.trim());
    if (parts.length < 3) continue;
    records.push({
      timestamp: parts[0],
      session_id: parts[1],
      token: parts[2],
      response_count: parseInt(parts[3], 10) || 0,
    });
  }
  return records;
}
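
// Expected CSV input, per the header comment (the data row is hypothetical):
//   Timestamp,Session ID,Token,Response Count,Domain
//   2024-01-15 09:30:00,abc123,pVxTy...,12,all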

function parseJSONInput(content) {
  const data = JSON.parse(content);
  return Array.isArray(data) ? data : [data];
}
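
// JSON input is expected to be an array of records (or a single record) with
// the same fields the CSV path produces, e.g. (hypothetical values):
//   [{ "timestamp": "2024-01-15 09:30:00", "session_id": "abc123",
//      "token": "pVxTy...", "response_count": 12 }]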

// ── Main ───────────────────────────────────────────────────────────────

async function main() {
  const args = process.argv.slice(2);
  let inputFile = null;
  let outputFormat = 'csv';

  for (let i = 0; i < args.length; i++) {
    if (args[i] === '--input' && args[i + 1]) inputFile = args[++i];
    else if (args[i] === '--format' && args[i + 1]) outputFormat = args[++i];
    else if (args[i] === '--help') {
      console.log('Usage: node scripts/decode-tokens.js --input <file> --format <csv|json>');
      process.exit(0);
    }
  }

  if (!inputFile) {
    console.error('Error: --input <file> is required');
    process.exit(1);
  }

  const content = readFileSync(resolve(inputFile), 'utf-8');
  // Input format is detected from the file extension; --format only controls output.
  const isJSON = inputFile.endsWith('.json');
  const records = isJSON ? parseJSONInput(content) : parseCSVInput(content);

  console.error('[decoder] Loading question index...');
  const indexToQuestion = await loadQuestionIndex();
  console.error(`[decoder] Loaded ${indexToQuestion.size} questions`);
  console.error(`[decoder] Decoding ${records.length} tokens...`);

  const decoded = [];

  for (const record of records) {
    const result = decodeTokenRaw(record.token);
    if (!result) {
      console.error(`[decoder] Failed to decode token from session ${record.session_id}`);
      continue;
    }

    for (const entry of result.entries) {
      const q = indexToQuestion.get(entry.index);
      decoded.push({
        session_id: record.session_id,
        timestamp: record.timestamp,
        question_index: entry.index,
        question_id: q?.id || `unknown_${entry.index}`,
        domain: q?.domain_ids?.[0] || 'unknown',
        question_text: q?.question_text || '',
        correct_answer: q?.options?.[q.correct_answer] || '',
        is_correct: entry.is_correct,
        is_skipped: entry.is_skipped,
      });
    }
  }

  // Output
  if (outputFormat === 'json') {
    console.log(JSON.stringify(decoded, null, 2));
  } else {
    // CSV: free-text fields are quoted, with inner quotes doubled
    console.log('session_id,timestamp,domain,question_index,question_id,question_text,correct_answer,is_correct,is_skipped');
    for (const row of decoded) {
      const text = `"${(row.question_text || '').replace(/"/g, '""')}"`;
      const answer = `"${(row.correct_answer || '').replace(/"/g, '""')}"`;
      console.log(`${row.session_id},${row.timestamp},${row.domain},${row.question_index},${row.question_id},${text},${answer},${row.is_correct},${row.is_skipped}`);
    }
  }

  console.error(`[decoder] Decoded ${decoded.length} responses from ${records.length} tokens`);
}

main().catch(err => {
  console.error('Fatal:', err);
  process.exit(1);
});