A working todo app built by SAM in 14 minutes from a single markdown PRD. Every artifact — PRD, contracts, stories, source, tests, and the post-integration evidence Lens captured — is committed here so you can see exactly what SAM produces, without running anything.
This repo is the end-to-end output of one SAM plan-n-build run. The human wrote prd.md. SAM produced everything else.
prd.md # The input — one markdown file
sdocs/
├── architecture-ref.md # SAM's architectural decisions
├── contracts/ # Typed seams between stories
│ ├── api/ # Endpoint request/response shapes
│ ├── type/ # Shared TypeScript types
│ └── db/ # Repository interface
├── epics/EPIC-001-todo-management.md # One epic
├── stories/ # 6 stories (5 features + 1 integration)
└── evidence/EPIC-001/ # Lens captured this AFTER tests passed
├── screenshot-1-empty.png ...
├── screenshot-6-too-long-error.png
├── network.json # Every HTTP call + status
├── console.log # Browser console (clean run)
├── playwright/.../video.webm # Real Chrome session
└── report/index.html # Playwright trace viewer
client/ # React + TypeScript (Vite)
server/ # Fastify + TypeScript
tests/e2e/ # Playwright integration test
_sam/ # SAM agent and workflow definitions
| Phase | Time | Output |
|---|---|---|
| PRD → contracts + stories | ~4 min | 6 contracts, 1 epic, 6 stories |
| Stories 1–5 (RED → GREEN → REFACTOR) | ~9 min | 33 unit tests passing |
| Integration story + Lens evidence | ~1 min | video, 6 screenshots, clean network log |
| Total | 14 min | A working todo app, type-safe end-to-end |
Zero integration rework. No CORS surprise. No type drift between server and client. Contracts are designed before stories, so cross-file seams are typed and enforced by tsc.
sdocs/evidence/EPIC-001/— six screenshots from a real browsersdocs/evidence/EPIC-001/network.json— every HTTP call captured: GET 200 → POST 201 → PATCH 200 → DELETE 204sdocs/evidence/EPIC-001/console.log— clean browser console (no errors)sdocs/evidence/EPIC-001/report/— openindex.htmlin a browser for the full Playwright trace viewerprd.md— the only thing a human wrote
git clone https://github.com/sam-agents/example-todo.git
cd example-todo
npm install
npm run devVisit http://localhost:5174 — the same code path Lens recorded.
To run the same Playwright scenario Lens ran:
npx playwright install chromium
npm run test:e2eThe pipeline ran six agents in sequence:
- Atlas designed typed contracts before any story was written
- Titan wrote failing tests covering every acceptance criterion (RED)
- Dyna wrote the minimum code to make them pass (GREEN)
- Argus reviewed and refactored against quality standards (REFACTOR)
- The integration story re-tested the whole epic against real HTTP — no mocks
- Lens drove a real headless Chrome, captured video + screenshots + network + console as evidence
If a story drifts from a contract, TypeScript fails. If the app doesn't actually integrate, the integration story fails. If the UI is broken, Lens evidence shows it. The agents do not say "I tested it" — the files on disk say it.
npx sam-agentsSee the SAM repo for the full agent roster, workflows, and per-platform install instructions (Claude Code, Cursor, Gemini CLI, GitHub Copilot, Antigravity).
PRD in, working app out. That's SAM.
