A comprehensive autonomous development system powered by the Claude Agent SDK. Build, enhance, and manage multiple software projects simultaneously with a coordinated agent army.
- 13 Specialized Agents - Architect, Builder, Verifier, Reviewer, DevOps, and more
- Interactive TUI - Rich terminal interface with Linear integration
- Vector Embeddings - Semantic similarity search for agent memory
- Memory Dashboard - Visualize agent learning patterns and mistakes
- E2B Sandboxing - All code execution in isolated cloud VMs
- Multi-Project Support - Manage multiple projects simultaneously
# Windows
python tui.py
# Or double-click: tui.bat
# Linux/Mac
python tui.pyThe TUI provides:
- Project creation and management
- Real-time progress monitoring
- Agent memory dashboard
- Linear issue integration
python run_orchestrator.py
# Or double-click: run_orchestrator.batThis platform uses a Multi-Agent Orchestrator that coordinates 13 specialized agents:
| Agent | Role |
|---|---|
| Architect | Planning and design - analyzes requirements, designs architecture |
| Builder | Feature implementation - writes code, creates tests |
| Test Generator | Automated test creation with Context7 documentation lookup |
| Verifier | Quality assurance - ensures 100% completion with blocking subtasks |
| Reviewer | Code review - checks quality, security, performance |
| DevOps | CI/CD pipelines, containerization, cloud deployment |
| Documentation | API docs, user guides, technical specs |
| Reporter | Project status reports, sprint summaries |
| Analytics | Pattern analysis, bottleneck detection, optimization recommendations |
| Refactor | Code quality, technical debt reduction |
| Database | Schema design, migrations, query optimization |
| UI Design | UI/UX, WCAG accessibility validation, responsive design |
| E2B Sandbox | Secure sandboxed code execution |
All code execution runs in E2B cloud sandboxes - never on your local machine.
| Layer | Protection |
|---|---|
| E2B Sandbox | All bash commands execute in isolated cloud VMs |
| Hard-Fail | System refuses to run if E2B unavailable (no silent fallback) |
| Shell Injection | All paths escaped with shlex.quote() |
| Bash Blocking | Direct Bash tool blocked, redirects to E2B MCP tools |
| File Locking | Concurrent access protected with filelock |
# Install Claude Code CLI
npm install -g @anthropic-ai/claude-code
# Install Python dependencies
pip install -r requirements.txtCopy .env.example to .env and add your API keys:
# Required
CLAUDE_CODE_OAUTH_TOKEN=your-token # Run: claude setup-token
E2B_API_KEY=your-e2b-key # Get from: https://e2b.dev
# Recommended
LINEAR_API_KEY=your-linear-key # Get from: https://linear.app/settings/api
CONTEXT7_API_KEY=your-context7-key # Get from: https://context7.com
# Optional
ANTHROPIC_API_KEY=your-anthropic-key
MAX_CONCURRENT_AGENTS=10
AGENT_TIMEOUT=3600
DEFAULT_MODEL=claude-opus-4-5-20251101
# Project Configuration (for run_orchestrator.py)
PROJECT_NAME=My Project
PROJECT_PATH=./projects/default
SPEC_FILE=./prompts/app_spec.txt- Edit
prompts/app_spec.txtwith your project specification - Update
.envwith your project settings:PROJECT_NAME- Your project's namePROJECT_PATH- Where to create the projectSPEC_FILE- Path to your specification file
Or use the TUI (recommended) which lets you configure projects interactively.
claude --version
pip show claude-code-sdk
python verify_fixes.py # Verify all components- Subtask support with blocking mechanism
- Test coverage tracking per task
- Agent assignment tracking
- Automatic CHECKLIST.md generation
- Multi-project management
- Workload distribution and load balancing
- Project status tracking (active, paused, completed)
- Priority levels: CRITICAL > HIGH > MEDIUM > LOW
- Agent type matching
- Dependency management with fail-safe checks
- Automatic retry (up to 3 attempts)
- Pub/sub inter-agent communication
- Direct messaging to specific agents
- File-based persistence with locking and error handling
- Persistent markdown-based memory per agent
- Pattern learning from successes
- Mistake tracking to avoid repeated errors
- Vector embeddings for semantic similarity search
- Sentence-transformers integration (all-MiniLM-L6-v2)
- Lazy model loading for fast startup
- Cosine similarity search across patterns/mistakes
- NumPy-based storage for fast retrieval
- Rich console visualization of agent memory
- Pattern and mistake statistics
- Real-time memory inspection
- Cloud-based isolated execution
- Hard-fail if unavailable (no local fallback)
- Shell injection protection
Coding-Agent-Harness/
βββ tui.py # Interactive terminal interface
βββ tui.bat # Windows TUI launcher
βββ run_orchestrator.py # Orchestrator entry point
βββ run_orchestrator.bat # Windows orchestrator launcher
βββ orchestrator.py # Multi-agent coordinator
βββ client.py # Claude SDK + 10 MCP servers
βββ security.py # Bash blocking, E2B redirect
βββ verify_fixes.py # Component verification script
βββ pytest.ini # Test configuration
βββ core/
β βββ enhanced_checklist.py # Task tracking with subtasks
β βββ project_registry.py # Multi-project management
β βββ task_queue.py # Priority-based distribution
β βββ message_bus.py # Inter-agent communication
β βββ agent_memory.py # Learning and memory
β βββ embeddings.py # Vector embeddings for similarity
β βββ memory_dashboard.py # Rich console visualization
β βββ e2b_sandbox_manager.py # Sandboxed execution
βββ agents/
β βββ base_agent.py # Foundation class
β βββ architect_agent.py # Planning and design
β βββ builder_agent.py # Feature implementation
β βββ test_generator_agent.py # Test creation
β βββ verifier_agent.py # Quality assurance
β βββ reviewer_agent.py # Code review
β βββ devops_agent.py # Infrastructure
β βββ documentation_agent.py # Documentation
β βββ reporter_agent.py # Reports
β βββ analytics_agent.py # Pattern analysis
β βββ refactor_agent.py # Code quality
β βββ database_agent.py # Schema design
β βββ ui_design_agent.py # UI/UX design
βββ mcp_servers/
β βββ e2b/
β βββ e2b_mcp_server.py # E2B sandbox MCP server
βββ prompts/
β βββ app_spec.txt # Your application specification (edit this!)
β βββ initializer_prompt.md # Checklist creation prompt
β βββ coding_prompt.md # Task implementation prompt
βββ tests/ # Integration tests
β βββ test_embeddings.py # Embedding system tests
β βββ test_security.py # Security hook tests
βββ .env.example # Environment template
βββ requirements.txt # Python dependencies (pinned)
| Server | Purpose |
|---|---|
| E2B | Sandboxed command execution (CRITICAL) |
| Playwright | Browser automation and UI testing |
| Context7 | Documentation lookup |
| Filesystem | File operations |
| GitHub | Version control and PRs |
| Git | Local git operations |
| Memory | Knowledge graph memory |
| Sequential Thinking | Problem-solving |
| Fetch | Web content retrieval |
| Linear | Project management and issue tracking |
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β MULTI-AGENT ORCHESTRATOR β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Project Spec βββΊ Orchestrator βββΊ Agent Pool (13 agents) β
β β β
β ββββββββββββββ΄βββββββββββββ β
β βΌ βΌ β
β Task Queue Message Bus β
β (prioritized) (pub/sub) β
β β β β
β βββββββββββ¬ββββββββββββββββ β
β βΌ β
β ββββββββββββββββ β
β β E2B Sandbox β βββ All execution here β
β ββββββββββββββββ β
β β β
β βΌ β
β ββββββββββββββββ β
β β Checklist β βββ Progress tracking β
β β System β β
β ββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
"E2B_API_KEY not set"
Get an API key from https://e2b.dev and add to .env
"CLAUDE_CODE_OAUTH_TOKEN not set"
Run claude setup-token and add the token to .env
"SECURITY: Direct bash execution blocked"
This is expected! All commands must go through E2B sandbox.
The agent should use mcp__e2b__e2b_execute_command instead.
Verify security fixes:
python verify_fixes.pyMIT License - see LICENSE for details.