Universal AI Development Platform

A comprehensive autonomous development system powered by the Claude Agent SDK. Build, enhance, and manage multiple software projects simultaneously with a coordinated agent army.

✨ Key Features

13 Specialized Agents - Architect, Builder, Verifier, Reviewer, DevOps, and more
Interactive TUI - Rich terminal interface with Linear integration
Vector Embeddings - Semantic similarity search for agent memory
Memory Dashboard - Visualize agent learning patterns and mistakes
E2B Sandboxing - All code execution in isolated cloud VMs
Multi-Project Support - Manage multiple projects simultaneously

🚀 Quick Start

Interactive TUI (Recommended)

# Windows
python tui.py
# Or double-click: tui.bat

# Linux/Mac
python tui.py

The TUI provides:

Project creation and management
Real-time progress monitoring
Agent memory dashboard
Linear issue integration

Orchestrator Mode

python run_orchestrator.py
# Or double-click: run_orchestrator.bat

🤖 Multi-Agent Orchestrator

This platform uses a Multi-Agent Orchestrator that coordinates 13 specialized agents:

Agent	Role
Architect	Planning and design - analyzes requirements, designs architecture
Builder	Feature implementation - writes code, creates tests
Test Generator	Automated test creation with Context7 documentation lookup
Verifier	Quality assurance - ensures 100% completion with blocking subtasks
Reviewer	Code review - checks quality, security, performance
DevOps	CI/CD pipelines, containerization, cloud deployment
Documentation	API docs, user guides, technical specs
Reporter	Project status reports, sprint summaries
Analytics	Pattern analysis, bottleneck detection, optimization recommendations
Refactor	Code quality, technical debt reduction
Database	Schema design, migrations, query optimization
UI Design	UI/UX, WCAG accessibility validation, responsive design
E2B Sandbox	Secure sandboxed code execution

🔒 Security Model

All code execution runs in E2B cloud sandboxes - never on your local machine.

Layer	Protection
E2B Sandbox	All bash commands execute in isolated cloud VMs
Hard-Fail	System refuses to run if E2B unavailable (no silent fallback)
Shell Injection	All paths escaped with `shlex.quote()`
Bash Blocking	Direct Bash tool blocked, redirects to E2B MCP tools
File Locking	Concurrent access protected with `filelock`

📋 Prerequisites

1. Install Dependencies

# Install Claude Code CLI
npm install -g @anthropic-ai/claude-code

# Install Python dependencies
pip install -r requirements.txt

2. Configure Environment

Copy .env.example to .env and add your API keys:

# Required
CLAUDE_CODE_OAUTH_TOKEN=your-token    # Run: claude setup-token
E2B_API_KEY=your-e2b-key              # Get from: https://e2b.dev

# Recommended
LINEAR_API_KEY=your-linear-key        # Get from: https://linear.app/settings/api
CONTEXT7_API_KEY=your-context7-key    # Get from: https://context7.com

# Optional
ANTHROPIC_API_KEY=your-anthropic-key
MAX_CONCURRENT_AGENTS=10
AGENT_TIMEOUT=3600
DEFAULT_MODEL=claude-opus-4-5-20251101

# Project Configuration (for run_orchestrator.py)
PROJECT_NAME=My Project
PROJECT_PATH=./projects/default
SPEC_FILE=./prompts/app_spec.txt

3. Configure Your Project

Edit prompts/app_spec.txt with your project specification
Update .env with your project settings:
- PROJECT_NAME - Your project's name
- PROJECT_PATH - Where to create the project
- SPEC_FILE - Path to your specification file

Or use the TUI (recommended) which lets you configure projects interactively.

4. Verify Installation

claude --version
pip show claude-code-sdk
python verify_fixes.py  # Verify all components

🏗️ Core Infrastructure

1. Enhanced Checklist Manager (`core/enhanced_checklist.py`)

Subtask support with blocking mechanism
Test coverage tracking per task
Agent assignment tracking
Automatic CHECKLIST.md generation

2. Project Registry (`core/project_registry.py`)

Multi-project management
Workload distribution and load balancing
Project status tracking (active, paused, completed)

3. Task Queue (`core/task_queue.py`)

Priority levels: CRITICAL > HIGH > MEDIUM > LOW
Agent type matching
Dependency management with fail-safe checks
Automatic retry (up to 3 attempts)

4. Message Bus (`core/message_bus.py`)

Pub/sub inter-agent communication
Direct messaging to specific agents
File-based persistence with locking and error handling

5. Agent Memory (`core/agent_memory.py`)

Persistent markdown-based memory per agent
Pattern learning from successes
Mistake tracking to avoid repeated errors
Vector embeddings for semantic similarity search

6. Embedding Manager (`core/embeddings.py`)

Sentence-transformers integration (all-MiniLM-L6-v2)
Lazy model loading for fast startup
Cosine similarity search across patterns/mistakes
NumPy-based storage for fast retrieval

7. Memory Dashboard (`core/memory_dashboard.py`)

Rich console visualization of agent memory
Pattern and mistake statistics
Real-time memory inspection

8. E2B Sandbox Manager (`core/e2b_sandbox_manager.py`)

Cloud-based isolated execution
Hard-fail if unavailable (no local fallback)
Shell injection protection

📁 Project Structure

Coding-Agent-Harness/
├── tui.py                    # Interactive terminal interface
├── tui.bat                   # Windows TUI launcher
├── run_orchestrator.py       # Orchestrator entry point
├── run_orchestrator.bat      # Windows orchestrator launcher
├── orchestrator.py           # Multi-agent coordinator
├── client.py                 # Claude SDK + 10 MCP servers
├── security.py               # Bash blocking, E2B redirect
├── verify_fixes.py           # Component verification script
├── pytest.ini                # Test configuration
├── core/
│   ├── enhanced_checklist.py # Task tracking with subtasks
│   ├── project_registry.py   # Multi-project management
│   ├── task_queue.py         # Priority-based distribution
│   ├── message_bus.py        # Inter-agent communication
│   ├── agent_memory.py       # Learning and memory
│   ├── embeddings.py         # Vector embeddings for similarity
│   ├── memory_dashboard.py   # Rich console visualization
│   └── e2b_sandbox_manager.py # Sandboxed execution
├── agents/
│   ├── base_agent.py         # Foundation class
│   ├── architect_agent.py    # Planning and design
│   ├── builder_agent.py      # Feature implementation
│   ├── test_generator_agent.py # Test creation
│   ├── verifier_agent.py     # Quality assurance
│   ├── reviewer_agent.py     # Code review
│   ├── devops_agent.py       # Infrastructure
│   ├── documentation_agent.py # Documentation
│   ├── reporter_agent.py     # Reports
│   ├── analytics_agent.py    # Pattern analysis
│   ├── refactor_agent.py     # Code quality
│   ├── database_agent.py     # Schema design
│   └── ui_design_agent.py    # UI/UX design
├── mcp_servers/
│   └── e2b/
│       └── e2b_mcp_server.py # E2B sandbox MCP server
├── prompts/
│   ├── app_spec.txt          # Your application specification (edit this!)
│   ├── initializer_prompt.md # Checklist creation prompt
│   └── coding_prompt.md      # Task implementation prompt
├── tests/                    # Integration tests
│   ├── test_embeddings.py    # Embedding system tests
│   └── test_security.py      # Security hook tests
├── .env.example              # Environment template
└── requirements.txt          # Python dependencies (pinned)

🔌 MCP Servers (10 Total)

Server	Purpose
E2B	Sandboxed command execution (CRITICAL)
Playwright	Browser automation and UI testing
Context7	Documentation lookup
Filesystem	File operations
GitHub	Version control and PRs
Git	Local git operations
Memory	Knowledge graph memory
Sequential Thinking	Problem-solving
Fetch	Web content retrieval
Linear	Project management and issue tracking

Workflow

┌─────────────────────────────────────────────────────────────┐
│                 MULTI-AGENT ORCHESTRATOR                     │
├─────────────────────────────────────────────────────────────┤
│  Project Spec ──► Orchestrator ──► Agent Pool (13 agents)   │
│                        │                                     │
│           ┌────────────┴────────────┐                       │
│           ▼                         ▼                       │
│     Task Queue              Message Bus                     │
│     (prioritized)           (pub/sub)                       │
│           │                         │                       │
│           └─────────┬───────────────┘                       │
│                     ▼                                       │
│              ┌──────────────┐                               │
│              │ E2B Sandbox  │  ◄── All execution here       │
│              └──────────────┘                               │
│                     │                                       │
│                     ▼                                       │
│              ┌──────────────┐                               │
│              │  Checklist   │  ◄── Progress tracking        │
│              │   System     │                               │
│              └──────────────┘                               │
└─────────────────────────────────────────────────────────────┘

Troubleshooting

"E2B_API_KEY not set" Get an API key from https://e2b.dev and add to .env

"CLAUDE_CODE_OAUTH_TOKEN not set" Run claude setup-token and add the token to .env

"SECURITY: Direct bash execution blocked" This is expected! All commands must go through E2B sandbox. The agent should use mcp__e2b__e2b_execute_command instead.

Verify security fixes:

python verify_fixes.py

License

MIT License - see LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Universal AI Development Platform

✨ Key Features

🚀 Quick Start

Interactive TUI (Recommended)

Orchestrator Mode

🤖 Multi-Agent Orchestrator

🔒 Security Model

📋 Prerequisites

1. Install Dependencies

2. Configure Environment

3. Configure Your Project

4. Verify Installation

🏗️ Core Infrastructure

1. Enhanced Checklist Manager (`core/enhanced_checklist.py`)

2. Project Registry (`core/project_registry.py`)

3. Task Queue (`core/task_queue.py`)

4. Message Bus (`core/message_bus.py`)

5. Agent Memory (`core/agent_memory.py`)

6. Embedding Manager (`core/embeddings.py`)

7. Memory Dashboard (`core/memory_dashboard.py`)

8. E2B Sandbox Manager (`core/e2b_sandbox_manager.py`)

📁 Project Structure

🔌 MCP Servers (10 Total)

Workflow

Troubleshooting

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
agents		agents
config		config
core		core
mcp_servers/e2b		mcp_servers/e2b
prompts		prompts
tests		tests
utils		utils
.env.example		.env.example
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
agent.py		agent.py
checklist_manager.py		checklist_manager.py
client.py		client.py
linear_config.py		linear_config.py
orchestrator.py		orchestrator.py
progress.py		progress.py
prompts.py		prompts.py
pytest.ini		pytest.ini
requirements.txt		requirements.txt
run_orchestrator.bat		run_orchestrator.bat
run_orchestrator.py		run_orchestrator.py
security.py		security.py
setup_checklist.py		setup_checklist.py
tui.bat		tui.bat
tui.py		tui.py
verify_fixes.py		verify_fixes.py
verify_orchestrator.py		verify_orchestrator.py

Folders and files

Latest commit

History

Repository files navigation

Universal AI Development Platform

✨ Key Features

🚀 Quick Start

Interactive TUI (Recommended)

Orchestrator Mode

🤖 Multi-Agent Orchestrator

🔒 Security Model

📋 Prerequisites

1. Install Dependencies

2. Configure Environment

3. Configure Your Project

4. Verify Installation

🏗️ Core Infrastructure

1. Enhanced Checklist Manager (core/enhanced_checklist.py)

2. Project Registry (core/project_registry.py)

3. Task Queue (core/task_queue.py)

4. Message Bus (core/message_bus.py)

5. Agent Memory (core/agent_memory.py)

6. Embedding Manager (core/embeddings.py)

7. Memory Dashboard (core/memory_dashboard.py)

8. E2B Sandbox Manager (core/e2b_sandbox_manager.py)

📁 Project Structure

🔌 MCP Servers (10 Total)

Workflow

Troubleshooting

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. Enhanced Checklist Manager (`core/enhanced_checklist.py`)

2. Project Registry (`core/project_registry.py`)

3. Task Queue (`core/task_queue.py`)

4. Message Bus (`core/message_bus.py`)

5. Agent Memory (`core/agent_memory.py`)

6. Embedding Manager (`core/embeddings.py`)

7. Memory Dashboard (`core/memory_dashboard.py`)

8. E2B Sandbox Manager (`core/e2b_sandbox_manager.py`)

Packages