PharmForge - AI-Powered Drug Discovery Platform 🧬

An open-source workflow orchestrator for computational drug discovery, powered by Claude Code Agent Orchestration System v2.

🎯 What Is PharmForge?

PharmForge connects 39 data sources and computational tools into end-to-end in silico drug discovery pipelines. It automates the computational workflow from target identification through lead optimization.

Key Features

39 Integrated Adapters - PubChem, ChEMBL, OpenTargets, UniProt, KEGG, STRING-DB, BioGRID, and 32 more
Natural Language Orchestration - Describe your pipeline in plain English
Multi-Objective Optimization - Pareto ranking for binding, ADMET, synthesis, and novelty
Reproducible Workflows - Lockfiles with version control and DOIs
GPU-Accelerated - AutoDock Vina, GNINA, DiffDock, OpenMM support
Production Ready - 38/39 adapters tested and validated
Open Source - MIT licensed, self-hostable
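
The Pareto ranking named in the feature list can be illustrated with a small sketch. This is not PharmForge's implementation. The objective names, the candidate scores, and the convention that every score is scaled so that higher is better are assumptions made for the example.

```python
from typing import Dict, List

Scores = Dict[str, float]  # hypothetical: objective name -> score, higher is better


def dominates(a: Scores, b: Scores) -> bool:
    """True if a is at least as good as b on every objective and strictly better on one."""
    return all(a[k] >= b[k] for k in a) and any(a[k] > b[k] for k in a)


def pareto_fronts(candidates: Dict[str, Scores]) -> List[List[str]]:
    """Group candidates into successive Pareto fronts; front 0 holds the non-dominated ones."""
    remaining = dict(candidates)
    fronts: List[List[str]] = []
    while remaining:
        front = sorted(
            name
            for name, scores in remaining.items()
            if not any(
                dominates(other_scores, scores)
                for other, other_scores in remaining.items()
                if other != name
            )
        )
        fronts.append(front)
        for name in front:
            del remaining[name]
    return fronts


# Made-up scores for three hypothetical molecules.
candidates = {
    "mol_a": {"binding": 0.9, "admet": 0.4, "synthesis": 0.7, "novelty": 0.5},
    "mol_b": {"binding": 0.6, "admet": 0.8, "synthesis": 0.6, "novelty": 0.6},
    "mol_c": {"binding": 0.5, "admet": 0.3, "synthesis": 0.5, "novelty": 0.4},
}
fronts = pareto_fronts(candidates)
print(fronts)
```

Here mol_a and mol_b trade binding against ADMET, so neither dominates the other and both land in the first front. mol_c is worse on every objective and falls to the second.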

AI Agent Architecture

PharmForge is built using the Claude Code Agent Orchestration System v2:

🧠 Claude (Orchestrator) - Manages the 200k context window, creates todos, delegates tasks
✍️ Coder Subagent - Implements features in clean context
👁️ Tester Subagent - Validates implementations with Playwright
🆘 Stuck Subagent - Escalates issues requiring human decision

🏗️ PharmForge Adapters (39 Total)

Molecular Databases (5)

PubChem - 110M+ compounds, properties, bioactivity
ChEMBL - 2.3M+ compounds with bioactivity data
BindingDB - 2.5M+ binding affinities
ZINC Fragments - Drug-like fragment library
DrugCentral - FDA-approved drugs and clinical data

Docking & Scoring (3)

AutoDock Vina - Fast molecular docking
GNINA - CNN-based scoring
DiffDock - ML-powered docking (GPU)

Molecular Generation (4)

REINVENT - RL-based generation
MolGAN - GAN-based generation
De Novo - Fragment-based design
RDKit Local - Chemistry toolkit

Retrosynthesis (2)

AiZynthFinder - Retrosynthesis planning
LLM Retrosynthesis - GPT-4-powered routes

ADMET & Toxicity (2)

ADMET-AI - MIT ADMET predictor
pkCSM - 28 ADMET properties

Target Prediction (2)

SwissTargetPrediction - Target identification
TargetNet - ML target prediction

Protein Structure (4)

AlphaFold - Structure prediction
RCSB PDB - Experimental structures
PDB-REDO - Re-refined structures
SWISS-MODEL - Homology modeling

Molecular Dynamics (1)

OpenMM - MD simulations (GPU)

Literature & Patents (5)

PubMed - 35M+ biomedical articles
Europe PMC - Life science literature
SureChEMBL - 17M+ patent chemicals
Google Patents - Patent search
Lens.org - Patent analytics

Clinical & Adverse Events (2)

ClinicalTrials.gov - 450k+ clinical trials
FDA FAERS - Adverse event reports

Pathway & Systems Biology (2)

Reactome - Biological pathways
KEGG - Pathway database

Gene Expression (2)

GTEx - Tissue expression
GEO - Gene expression datasets

Protein Interactions (2)

BioGRID - Protein-protein interactions
STRING-DB - Interaction networks

Target-Disease Associations (1)

OpenTargets - Target validation

Protein Information (1)

UniProt - Protein sequences and functions

Disease Information (1)

DisGeNET - Gene-disease associations

🚀 Quick Start

Prerequisites

Docker & Docker Compose (for containers)
Python 3.9+ (for local development)
NVIDIA GPU (optional, for GPU-accelerated adapters)
16GB+ RAM recommended

Installation

# Clone this repository
git clone https://github.com/your-org/pharmforge.git
cd pharmforge/claude-code-agents-wizard-v2

# Copy environment template
cp .env.example .env

# Edit .env with your API keys (most adapters work without keys)
# Required only for LLM retrosynthesis: OPENAI_API_KEY
# Optional: BIOGRID_ACCESS_KEY (free registration)

# Start all services
docker-compose up -d

# Check service health
curl http://localhost:8000/health

# Access the UI (open is macOS-specific; on Linux use xdg-open)
open http://localhost:8501
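
Containers can take a while to come up after docker-compose returns, so a single curl may fail even when nothing is wrong. The sketch below polls the health URL from the step above until it answers. It assumes the endpoint returns HTTP 200 when the backend is ready, which the README does not state; the function names and the injectable probe are hypothetical helpers, not part of PharmForge.

```python
import time
import urllib.error
import urllib.request
from typing import Callable


def http_status(url: str, timeout: float = 2.0) -> int:
    """Return the HTTP status code for url, or 0 if the service is unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as exc:
        return exc.code  # the server answered, but with an error status
    except (urllib.error.URLError, OSError):
        return 0  # connection refused, timeout, DNS failure, ...


def wait_until_healthy(
    url: str,
    attempts: int = 30,
    delay: float = 2.0,
    probe: Callable[[str], int] = http_status,
) -> bool:
    """Poll url until the probe reports 200 or the attempts run out."""
    for attempt in range(attempts):
        if probe(url) == 200:
            return True
        if attempt < attempts - 1:
            time.sleep(delay)
    return False


# Demo with a fake probe so the example runs without a live backend:
# unreachable, then 503, then healthy.
responses = iter([0, 503, 200])
ready = wait_until_healthy(
    "http://localhost:8000/health", delay=0.0, probe=lambda _url: next(responses)
)
print(ready)
```

Against a running stack, call wait_until_healthy("http://localhost:8000/health") with the default probe.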

Your First Pipeline

# Example: Find drug candidates for EGFR
python -c "
from backend.core.pipeline import Pipeline

pipeline = Pipeline()
results = pipeline.execute(
    query='Find EGFR inhibitors with good ADMET',
    limit=10
)

print(f'Found {len(results)} candidates')
for r in results[:3]:
    print(f'  {r.smiles}: Score {r.score:.2f}')
"

Running Tests

# Backend tests
docker-compose exec backend pytest

# Full integration test
python tests/e2e/test_full_pipeline.py

📊 Phase 3 Status (Current)

Timeline: Weeks 9-12 (Days 57-84)
Focus: Polish, Validation & Launch
Status: In Progress

Completed ✅

39 adapters implemented (38 production-ready)
5 new FREE adapters added (BioGRID, STRING-DB, GEO, pkCSM, KEGG)
Adapter testing and validation completed (38 of 39 production-ready)
GPU support enabled (RTX 5080)
Docker development environment ready
Comprehensive documentation (8,000+ lines)

In Progress 🔄

Backend runtime fixes (score normalization, health endpoint)
Frontend integration (Streamlit/React decision)
Benchmark suite (DUD-E, TDC)
AWS cloud deployment preparation
Phase 3 documentation updates

Planned 📋

Validation benchmarks published
Preprint submitted to ChemRxiv
AWS infrastructure deployed
Beta signup flow live
GitHub public launch (target: 500+ stars)
First 10-20 paying customers

Metrics (Target by Day 84)

Adapters: 39/39 ✅ (100% complete)
Production Ready: 38/39 (97%)
Test Coverage: 95%+
Documentation: Comprehensive ✅
API Keys Required: 2 (OpenAI, BioGRID; both free to obtain)
Monthly Cost: $0-5 (OpenAI usage only)

📖 Documentation

Quick Links

Deployment Guide - Docker Compose and AWS setup
User Guide - Getting started and troubleshooting
Adapter Inventory - All 39 adapters documented
Changelog - Version history and updates
Phase 3 Plan - Current phase roadmap

Architecture Docs

Phase 1 - Core infrastructure
Phase 2 - Pipeline completion
Phase 3 - Polish & launch
Frontend Design - UI design system

📖 How to Use (Agent Workflow)

Starting a Project

When you want to build something, just tell Claude your requirements:

You: "Build a todo app with React and TypeScript"

Claude will automatically:

Create a detailed todo list using TodoWrite
Delegate the first todo to the coder subagent
Let the coder implement it in its own clean context window
Delegate verification to the tester subagent (Playwright screenshots)
Route any problem to the stuck subagent, which asks you what to do
Mark the todo complete and move to the next one
Repeat until the project is complete

The Workflow

USER: "Build X"
    ↓
CLAUDE: Creates detailed todos with TodoWrite
    ↓
CLAUDE: Invokes coder subagent for todo #1
    ↓
CODER (own context): Implements feature
    ↓
    ├─→ Problem? → Invokes STUCK → You decide → Continue
    ↓
CODER: Reports completion
    ↓
CLAUDE: Invokes tester subagent
    ↓
TESTER (own context): Playwright screenshots & verification
    ↓
    ├─→ Test fails? → Invokes STUCK → You decide → Continue
    ↓
TESTER: Reports success
    ↓
CLAUDE: Marks todo complete, moves to next
    ↓
Repeat until all todos done ✅

🛠️ How It Works

Claude (The Orchestrator)

Your 200k Context Window

Creates and maintains comprehensive todo lists
Sees the complete project from A-Z
Delegates individual todos to specialized subagents
Tracks overall progress across all tasks
Maintains project state and context

How it works: Claude IS the orchestrator - it uses its 200k context to manage everything

Coder Subagent

Fresh Context Per Task

Gets invoked with ONE specific todo item
Works in its own clean context window
Writes clean, functional code
Never uses fallbacks - invokes the stuck subagent as soon as it hits a problem
Reports completion back to Claude

When it's used: Claude delegates each coding todo to this subagent

Tester Subagent

Fresh Context Per Verification

Gets invoked after each coder completion
Works in its own clean context window
Uses Playwright MCP to see rendered output
Takes screenshots to verify layouts
Tests interactions (clicks, forms, navigation)
Never marks failing tests as passing
Reports pass/fail back to Claude

When it's used: Claude delegates testing after every implementation

Stuck Subagent

Fresh Context Per Problem

Gets invoked when coder or tester hits a problem
Works in its own clean context window
ONLY subagent that can ask you questions
Presents clear options for you to choose
Blocks progress until you respond
Returns your decision to the calling agent
Ensures no blind fallbacks or workarounds

When it's used: Whenever ANY subagent encounters ANY problem

🚨 The "No Fallbacks" Rule

This is the key differentiator:

Traditional AI: Hits error → tries workaround → might fail silently
This system: Hits error → asks you → you decide → proceeds correctly

Every agent is hardwired to invoke the stuck agent rather than use fallbacks. You stay in control.
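
The contrast between the two styles can be sketched in code. The example below is an analogy rather than how the agents are implemented, since their behavior comes from their prompt files. The exception class and helper functions are hypothetical. The first loader hides a missing file behind a default value, while the second stops and surfaces a question with explicit options.

```python
from typing import Callable, Dict, List


class NeedsHumanDecision(Exception):
    """Raised instead of guessing; carries the question and the available options."""

    def __init__(self, question: str, options: List[str]) -> None:
        super().__init__(question)
        self.question = question
        self.options = options


def load_with_fallback(assets: Dict[str, bytes], name: str) -> bytes:
    # Fallback style: a missing file silently becomes empty bytes.
    return assets.get(name, b"")


def load_or_escalate(assets: Dict[str, bytes], name: str) -> bytes:
    # No-fallbacks style: a missing file stops the task and raises a question.
    if name not in assets:
        raise NeedsHumanDecision(
            f"Hero image '{name}' not found. How to proceed?",
            ["Use placeholder image", "Download from Unsplash", "Skip image for now"],
        )
    return assets[name]


def run_task(task: Callable[[], bytes], ask_human: Callable[[str, List[str]], str]) -> str:
    """Run a task; if it needs a decision, block on the human and report the choice."""
    try:
        task()
        return "completed"
    except NeedsHumanDecision as need:
        choice = ask_human(need.question, need.options)
        return f"resume with: {choice}"


assets: Dict[str, bytes] = {}  # hero.jpg is missing
silent = load_with_fallback(assets, "hero.jpg")
outcome = run_task(
    lambda: load_or_escalate(assets, "hero.jpg"),
    ask_human=lambda question, options: options[1],  # simulate the user's pick
)
print(repr(silent))
print(outcome)
```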

💡 Example Session

You: "Build a landing page with a contact form"

Claude creates todos:
  [ ] Set up HTML structure
  [ ] Create hero section
  [ ] Add contact form with validation
  [ ] Style with CSS
  [ ] Test form submission

Claude invokes coder(todo #1: "Set up HTML structure")

Coder (own context): Creates index.html
Coder: Reports completion to Claude

Claude invokes tester("Verify HTML structure loads")

Tester (own context): Uses Playwright to navigate
Tester: Takes screenshot
Tester: Verifies HTML structure visible
Tester: Reports success to Claude

Claude: Marks todo #1 complete ✓

Claude invokes coder(todo #2: "Create hero section")

Coder (own context): Implements hero section
Coder: ERROR - image file not found
Coder: Invokes stuck subagent

Stuck (own context): Asks YOU:
  "Hero image 'hero.jpg' not found. How to proceed?"
  Options:
  - Use placeholder image
  - Download from Unsplash
  - Skip image for now

You choose: "Download from Unsplash"

Stuck: Returns your decision to coder
Coder: Proceeds with Unsplash download
Coder: Reports completion to Claude

... and so on until all todos done

📁 Repository Structure

.
├── .claude/
│   ├── CLAUDE.md              # Orchestration instructions for main Claude
│   └── agents/
│       ├── coder.md          # Coder subagent definition
│       ├── tester.md         # Tester subagent definition
│       └── stuck.md          # Stuck subagent definition
├── .mcp.json                  # Playwright MCP configuration
├── .gitignore
└── README.md

🎓 Learn More

Resources

SEO Grove - AI-powered SEO automation platform
ISS AI Automation School - Join our community to learn AI automation
Income Stream Surfers YouTube - Tutorials, breakdowns, and AI automation content

Support

Have questions or want to share what you built?

Join the ISS AI Automation School community
Subscribe to Income Stream Surfers on YouTube
Check out SEO Grove for automated SEO solutions

🤝 Contributing

This is an open system! Feel free to:

Add new specialized agents
Improve existing agent prompts
Share your agent configurations
Submit PRs with enhancements

📝 How It Works Under the Hood

This system leverages Claude Code's subagent system:

CLAUDE.md instructs main Claude to be the orchestrator
Subagents are defined in .claude/agents/*.md files
Each subagent gets its own fresh context window
Main Claude maintains the 200k context with todos and project state
Playwright MCP is configured in .mcp.json for visual testing
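
To see which subagents and MCP servers a checkout defines, the files listed above can be read directly. The sketch below makes two assumptions: subagent names match the file names under .claude/agents/, and .mcp.json holds a top-level "mcpServers" object. The sample server entry is hypothetical and may differ from the one in the repository. It builds a throwaway directory with the same layout so it runs anywhere.

```python
import json
import tempfile
from pathlib import Path
from typing import List


def list_subagents(repo: Path) -> List[str]:
    """Subagent names, taken from the file names under .claude/agents/."""
    return sorted(p.stem for p in (repo / ".claude" / "agents").glob("*.md"))


def list_mcp_servers(repo: Path) -> List[str]:
    """Server names from .mcp.json, assuming a top-level 'mcpServers' object."""
    config = json.loads((repo / ".mcp.json").read_text(encoding="utf-8"))
    return sorted(config.get("mcpServers", {}))


# Build a temporary repository with the layout described in this README.
with tempfile.TemporaryDirectory() as tmp:
    repo = Path(tmp)
    agents_dir = repo / ".claude" / "agents"
    agents_dir.mkdir(parents=True)
    for name in ("coder", "tester", "stuck"):
        (agents_dir / f"{name}.md").write_text(f"# {name} subagent\n", encoding="utf-8")

    # Hypothetical server entry, for illustration only.
    sample = {"mcpServers": {"playwright": {"command": "npx", "args": ["@playwright/mcp@latest"]}}}
    (repo / ".mcp.json").write_text(json.dumps(sample), encoding="utf-8")

    subagents = list_subagents(repo)
    servers = list_mcp_servers(repo)

print(subagents)
print(servers)
```

Pointing both functions at Path(".") in a real checkout lists the actual definitions.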

The magic happens because:

Claude (200k context) = Maintains big picture, manages todos
Coder (fresh context) = Implements one task at a time
Tester (fresh context) = Verifies one implementation at a time
Stuck (fresh context) = Handles one problem at a time with human input
Each subagent has specific tools and hardwired escalation rules

🎯 Best Practices

Trust Claude - Let it create and manage the todo list
Review screenshots - The tester provides visual proof of every implementation
Make decisions when asked - The stuck agent needs your guidance
Don't interrupt the flow - Let subagents complete their work
Check the todo list - Always visible, tracks real progress

🔥 Pro Tips

Use /agents command to see all available subagents
Claude maintains the todo list in its 200k context - check anytime
Screenshots from tester are saved and can be reviewed
Each subagent has specific tools - check their .md files
Subagents get fresh contexts - no context pollution!

📜 License

MIT - Use it, modify it, share it!

🙏 Credits

Built by Income Stream Surfers

Powered by Claude Code's agent system and Playwright MCP.

Ready to build something amazing? Just run claude in this directory and tell it what you want to create! 🚀
