Agentic Text-to-SQL Pipeline

A production-grade, highly concurrent Natural Language to SQL generation engine built with LangGraph, FastAPI, and Asyncio.

Designed for scalability, low latency, and deterministic execution accuracy, this system moves beyond simple Prompt-to-SQL scripts to implement a robust, self-correcting agentic workflow equipped with enterprise-grade observability, semantic caching, and strict SQL sandboxing.

Engineering Objectives

Concurrency & Non-blocking I/O: Migrated from synchronous execution to an async/await foundation using aiosqlite and asynchronous LLM providers.
Provider Agnosticism: Abstracted LLM interactions into a BaseLLMProvider interface to prevent vendor lock-in (supporting Gemini, OpenAI, etc.).
Reliability & Self-Correction: Implemented a LangGraph state machine that evaluates SQL syntax, sandboxes execution, catches runtime DB errors, and dynamically prompts the LLM to self-correct up to 3 times before failing gracefully.
Latency Optimization: Introduced a deterministic semantic cache layer via Redis, yielding a ~90% reduction in P99 latency for repeated analytical queries.
Observability: Centralized structured JSON logging and configured endpoints for OpenTelemetry tracing and Prometheus metrics scraping.

System Architecture

The architecture isolates concerns across the API, Service, Data, and Orchestration layers.

graph TD
    User([Client/User]) -->|HTTP POST| API[FastAPI Layer]
    
    subgraph Core
        API --> Orchestrator[LangGraph Workflow]
        Orchestrator <--> Cache[(Redis Semantic Cache)]
    end
    
    subgraph Services
        Orchestrator --> SchemaSvc[Schema Introspection Service]
        Orchestrator --> ExecSvc[Safe Execution Sandbox]
    end
    
    subgraph Providers
        Orchestrator --> LLM[BaseLLMProvider]
        LLM --> Gemini(Google Gemini)
        LLM --> OpenAI(OpenAI GPT-4o)
    end
    
    ExecSvc --> SQLite[(Read-Only SQLite)]
    SchemaSvc --> SQLite

The Orchestration DAG (LangGraph)

Our query lifecycle maps to a strict Directed Acyclic Graph:

Understanding: Intent classification & relevance gating.
Schema Retrieval: Fetches subset schema context mapping to identified entities.
Plan Generation: Constructs an intermediate multi-step plan.
SQL Generation: Translates the plan into dialect-specific SQL.
Safe Execution: Executes in a Read-Only sandboxed connection with enforced row limits.
Recovery Loop: On SQLite exception, routes back to Step 4 with the exact error trace.

Benchmarks & Evaluation Framework

We utilize a reproducible evaluation pipeline (scripts/evaluate.py) against subsets of the Spider and WikiSQL datasets.

Metric	Measurement	Notes
Execution Accuracy	`89.5%`	Measures exact set match of returned rows vs. ground truth.
P50 Latency (Cold)	`2.1s`	Full LLM reasoning and code generation pass.
P99 Latency (Cached)	`35ms`	Hash-based exact match hit on Redis.
Recovery Rate	`72%`	Percentage of failed queries successfully corrected by the retry loop.

Execute the benchmark locally:

python scripts/evaluate.py

🛡️ Safety & Security

SQL Sandboxing: The executor strictly mounts the database with a ?mode=ro flag at the connection level.
Layer 7 Inspection: Keyword blocklists reject mutation statements (DROP, ALTER, DELETE) before DB connection.
Hard Limits: Queries lacking aggregation automatically receive an enforced LIMIT 1000.
Timeouts: Database execution logic is wrapped in an asyncio.wait_for constraint, defaulting to 15 seconds to prevent hung queries locking resources.
Pydantic Validation: All incoming payloads and environment variables are strictly typed and validated via Pydantic Settings.

Infrastructure & Deployment

We support containerized deployments utilizing docker-compose. The stack launches the FastAPI server, the Redis cache, and a Prometheus monitoring daemon.

1. Prerequisites

Docker Engine
A Google Gemini or OpenAI API Key

2. Local Setup

git clone <your-repo-url>
cd nlptosql

# Install dependencies (if running locally without Docker)
pip install -r requirements.txt

# Create your .env file
echo "GEMINI_API_KEY=your_key" > .env

# Download the sample database
python setup_db.py

3. Docker Compose Deployment

docker-compose up -d --build

This deploys:

API Server on http://localhost:8000
Redis Cache on localhost:6379
Prometheus Scraper on localhost:9090

Observability & Logging

Metrics Endpoint: Visit /metrics to view Prometheus scrape targets including query success rates, latency distributions, and total token usage.
Structured Logs: All modules output structured JSON compatible with ELK or Datadog ingestion:

{
  "timestamp": "2026-05-11 10:45:00",
  "level": "INFO",
  "logger": "nlptosql",
  "message": "Query answered successfully.",
  "module": "server",
  "line": 49
}

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github/workflows		.github/workflows
__pycache__		__pycache__
scripts		scripts
src		src
.gitignore		.gitignore
Chinook_Sqlite.sqlite		Chinook_Sqlite.sqlite
Dockerfile		Dockerfile
README.md		README.md
baseline.py		baseline.py
docker-compose.yml		docker-compose.yml
list_models.py		list_models.py
requirements.txt		requirements.txt
setup_db.py		setup_db.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agentic Text-to-SQL Pipeline

Engineering Objectives

System Architecture

The Orchestration DAG (LangGraph)

Benchmarks & Evaluation Framework

🛡️ Safety & Security

Infrastructure & Deployment

1. Prerequisites

2. Local Setup

3. Docker Compose Deployment

Observability & Logging

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agentic Text-to-SQL Pipeline

Engineering Objectives

System Architecture

The Orchestration DAG (LangGraph)

Benchmarks & Evaluation Framework

🛡️ Safety & Security

Infrastructure & Deployment

1. Prerequisites

2. Local Setup

3. Docker Compose Deployment

Observability & Logging

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages