Installation

This guide covers installing and configuring fact_inventory for standalone operation. For deployment patterns (bare metal, Kubernetes, embedding), see DEPLOYMENT.md. For development setup, see DEVELOPMENT.md.

Requirements

Python 3.12+
PostgreSQL 16+ (recommended for JSONB and GIN index performance)
uv (Python package installer; see https://docs.astral.sh/uv/)

pyproject.toml keeps a record of the required python modules.

Quick Start

Clone the repository:

git clone <repository-url>
cd python-fact_inventory

Install dependencies:

uv sync

Set up environment configuration:

Create a .env.production file (or .env.{DEPLOYMENT} for your target environment):

DATABASE_URI=postgresql+asyncpg://user:password@localhost/dbname

Start the application:

export WEB_CONCURRENCY=8
DEPLOYMENT=XYZ uvicorn fact_inventory:app_factory --factory --host 0.0.0.0 --port 8000

The application is now listening on http://0.0.0.0:8000/api/v1/facts (without the /fact_inventory prefix; see Deployment note below) with 8 workers.

Configuration

Configuration is managed through environment variables. The DEPLOYMENT variable selects which .env file to load:

export DEPLOYMENT=production  # loads .env.production
export DEPLOYMENT=staging     # loads .env.staging
export DEPLOYMENT=testing     # loads .env.testing

Environment Variables

All available configuration options are defined in ../fact_inventory/lib/settings.py.

Common settings:

Variable	Description
`DEPLOYMENT`	Environment name (e.g., `production`, `staging`, `testing`)
`DATABASE_URI`	PostgreSQL connection string: `postgresql+asyncpg://user:pass@host/db`
`DEBUG`	Enable debug mode and OpenAPI documentation
`APP_PREFIX`	URL prefix for the application (use `/fact_inventory` for direct Uvicorn access)
`APP_NAME`	Service name for logging
`RETENTION_DAYS`	Keep facts for this many days before auto-purge
`HISTORY_MAX_ENTRIES`	Keep this many newest facts per client after deduplication
`MAX_REQUEST_BODY_MB`	Maximum total request body size (MB)
`MAX_JSON_FIELD_MB`	Maximum per-field JSON size (MB)
`RATE_LIMIT_REQUESTS`	Requests per hour per IP
`RATE_LIMIT_WINDOW_HOURS`	Rate limit window (hours)

Database Configuration

PostgreSQL 16+ is recommended for JSONB and GIN index performance. For development, SQLite is supported but not recommended for production.

PostgreSQL connection string format:

postgresql+asyncpg://username:password@hostname:port/database_name

Example with local PostgreSQL:

export DATABASE_URI="postgresql+asyncpg://fact_user:secure_password@localhost:5432/fact_inventory"

Rate Limiting

Adjust via environment variables:

export RATE_LIMIT_REQUESTS=10
export RATE_LIMIT_WINDOW_HOURS=1

This allows 10 requests per hour per IP. Rate limiting is based on the client's IP address and can be bypassed with IP rotation.

Payload Size Limits

If clients submit large payloads:

# Increase total request body limit to 50 MB
export MAX_REQUEST_BODY_MB=50

# Increase per-field limit to 10 MB
export MAX_JSON_FIELD_MB=10

# Ensure total > 3 x per-field
# Example: 50 > 3 * 10 (30) -- OK

Data Retention

Purge old facts automatically:

# Keep facts for 180 days (6 months)
export RETENTION_DAYS=180

# Check for expired facts every 24 hours
export RETENTION_CHECK_INTERVAL_HOURS=24

# Add up to 30 minutes of random jitter to prevent thundering-herd
export RETENTION_CHECK_JITTER_MINUTES=30

Deduplication

Keep only the newest facts per client:

# Keep the 5 newest facts per client
export HISTORY_MAX_ENTRIES=5

# Check for duplicates every 12 hours
export HISTORY_CHECK_INTERVAL_HOURS=12

# Add up to 15 minutes of random jitter
export HISTORY_CHECK_JITTER_MINUTES=15

Database Setup

For production, use Litestar's Alembic migrations to manage schema changes safely across deployments.

Why use Alembic?

Version control for database schema
Safe rollback if migrations fail
Audit trail of schema changes
Can be scripted into CI/CD pipelines
Works across multiple instances without race conditions

Setup:

Run migrations against your production database:

DEPLOYMENT=production uv run litestar --app fact_inventory:app database upgrade

Start the application (tables will already exist):

export WEB_CONCURRENCY=8
DEPLOYMENT=production uvicorn fact_inventory:app_factory --factory --host 0.0.0.0 --port 8000

Running the Application

Standalone with Uvicorn

Start the ASGI server:

export WEB_CONCURRENCY=8
uvicorn fact_inventory:app_factory --factory --host 0.0.0.0 --port 8000

The application listens on http://0.0.0.0:8000.

Important deployment note: By default, APP_PREFIX=/ because a reverse proxy (nginx, Apache, or Kubernetes Ingress) strips the external /fact_inventory prefix before forwarding. The application internally sees clean paths like /api/v1/facts.

For direct development access without a reverse proxy, override the prefix:

export WEB_CONCURRENCY=8
APP_PREFIX=/fact_inventory uvicorn fact_inventory:app_factory --factory --host 0.0.0.0 --port 8000

Then access the API at http://localhost:8000/fact_inventory/api/v1/facts.

With systemd (Bare Metal)

Create /etc/systemd/system/fact-inventory.service:

[Unit]
Description=Fact Inventory Service
After=network.target postgresql.service
Wants=postgresql.service

[Service]
Type=notify
User=fact_inventory
Group=fact_inventory
WorkingDirectory=/opt/fact_inventory
Environment="DEPLOYMENT=production"
Environment="WEB_CONCURRENCY=8"
ExecStart=/usr/bin/uv run uvicorn fact_inventory:app_factory --factory --host 127.0.0.1 --port 8000
Restart=on-failure
RestartSec=10

[Install]
WantedBy=multi-user.target

Enable and start:

sudo systemctl daemon-reload
sudo systemctl enable fact-inventory
sudo systemctl start fact-inventory

View logs:

sudo journalctl -u fact-inventory -f

Debug Mode

Enable OpenAPI documentation:

export DEBUG=true
DEPLOYMENT=testing uvicorn fact_inventory:app_factory --factory

Then visit:

OpenAPI spec: http://localhost:8000/fact_inventory/schema
Swagger UI: http://localhost:8000/fact_inventory/schema/swagger

(Replace /fact_inventory with your configured APP_PREFIX if different.)

Health Checks

The application provides health and readiness endpoints for monitoring you can enable:

Health: GET /health (or GET /fact_inventory/health if using APP_PREFIX=/fact_inventory)
Ready: GET /ready (or GET /fact_inventory/ready if using APP_PREFIX=/fact_inventory)

Example with curl:

curl http://localhost:8000/health
curl http://localhost:8000/ready

These are useful for container orchestration probes and load balancer health checks.

Troubleshooting

Database Connection Failed

Verify PostgreSQL is running and the connection string is correct:

psql "postgresql://fact_user@localhost/fact_inventory"

If that works, verify the async connection string with the correct driver:

export DATABASE_URI="postgresql+asyncpg://fact_user@localhost/fact_inventory"

HTTP 413 Request Entity Too Large

Two possible causes:

Total request body exceeds MAX_REQUEST_BODY_MB
- Increase MAX_REQUEST_BODY_MB
- Ensure MAX_REQUEST_BODY_MB > 3 x MAX_JSON_FIELD_MB
Single JSON field exceeds MAX_JSON_FIELD_MB
- Increase MAX_JSON_FIELD_MB
- Check logs for the specific field name

Example fix:

export MAX_REQUEST_BODY_MB=50
export MAX_JSON_FIELD_MB=10

Rate Limited (HTTP 429)

Clients are rate-limited by IP by default.

Adjust RATE_LIMIT_REQUESTS and RATE_LIMIT_WINDOW_HOURS
Note: Rate limits can be bypassed with IP rotation

Application Won't Start

Check environment variables:

echo $DEPLOYMENT
echo $DATABASE_URI

Ensure .env.{DEPLOYMENT} exists in the application directory and contains DATABASE_URI.

High Database Load

The background cleanup tasks (retention and deduplication) run on intervals with random jitter. If you see periodic spikes in database load that impact performance, increase the jitter:

export RETENTION_CHECK_JITTER_MINUTES=60
export HISTORY_CHECK_JITTER_MINUTES=60

Or increase the check intervals to run less frequently:

export RETENTION_CHECK_INTERVAL_HOURS=48
export HISTORY_CHECK_INTERVAL_HOURS=24

Next Steps

Deployment: See DEPLOYMENT.md for bare metal, Kubernetes, and embedding patterns
Example Client: See gather_facts.yml in the repository root for an Ansible playbook
Querying: See VIEWS.md for SQL examples and pre-built views

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Installation

Requirements

Quick Start

Configuration

Environment Variables

Database Configuration

Rate Limiting

Payload Size Limits

Data Retention

Deduplication

Database Setup

Running the Application

Standalone with Uvicorn

With systemd (Bare Metal)

Debug Mode

Health Checks

Troubleshooting

Database Connection Failed

HTTP 413 Request Entity Too Large

Rate Limited (HTTP 429)

Application Won't Start

High Database Load

Next Steps

FilesExpand file tree

INSTALL.md

Latest commit

History

INSTALL.md

File metadata and controls

Installation

Requirements

Quick Start

Configuration

Environment Variables

Database Configuration

Rate Limiting

Payload Size Limits

Data Retention

Deduplication

Database Setup

Running the Application

Standalone with Uvicorn

With systemd (Bare Metal)

Debug Mode

Health Checks

Troubleshooting

Database Connection Failed

HTTP 413 Request Entity Too Large

Rate Limited (HTTP 429)

Application Won't Start

High Database Load

Next Steps