The Enterprise Document Analyzer is a premium, AI-powered platform designed to extract, index, and query information from complex PDF documents. Whether dealing with digital text, scanned invoices, or handwritten notes, the system uses a hybrid approach combining traditional OCR and Google's Gemini Vision to ensure high-accuracy data retrieval.
The application uses a streamlined 3-view interface. You can switch between views using the sidebar or context buttons.
The starting point for all new workflows.

Features:
- Drag & Drop Upload: Support for PDF files.
- Previewer: Instant visual confirmation of the file.
- Extraction Modes:
- OCR (Default): Best for standard digital PDFs or clean scans. Uses
DoclingandTesseract. - GEMINI: Advanced AI Vision mode. Best for handwriting, poor scans, or complex layouts where OCR fails.
- OCR (Default): Best for standard digital PDFs or clean scans. Uses
- Ingest Button: Starts the pipeline. This processes the document, extracts tables/images, creates vector chunks, and indexes them.
Interact with your indexed knowledge base using natural language.
- History: Switch between multiple conversation threads on the left.
- Chat Window: Ask questions like "What is the total revenue in Q3?" or "Summarize the safety protocols."
- Citations:
- Quick Sources: Small pills showing page numbers.
- Visual References: The right-hand panel shows the exact text chunk and a visual snapshot of the source page with the relevant section highlighted.
A deep-dive tool to verify extraction quality.

- Images Tab: View all extracted figures, diagrams, and scanned pages. Click to zoom.
- Tables Tab: View extracted tables converted into interactive DataFrames.
- Chunks Tab: Audit the text chunks created during ingestion.
- Full Doc: View the original PDF in an embedded frame.
Manage your knowledge repository.

- Document List: See all indexed files.
- Activate: Switch the "Current Context" to a specific document for the Inspector.
- Delete: Remove a document and its vectors from the system.
- Nuclear Reset: A "Danger Zone" option to wipe the entire database clean.
| Feature | OCR Mode | Gemini Vision Mode |
|---|---|---|
| Speed | Fast ⚡ | Slower (API dependant) 🐢 |
| Cost | Low (Local compute) | Higher (Token costs) |
| Best For | Contracts, Reports, Clean Forms | Handwritten notes, Faded receipts, Charts |
| Table Extraction | Heuristic/Structural | Visual Understanding |
- Query Analysis: The system understands your question.
- Vector Search: It retrieves the top 5 most relevant text chunks from ChromaDB.
- Synthesis: Google Gemini summarizes the chunks into a coherent answer.
- Citation: Every answer is backed by verifiable source evidence.
