feat: add native SQLBot-powered data extraction workflow#11
Merged
asd765973346 merged 1 commit intoOpenDCAI:mainfrom Mar 23, 2026
Merged
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
PR Description
Summary
This PR introduces a native structured-data extraction workflow into Open-NotebookLM and embeds the SQLBot core as an internal backend module.
The goal is to let notebook users work with structured CSV data in a multi-turn workflow:
What’s included
1. Intelligent data extraction workflow
data_extractworkflow for notebook CSV sources2. Multi-datasource querying
3. Session / turn / artifact persistence
4. Artifact reuse and source import
5. Embedded SQLBot backend
sqlbot_backend/external/embeddedSQLBot adapter modesembeddedMain implementation areas
Backend
data_extractroutes and service layerFrontend
Embedded engine
sqlbot_backend/as a vendored structured-data engineWhy this change
Open-NotebookLM currently works well for document-centric workflows, but structured data analysis needs a different interaction model.
This PR adds a notebook-native workflow for structured data so users can:
Current behavior / scope
This PR is focused on the data extraction workflow.
It does not attempt to expose the full SQLBot system as a general-purpose backend replacement for all Open-NotebookLM workflows.
Current scope:
Compatibility notes
SQLBOT_MODEsupports both:externalembeddedembeddedNotes for reviewers
This is a large PR because it combines:
Recommended review order:
fastapi_app/routers/data_extract.pyfastapi_app/services/data_extract_service.pyfastapi_app/workflow_adapters/*frontend_zh/src/pages/NotebookView.tsxsqlbot_backend/Follow-up work
Potential follow-ups after this PR: