Skip to content

Comments

docs: use direct API for pdf chunk and pdf extract#2452

Merged
shuoweil merged 4 commits intomainfrom
shuowei-blob-change-pdf
Feb 12, 2026
Merged

docs: use direct API for pdf chunk and pdf extract#2452
shuoweil merged 4 commits intomainfrom
shuowei-blob-change-pdf

Conversation

@shuoweil
Copy link
Contributor

@shuoweil shuoweil commented Feb 11, 2026

This PR updates notebooks/multimodal/multimodal_dataframe.ipynb to demonstrate PDF text extraction a chunking using custom BigQuery Python UDFs with the pypdf library.

Fixes #<478952827> 🦕

@shuoweil shuoweil self-assigned this Feb 11, 2026
@shuoweil shuoweil requested review from a team as code owners February 11, 2026 19:59
@shuoweil shuoweil requested a review from jialuoo February 11, 2026 19:59
@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@product-auto-label product-auto-label bot added size: m Pull request size is medium. api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. labels Feb 11, 2026
@shuoweil shuoweil requested a review from GarrettWu February 11, 2026 20:46
@shuoweil shuoweil merged commit 543ce52 into main Feb 12, 2026
23 of 24 checks passed
@shuoweil shuoweil deleted the shuowei-blob-change-pdf branch February 12, 2026 19:34
tswast pushed a commit that referenced this pull request Feb 18, 2026
PR created by the Librarian CLI to initialize a release. Merging this PR
will auto trigger a release.

Librarian Version: v0.7.0
Language Image:
us-central1-docker.pkg.dev/cloud-sdk-librarian-prod/images-prod/python-librarian-generator@sha256:1a2a85ab507aea26d787c06cc7979decb117164c81dd78a745982dfda80d4f68
<details><summary>bigframes: 2.36.0</summary>

##
[2.36.0](v2.35.0...v2.36.0)
(2026-02-17)

### Features

* Initial support for biglake iceberg tables (#2409)
([ae35a98](ae35a989))

* add bigquery.ai.generate_table function (#2453)
([b925aa2](b925aa24))

### Documentation

* fix generate_text and generate_table input docs (#2455)
([078bd32](078bd32e))

* update multimodal dataframe notebook to use public APIs (#2456)
([342fa72](342fa723))

* use direct API for pdf chunk and pdf extract (#2452)
([543ce52](543ce52c))

* use direct API for audio transcription (#2447)
([59cbc5d](59cbc5db))

* Add EXIF metadata extraction example to multimodal notebook (#2429)
([84c6f88](84c6f883))

* Update multimodal notebook to use public runtime helpers (#2451)
([e36dd8b](e36dd8b4))

</details>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. size: m Pull request size is medium.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants