Add Polars documentation #2685

Open
mcagriardic wants to merge 1 commit into freeCodeCamp:main from mcagriardic:add-polars-documentation

mcagriardic commented May 24, 2026

Add a scraper for the Polars Python API reference
(https://docs.pola.rs/api/python/stable/reference/), pinned to the stable 1.41.0 release.

The site uses the pydata-sphinx-theme, so the scraper reuses the shared sphinx/clean_html filter alongside Polars-specific filters:

  • clean_html removes the theme chrome (sidebars, in-page TOC, prev/next navigation, footer) and tags code blocks for Python highlighting.
  • entries names each page from its heading and groups the entries into types (DataFrame, LazyFrame, Series, Expressions, Functions, Data Types, Input/output, etc.). Top-level members, which are stored flat under api/, are classified by their member name.
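
To illustrate the idea of classifying a flat member by its name, here is a minimal sketch in Python (DevDocs filters themselves are written in Ruby, so this is not the code in this PR). The `TYPE_BY_PREFIX` table, the `classify` helper, and the "Functions" fallback are all hypothetical and only show the general shape of name-based grouping.

```python
# Hypothetical mapping from the first component of a member name to a type.
# The real filter may use different rules and cover more types.
TYPE_BY_PREFIX = {
    "DataFrame": "DataFrame",
    "LazyFrame": "LazyFrame",
    "Series": "Series",
    "Expr": "Expressions",
}


def classify(member_name: str) -> str:
    """Return a type for a qualified name such as 'polars.DataFrame.filter'."""
    parts = member_name.split(".")
    # Drop the leading package name if present.
    if parts and parts[0] == "polars":
        parts = parts[1:]
    head = parts[0] if parts else ""
    # Assumption: anything that is not a known class falls back to "Functions".
    return TYPE_BY_PREFIX.get(head, "Functions")
```

With this table, `classify("polars.DataFrame.filter")` yields `"DataFrame"`, while a top-level name such as `polars.concat` falls through to the assumed default.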

Latest-version detection uses the most recent GitHub release and strips the py- prefix, since the repo also tags Rust (rs-) releases.
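
The prefix handling can be sketched as follows, again in Python rather than the Ruby used by DevDocs. The helper names are hypothetical, and the `rs-` tag in the usage note is a made-up sample. Scanning past non-`py-` tags is an assumption about one way to cope with mixed tags, not a description of what this PR does.

```python
import re

# Matches tags such as "py-1.41.0" and captures the version part.
PY_TAG = re.compile(r"^py-(\d+(?:\.\d+)*)$")


def python_version_from_tag(tag):
    """Return the version with the py- prefix stripped, or None for other tags."""
    match = PY_TAG.match(tag)
    return match.group(1) if match else None


def latest_python_version(tags):
    """Given tag names ordered newest first, return the first Python version found."""
    for tag in tags:
        version = python_version_from_tag(tag)
        if version is not None:
            return version
    return None
```

For a sample list such as `["rs-0.1.0", "py-1.41.0"]`, `latest_python_version` skips the Rust tag and returns `"1.41.0"`.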

If you’re adding a new scraper, please ensure that you have:

  • Tested the scraper on a local copy of DevDocs
  • Ensured that the docs are styled similarly to other docs on DevDocs
  • Added these files to the public/icons/docs/polars/ directory:
    • 16.png: a 16×16 pixel icon for the doc
    • 16@2x.png: a 32×32 pixel icon for the doc
    • SOURCE: A text file containing the URL to the page the image can be found on or the URL of the original image itself

@mcagriardic mcagriardic requested a review from a team as a code owner May 24, 2026 17:06
mcagriardic (Author) commented:

@simon04, I would appreciate a review. Many thanks!
