Skip to content

Fix bp#2009

Open
HW618 wants to merge 2 commits into
unclecode:mainfrom
HW618:fix-bp
Open

Fix bp#2009
HW618 wants to merge 2 commits into
unclecode:mainfrom
HW618:fix-bp

Conversation

@HW618

@HW618 HW618 commented Jun 10, 2026

Copy link
Copy Markdown

Summary

Please include a summary of the change and/or which issues are fixed.

eg: Fixes #123 (Tag GitHub issue numbers in this format, so it automatically links the issues with your PR)
Fixes #2007

List of files changed and why

eg: quickstart.py - To update the example as per new changes
modified crawl4ai/html2text/__init__.py -class HTML2Text/ def handle-tag
when bypass_tables is enabled , it would only return tags before, causing malformed markdown.

How Has This Been Tested?

Please describe the tests that you ran to verify your changes.
test by scraping target_url = "https://en.wikipedia.org/wiki/List_of_prime_ministers_of_India" and it can generate correct markdown with rowspan/colspan involved.

Checklist:

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have added/updated unit tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: the fit_html/cleaned_html/raw_html generator removes rowspan/colspan from tables

1 participant