Skip to content

Conversation

@maxmnemonic
Copy link
Member

Implementation of HTML backend that uses headless browser (via playwright) to materialize HTML pages into images, and add provenances with bboxes to all elements in the converted docling document

Checklist:

  • Documentation has been updated, if necessary.
  • Examples have been added, if necessary.
  • Tests have been added, if necessary.

…ight) to materialize HTML pages into images, and add provenances with bboxes to all elements in the converted docling document

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
@maxmnemonic maxmnemonic self-assigned this Feb 9, 2026
@github-actions
Copy link
Contributor

github-actions bot commented Feb 9, 2026

DCO Check Passed

Thanks @maxmnemonic, all your commits are properly signed off. 🎉

@maxmnemonic maxmnemonic added the html issue related to html backend label Feb 9, 2026
@mergify
Copy link

mergify bot commented Feb 9, 2026

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

@codecov
Copy link

codecov bot commented Feb 9, 2026

Codecov Report

❌ Patch coverage is 11.71875% with 226 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
docling/backend/html_backend.py 8.13% 226 Missing ⚠️

📢 Thoughts on this report? Let us know!

'scipy (>=1.6.0,<2.0.0)',
"accelerate>=1.0.0,<2",
"polyfactory>=2.22.2",
"playwright>=1.58.0",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this should go in an optional deps

…TML backend

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

html issue related to html backend

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants