docs: update how_to docs to reflect new loaders interface and functionalities #31219

MedlockM · 2025-05-13T16:04:55Z

Description: Updates two notebooks in the how_to documentation to reflect new loader interfaces and functionalities.
Issue: Some how_to notebooks still used loader interfaces from earlier versions of LangChain and did not demonstrate the latest loader features: extracting images with ImageBlobParser, extracting tables in specific output formats, parsing documents with vision-language models via ZeroxPDFLoader, and using CloudBlobLoader in GenericLoader.
Dependencies: py-zerox
Twitter handle: @MarcMedlock2

vercel · 2025-05-13T16:05:19Z

The latest updates on your projects.

Name	Status	Updated (UTC)
langchain	✅ Ready	May 13, 2025 9:26pm

MedlockM · 2025-05-14T15:47:47Z

@ccurme
Could you please justify your changes, especially those to the PDF guide? For example, why not keep the part explaining the new method for parsing with a vision model using Zerox? The approach currently presented (converting the PDF to an image, then sending the image to a chat object) is "old-fashioned", isn't it?

Likewise, the ability to set up an ImageBlobParser directly in a loader to extract tables or images with different strategies is a new feature that should be highlighted, don't you think?

Marc Medlock and others added 7 commits April 21, 2025 11:10

docs: update how_to docs to reflect new loader interfaces

ec5a759

docs: update how_to docs to reflect new loader interfaces

8244669

docs: update how_to docs to reflect new loader interfaces

5481b1a

Merge branch 'langchain-ai:master' into improve-how-to-docs

3d50017

docs: update how_to docs to reflect new loader interfaces

fc0c248

Merge remote-tracking branch 'origin/improve-how-to-docs' into improv…

62b09ea

…e-how-to-docs

docs: update how_to docs to reflect new loader interfaces

d060d75

dosubot bot added the size:XL label May 13, 2025

dosubot bot added the 🤖:docs label May 13, 2025

vercel bot had a problem deploying to Preview May 13, 2025 16:16 Failure

Marc Medlock added 2 commits May 13, 2025 18:17

docs: fix syntax for make lint

244efa1

docs: fix syntax for make lint

d193b16

vercel bot had a problem deploying to Preview May 13, 2025 16:40 Failure

change section header to match anchor link

f0ce9e6

vercel bot deployed to Preview May 13, 2025 19:18 View deployment

ccurme added 5 commits May 13, 2025 15:54

remove some output cells

846c0ba

Merge branch 'master' into improve-how-to-docs

4e0493b

revert changes to pdf guide

0daa485

delete example

87940b1

delete some output

6903ff2

dosubot bot added size:L and removed size:XL labels May 13, 2025

fix

e2bb293

vercel bot deployed to Preview May 13, 2025 20:21 View deployment

Merge branch 'master' into improve-how-to-docs

2c2a4b8

vercel bot deployed to Preview May 13, 2025 21:26 View deployment

ccurme approved these changes May 13, 2025

dosubot bot added the lgtm label May 13, 2025

ccurme merged commit ce0b1a9 into langchain-ai:master May 13, 2025
13 checks passed

pprados mentioned this pull request May 14, 2025

Refactoring PDF loaders: all #28970

Closed

2 tasks
