optimize VRAM for calculating pos_bias in LayoutLM v2, v3 #26139

NormXU · 2023-09-13T10:55:08Z

What does this PR do?

The current implementation of calculating 1d_pos_bias/2d_pos_bias in LayoutLMv2, v3 is VRAM-consuming due to the large one-hot matrix.

Considering the idea of 1d_pos_bias/2d_pos_bias is to categorize all relative positions into several buckets, assign each position id to a specific bucket based on its relative distance to another token, and embed the position id into a feature, we can drop the large one-hot matrix and directly use the Linear weight features like an nn.Embedding.

In my tests, as for an input sequence of $[10, 1024]$ (bz, nseq), this can save 3 Gb VRAM for 2d_pos_bias calculations

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests? # This PR can reuse previous tests

Who can review?

@ArthurZucker and @younesbelkada

ArthurZucker · 2023-09-22T00:11:18Z

Hey! Thanks for opening a PR, pinging @rafaelpadilla for a review here 😉

rafaelpadilla

Nice catch @NormXU !
Sorry for the delay on reviewing it. I had problems with the pytesseract dependency.
I was able to dig into the code and saw that your changes make the process simpler and produce the same outputs.
Just make sure to make the other tests pass.

NormXU · 2023-09-28T07:53:12Z

@rafaelpadilla I've reformatted the codes. It's ready to be merged.

LysandreJik

Thank you for your PR @NormXU and for your review @rafaelpadilla

HuggingFaceDocBuilderDev · 2023-09-28T08:11:04Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

…ating pos_bias in LayoutLM v2, v3 (#26139)" (#30988) * Revert "optimize VRAM for calculating pos_bias in LayoutLM v2, v3 (#26139)" This reverts commit a7e0ed8. * Instead of reverting commit, wrap indexing in torch.no_grad context * Apply wrapping in LayoutLMv2 * Add comments explaining reason for no_grad * Fix code format --------- Co-authored-by: Kevin Koehncke <kevin.koehncke@uipath.com>

optimize layoutv2, v3 for VRAM saving

42ef93a

ArthurZucker requested a review from rafaelpadilla September 22, 2023 00:11

rafaelpadilla approved these changes Sep 27, 2023

View reviewed changes

reformat codes

0b51ffa

LysandreJik approved these changes Sep 28, 2023

View reviewed changes

LysandreJik merged commit a7e0ed8 into huggingface:main Sep 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

optimize VRAM for calculating pos_bias in LayoutLM v2, v3 #26139

optimize VRAM for calculating pos_bias in LayoutLM v2, v3 #26139

Uh oh!

NormXU commented Sep 13, 2023 •

edited

Loading

Uh oh!

ArthurZucker commented Sep 22, 2023

Uh oh!

rafaelpadilla left a comment

Uh oh!

NormXU commented Sep 28, 2023 •

edited

Loading

Uh oh!

LysandreJik left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Sep 28, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

optimize VRAM for calculating pos_bias in LayoutLM v2, v3 #26139

optimize VRAM for calculating pos_bias in LayoutLM v2, v3 #26139

Uh oh!

Conversation

NormXU commented Sep 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

ArthurZucker commented Sep 22, 2023

Uh oh!

rafaelpadilla left a comment

Choose a reason for hiding this comment

Uh oh!

NormXU commented Sep 28, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LysandreJik left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Sep 28, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

NormXU commented Sep 13, 2023 •

edited

Loading

NormXU commented Sep 28, 2023 •

edited

Loading