add minibatching #153
Conversation
The documentation is not available anymore as the PR was closed or merged.
Thanks a lot for this great addition! I left a few comments and questions as a first pass!
```python
mini_batch_data,
batch_size=self.config.mini_batch_size,
shuffle=True,
collate_fn=collator,
```
```suggestion
collate_fn=collator,
drop_last=True,
```
Maybe we can add this to avoid some corner cases such as the one described in a previous issue.
Sounds good, let's also raise a warning in that case so the user knows that a batch will be dropped.
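For illustration, a minimal sketch of what the combined suggestion could look like inside the trainer (names are reused from the diff above; the exact warning wording is an assumption, not the PR's code):

```python
import warnings
from torch.utils.data import DataLoader

remainder = len(mini_batch_data) % self.config.mini_batch_size
if remainder != 0:
    # drop_last=True would otherwise silently discard these samples.
    warnings.warn(
        f"{remainder} samples do not fill a full mini-batch of size "
        f"{self.config.mini_batch_size} and will be dropped."
    )

mini_batch_dataloader = DataLoader(
    mini_batch_data,
    batch_size=self.config.mini_batch_size,
    shuffle=True,
    collate_fn=collator,
    drop_last=True,  # avoid a final mini-batch smaller than mini_batch_size
)
```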
trl/trainer/ppo_trainer.py
```diff
- bs = self.config.batch_size
- fbs = self.config.forward_batch_size
+ bs = len(queries)
+ fbs = min(bs, self.config.forward_batch_size)
```
So is this for the case where the last batch has fewer instances than `mini_batch_size`, or the case where a user puts a `batch_size` that is smaller than `mini_batch_size` in the config? If it's the second case, we can maybe add a warning in the config; if it's the first case, since we have `drop_last=True` set here, I don't think we'll face it, but I am not sure.
It's for the case where `mini_batch_size` is smaller than `forward_batch_size` during the forward passes inside the mini-batch loop. I am also not quite happy with how we do it, actually.
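A hypothetical sketch of the pattern under discussion, assuming a Hugging Face-style model whose output has a `.logits` attribute; the helper name and signature are invented for illustration:

```python
import torch

def chunked_forward(model, input_ids: torch.Tensor, forward_batch_size: int):
    # Run the model in chunks of at most `forward_batch_size` rows.
    # min() clamps the chunk size when the (mini-)batch itself is smaller
    # than forward_batch_size, which is the corner case discussed above.
    bs = len(input_ids)
    fbs = min(bs, forward_batch_size)
    all_logits = []
    for i in range(0, bs, fbs):
        with torch.no_grad():
            out = model(input_ids[i : i + fbs])
        all_logits.append(out.logits)
    return torch.cat(all_logits, dim=0)
```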
Also, what about completely removing `forward_batch_size` from the config? I don't think this is a breaking change as the configs cannot be pushed to the Hub; we just need to update the examples accordingly. I believe this can be done in a follow-up PR too.
The breaking change actually also happens for users who currently use the library with `forward_batch_size`.
This solution makes a lot of sense, yes!
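A sketch of one possible deprecation shim for this; the field names, defaults, and the mapping of the old field onto `mini_batch_size` are assumptions for illustration, not the PR's actual code:

```python
import warnings
from dataclasses import dataclass
from typing import Optional

@dataclass
class PPOConfig:
    batch_size: int = 256
    mini_batch_size: int = 1
    # Deprecated field, kept temporarily so existing user code keeps working.
    forward_batch_size: Optional[int] = None

    def __post_init__(self):
        if self.forward_batch_size is not None:
            warnings.warn(
                "`forward_batch_size` is deprecated; use `mini_batch_size` "
                "instead.",
                DeprecationWarning,
            )
            self.mini_batch_size = self.forward_batch_size
```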
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Thanks a lot for your great work on this! 💯
Until now the PPO mini batch size has been hardcoded to 1. This PR aims to change that by refactoring the forward/backward passing logic.

In summary, this PR does the following things:

- `batched_forward_pass` returns a new `mask` which can be used to mask parts of the sequence to be ignored
- a `dataloader` with the `mini_batch_size` is used to sample from the current PPO batch
- in the `loss` method, all operations affected by masked parts of the sequence are replaced with masked ones (`masked_mean`, `masked_whiten`; see the sketch below)
- `compute_logits_vpred` is removed and `batched_forward_pass` is used for everything
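Since the summary mentions masked variants of standard ops, here is a minimal sketch of what `masked_mean` and `masked_whiten` could look like; this is an illustration under assumed signatures, not necessarily the library's exact implementation:

```python
import torch

def masked_mean(values: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    # Mean computed only over entries where mask == 1.
    return (values * mask).sum() / mask.sum()

def masked_whiten(values: torch.Tensor, mask: torch.Tensor,
                  shift_mean: bool = True) -> torch.Tensor:
    # Whiten values using mean/variance estimated from unmasked entries only.
    mean = masked_mean(values, mask)
    var = masked_mean((values - mean) ** 2, mask)
    whitened = (values - mean) * torch.rsqrt(var + 1e-8)
    if not shift_mean:
        whitened = whitened + mean
    return whitened

# Example: statistics ignore the padded (masked-out) tail of each row.
values = torch.randn(4, 10)
mask = (torch.arange(10) < 7).float().expand(4, 10)
print(masked_mean(values, mask))
```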
W&B logs: