GRPO - Accelerated - Dataloader Question #4594

Bhoy1 · 2025-11-28T05:29:18Z

Bhoy1
Nov 28, 2025

Hi everyone,

I’ve been digging into the GRPOTrainer source code and am trying to implement a custom DataLoader. I noticed that get_train_dataloader(self) internally calls accelerator.prepare(), which is used for distributed setups. However, it also (I think) reconstructs the DataLoader, which seems like it would overwrite or invalidate any custom DataLoader I try to provide.

Has anyone experimented with custom DataLoader implementations on top of GRPOTrainer while still preserving all the benefits of accelerate? I’m mainly trying to understand the best way to integrate a custom loader without breaking the accelerator workflow.

Any guidance or past experience would be greatly appreciated!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GRPO - Accelerated - Dataloader Question #4594

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

GRPO - Accelerated - Dataloader Question #4594

Uh oh!

Bhoy1 Nov 28, 2025

Replies: 0 comments

Bhoy1
Nov 28, 2025