Improve learner multi-gpu ergonomics #2277

laggui · 2024-09-13T18:08:48Z

Especially w.r.t. data loading, the data should automatically be transferred to the correct device.

Right now the batch is moved to the first GPU, and users have to move the inputs to the correct device themselves before inference. This works, but it is less ergonomic and might limit multi-GPU usage, since the first GPU always gets filled first.

kingwingfly · 2024-10-18T11:44:48Z

Hi, bro. How did you find the correct way to work with multiple GPUs in burn?

"Right now the batch is moved to the first GPU and users are moving the inputs to the correct device"

Could you please tell me how?

I defined the Learner with LearnerBuilder's .devices(vec![Cuda(0), Cuda(1)]), but initialized the model on Cuda(0).

How do I move a mini-batch to the correct GPU, and in which part of the code should I do this?
The model is on Cuda(0), but the data may be on Cuda(1). They are different devices, so how can it work?

It would be very kind of you if you could provide any help!

laggui · 2024-10-18T11:55:28Z

Take a look at the text classification example 🙂 more specifically, how the inputs are moved to the same device as the model.

kingwingfly · 2024-10-18T13:22:58Z

Thanks a looooot! I managed to get it working. The key point is that the model is forked onto each device, and model.devices() returns a Vec<Device> whose first element is the device the model is currently on.
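For readers landing on this thread, here is a minimal std-only sketch of the pattern described above. It is not burn code: `Device`, `Tensor`, and `Model` are toy stand-ins invented for illustration, and only the names `devices()`, `fork`, and `to_device` mirror the burn methods discussed here. The assumption is that each device gets its own forked copy of the model, and that the forward pass moves the incoming batch to the first device reported by the model before computing.

```rust
// Toy stand-ins (NOT burn's types): they only track where data lives.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Device {
    Cuda(usize),
}

#[derive(Clone, Debug, PartialEq)]
pub struct Tensor {
    pub data: Vec<f32>,
    pub device: Device,
}

impl Tensor {
    /// Returns the tensor placed on `device` (hypothetical analogue of a device transfer).
    pub fn to_device(self, device: &Device) -> Tensor {
        Tensor { data: self.data, device: *device }
    }
}

#[derive(Clone, Debug)]
pub struct Model {
    weight: Tensor,
}

impl Model {
    /// Creates a model whose single parameter lives on `device`.
    pub fn new(device: &Device) -> Self {
        Model { weight: Tensor { data: vec![2.0], device: *device } }
    }

    /// Returns a copy of the model with its parameter on `device`,
    /// standing in for the per-device fork mentioned in the thread.
    pub fn fork(&self, device: &Device) -> Self {
        Model { weight: self.weight.clone().to_device(device) }
    }

    /// Devices the model's parameters live on; the first entry is used as the target.
    pub fn devices(&self) -> Vec<Device> {
        vec![self.weight.device]
    }

    /// Moves the input to the model's device, then scales it by the weight.
    pub fn forward(&self, input: Tensor) -> Tensor {
        let device = self.devices()[0];
        let input = input.to_device(&device);
        let data = input.data.iter().map(|x| x * self.weight.data[0]).collect();
        Tensor { data, device }
    }
}
```

With this shape, a batch that was loaded on Cuda(0) can be passed to the copy forked onto Cuda(1) without the caller knowing which device that copy lives on, because the transfer happens inside `forward`. In real burn code the same transfer would be done on the batch tensors inside the model's own forward or step function, as in the text classification example referenced above.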

laggui added the enhancement (Enhance existing features) label on Sep 13, 2024
