Bugfix/get device by DrMicrobit · Pull Request #370 · TylerYep/torchinfo

DrMicrobit · 2025-07-09T12:13:10Z

This PR resolves #371: M2 Mac: Runtime error in training of model after call to torchinfo.summary()

With torchinfo 1.8 on a Mac with an M2 chip, the following code results in a runtime error:

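import torch
from torch import nn
from torchinfo import summary

device = torch.accelerator.current_accelerator()  # "mps" on an M2 Mac
model = nn.Sequential(nn.Flatten(), nn.Linear(3072, 10)).to(device)
summary(model, input_size=(batch_size, 3, 32, 32))  # note: no device= argument
...
out = model(data)  # data is on the accelerator; this call raises the RuntimeError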

with the error message

RuntimeError: Tensor for argument weight is on cpu but expected on mps

The same code ran fine on Linux with an NVIDIA card.

Cause of bug:
In torchinfo.py, the function get_device() seems to recognise only CUDA as an accelerator, whereas other platforms may have different accelerators. E.g., M-chip Macs have "mps".
As a result, when device= is not given in the call to summary(), torchinfo apparently moves the model to "cpu". This then causes a runtime error during model training (or evaluation), because the data is on the accelerator while the model (or parts of it) is on the CPU.
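
To check whether a model has been moved, one can compare the devices of its parameters before and after the call to summary(). The helper below is a minimal sketch for that purpose; the name parameter_devices is hypothetical and not part of torchinfo or PyTorch. It only assumes that the model exposes parameters() and that each parameter has a .device attribute, as torch.nn.Module does.

```python
from typing import Any, Set


def parameter_devices(model: Any) -> Set[str]:
    """Return the names of all devices that the model's parameters live on.

    Hypothetical diagnostic helper: it relies only on model.parameters()
    and on each parameter having a .device attribute.
    """
    return {str(param.device) for param in model.parameters()}
```

If parameter_devices(model) reports "mps" before summary() and "cpu" afterwards, the model was moved by the call. Based on the description above, passing device= explicitly to summary() should avoid the problem in affected versions.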

Bug fix:
I have created a PR that should fix the bug for any accelerator recognised by PyTorch.

New behaviour of get_device():
Unchanged:

If input_data is given, the device should not be changed (to allow for multi-device models, etc.)

Changed:

Otherwise, returns the device of the model's first parameter if the model has any parameters,
otherwise returns the current accelerator if one is available,
otherwise returns cpu.
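
The fallback order above can be sketched roughly as follows. This is an illustration of the described behaviour, not the code in the PR: the names get_device_sketch, accelerator_lookup and _torch_accelerator are hypothetical, returning None when input_data is given is an assumption about how "the device should not be changed" is signalled, and the final fallback is returned as the plain string "cpu". torch is imported lazily so that the ordering logic can be exercised with stand-in objects.

```python
from typing import Any, Callable, Optional


def _torch_accelerator() -> Optional[Any]:
    """Ask PyTorch for the current accelerator; return None if there is none."""
    import torch  # lazy import: only needed when this fallback is reached

    # torch.accelerator only exists in recent PyTorch releases.
    accel = getattr(torch, "accelerator", None)
    if accel is not None and accel.is_available():
        return accel.current_accelerator()
    return None


def get_device_sketch(
    model: Any,
    input_data: Any = None,
    accelerator_lookup: Callable[[], Optional[Any]] = _torch_accelerator,
) -> Optional[Any]:
    """Hypothetical restatement of the fallback order described above."""
    # Unchanged rule: if input_data is given, do not pick a device at all
    # (assumption: this is signalled by returning None).
    if input_data is not None:
        return None

    # 1. Device of the model's first parameter, if the model has any.
    first_param = next(iter(model.parameters()), None)
    if first_param is not None:
        return first_param.device

    # 2. Current accelerator (CUDA, MPS, ...), if one is available.
    accel = accelerator_lookup()
    if accel is not None:
        return accel

    # 3. Last resort.
    return "cpu"
```

Taking the device from the first parameter means a model that was already moved to "mps" stays there, which is the case that failed before.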

Old version failed to recognise non-cuda accelerators, which led to bugs when torchinfo.summary() was called without "device=" parameter on, e.g., Macs with M-chips, where the accelerator is "mps".
New version:
- returns device of first parameter of model if present
- else queries torch for an available accelerator and returns that
- else returns "cpu"

DrMicrobit added 2 commits July 9, 2025 13:39

fix: fix previous commit which had wrong data type

1d8784c

Had left "Any" data type hint from my testing code

ChanifRusydi mentioned this pull request Sep 16, 2025

Fixing #371 and tried adding directml support #368 #376

Open

DrMicrobit wants to merge 2 commits into TylerYep:main from DrMicrobit:bugfix/get-device
