Open
Labels: bug (Something isn't working), plugin (PyTorch Profiler TensorBoard Plugin related)
Description
Hi, I am using the sample script resnet50_ddp_profiler.py from this repository:
https://github.com/pytorch/kineto/blob/main/tb_plugin/examples/resnet50_ddp_profiler.py
Environment:
Python 3.8
torch=2.0.1
torch-tb-profiler=0.4.3 # built from source
In TensorBoard, the Overview view reports the communication time as 0.
In the Distributed view:
- no bar charts are shown for Synchronizing/Communication Overview.
- the table at the bottom, Communication Operation Stats, shows 0 in the columns total latency, avg latency, data transfer time, and avg data transfer time.
When I instead use
Python 3.8
torch=1.11.0
torch-tb-profiler=0.4.3 # built from source
there are no issues and the views render properly.
However, with torch=1.12 and later, the same problems appear: the communication numbers and the Distributed view do not render properly.
Does anyone have any insight into why this may be the case?
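One way to narrow this down might be to compare the trace files produced under torch 1.11.0 and torch 2.0.1 and check whether communication events are recorded at all, or are recorded under different names. The sketch below is only a diagnostic aid, not part of the plugin: the helper names `load_trace` and `count_matching_events` are hypothetical, and the assumption that communication events contain the substring "nccl" in their name is a guess that may not hold for every torch version. It relies only on the traces being Chrome-trace JSON with a top-level `traceEvents` list.

```python
import gzip
import json
from collections import Counter
from pathlib import Path


def load_trace(path):
    """Load a Chrome-trace JSON file, transparently handling .gz files."""
    path = Path(path)
    opener = gzip.open if path.suffix == ".gz" else open
    with opener(path, "rt", encoding="utf-8") as f:
        return json.load(f)


def count_matching_events(trace, needle="nccl"):
    """Count trace events whose name contains `needle` (case-insensitive).

    ASSUMPTION: communication events can be recognized by "nccl" in the
    event name; adjust `needle` if your traces use different names.
    """
    counts = Counter()
    for event in trace.get("traceEvents", []):
        name = str(event.get("name", ""))
        if needle.lower() in name.lower():
            counts[name] += 1
    return counts
```

Running `count_matching_events(load_trace(path))` on one worker's trace file from each torch version and comparing the two results would show whether the event names differ between versions or whether the events are missing entirely.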