AORTA-24 : Add torch profiler for multi gpu workload run#106

Open
prosenjitdhole wants to merge 2 commits into main from prosenj_hw_q_eval_profiler_fix
Conversation

@prosenjitdhole
Collaborator

Fix for enabling the torch profiler for multi-GPU streams.

Copilot AI (Contributor) left a comment


Pull request overview

This PR updates the hw_queue_eval run CLI command to enable PyTorch profiling in a way that matches the harness’s multi-GPU stream distribution, avoiding single-GPU-only profiling behavior when multiple GPUs are available.

Changes:

  • Add a dedicated profiling phase that creates multi-GPU streams (round-robin across available GPUs) to mirror the harness behavior.
  • Synchronize all involved CUDA devices after each profiled iteration to ensure multi-GPU work is fully captured.
  • Add CLI output describing whether profiling is using single- or multi-GPU mode and the stream-to-device distribution.
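The profiling phase described above can be sketched roughly as follows. This is a hedged illustration, not the PR's actual code: the names `make_round_robin_streams`, `profile_workload`, and `step_fn` are hypothetical, and the CPU fallback is an assumption about how single-/no-GPU hosts might be handled.

```python
# Hypothetical sketch of a profiling phase that mirrors the harness's
# round-robin multi-GPU stream distribution. Not the PR's actual code.

def make_round_robin_streams(num_streams: int, num_gpus: int) -> list[int]:
    """Assign each stream index to a GPU id in round-robin order."""
    if num_gpus < 1:
        raise ValueError("need at least one GPU")
    return [i % num_gpus for i in range(num_streams)]


def profile_workload(step_fn, num_streams: int = 4, steps: int = 3):
    """Run `steps` profiled iterations, syncing every involved device."""
    import torch  # local import keeps the mapping helper torch-free

    num_gpus = torch.cuda.device_count()
    mode = "multi" if num_gpus > 1 else "single"
    print(f"profiling in {mode}-GPU mode")

    if num_gpus == 0:
        # CPU-only fallback (assumption): profile without CUDA activity.
        with torch.profiler.profile(
            activities=[torch.profiler.ProfilerActivity.CPU]
        ) as prof:
            for _ in range(steps):
                step_fn(torch.device("cpu"))
        return prof

    device_ids = make_round_robin_streams(num_streams, num_gpus)
    streams = [torch.cuda.Stream(device=d) for d in device_ids]
    print("stream -> device:", dict(enumerate(device_ids)))

    with torch.profiler.profile(
        activities=[
            torch.profiler.ProfilerActivity.CPU,
            torch.profiler.ProfilerActivity.CUDA,
        ]
    ) as prof:
        for _ in range(steps):
            # Launch one step per stream, each pinned to its device.
            for dev, stream in zip(device_ids, streams):
                with torch.cuda.stream(stream):
                    step_fn(torch.device("cuda", dev))
            # Synchronize all involved devices so each iteration's
            # multi-GPU work is fully captured in the trace.
            for dev in set(device_ids):
                torch.cuda.synchronize(dev)
    return prof
```

With 4 streams over 2 GPUs, `make_round_robin_streams(4, 2)` yields `[0, 1, 0, 1]`, i.e. streams alternate across the available devices just as the harness distributes work.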


Committing Copilot suggestion for calling setup twice.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
