Re-enable MIOpen (cudnn) for amd cards, default MIOPEN_FIND_MODE=FAST, PYTORCH_MIOPEN_SUGGEST_NHWC=0 #11381

alexheretic · 2025-12-17T20:47:02Z

Remove cudnn.enabled = False for AMD cards so MIOpen is enabled again.

Default env vars if not specified (so these are easy to override by users if they care):

MIOPEN_FIND_MODE=FAST solves initial slowdown issues particularly for VAE (miopen searching also seems to have little actual perf benefit if you let it run, at least in my experience on rdna3 for sdxl & wan) so this seems a better default.
PYTORCH_MIOPEN_SUGGEST_NHWC=0 This resolves the significant regression in ImageUpscaleWithModel perf with miopen enabled on > rocm 7.

In particular this improves ImageUpscaleWithModel perf on rocm7.1: 7.9s -> 2.4s
(using a simple single image example workflow).

Tested on my 7900 GRE (rdna3) on Linux with rocm 7.1 & 6.4.

Resolves #10447
Relates to #10302, #10448, pytorch/pytorch#170764, ROCm/TheRock#2485

Default MIOPEN_FIND_MODE=FAST Default PYTORCH_MIOPEN_SUGGEST_NHWC=0

alexheretic · 2025-12-17T20:50:12Z

cc @comfyanonymous can you re-check if this works as well as disabling cudnn for your test scenarios? The additional PYTORCH_MIOPEN_SUGGEST_NHWC=0 switch resolves perf issues with rocm7.1 upscaling for me.

alexheretic requested review from Kosinkadink, comfyanonymous and guill as code owners December 17, 2025 20:47

Re-enable MIOpen for amd cards

67ca7fb

Default MIOPEN_FIND_MODE=FAST Default PYTORCH_MIOPEN_SUGGEST_NHWC=0

alexheretic force-pushed the amd-rdna3-miopen-on branch from 5f8e54a to 67ca7fb Compare December 17, 2025 20:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Re-enable MIOpen (cudnn) for amd cards, default MIOPEN_FIND_MODE=FAST, PYTORCH_MIOPEN_SUGGEST_NHWC=0 #11381

Re-enable MIOpen (cudnn) for amd cards, default MIOPEN_FIND_MODE=FAST, PYTORCH_MIOPEN_SUGGEST_NHWC=0 #11381

alexheretic commented Dec 17, 2025 •

edited

Loading

Uh oh!

alexheretic commented Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Re-enable MIOpen (cudnn) for amd cards, default MIOPEN_FIND_MODE=FAST, PYTORCH_MIOPEN_SUGGEST_NHWC=0 #11381

Are you sure you want to change the base?

Re-enable MIOpen (cudnn) for amd cards, default MIOPEN_FIND_MODE=FAST, PYTORCH_MIOPEN_SUGGEST_NHWC=0 #11381

Conversation

alexheretic commented Dec 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexheretic commented Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

alexheretic commented Dec 17, 2025 •

edited

Loading