release : fix windows hip release #13707

slaren · 2025-05-22T13:37:24Z

slaren · 2025-05-22T13:42:27Z

@no1wudi @IMbackK I would like to simplify the windows HIP releases, do you know if it would be possible to include all the GPU targets in a single binary, or is it strictly necessary to have a different release for each target? Currently we have different releases for gfx1100, gfx1101, gfx1030.

IMbackK · 2025-05-22T14:50:50Z

Its not necessary, no. you can have fat binaries.

Im not sure how useful the windows builds are atm really since the list of targets is so small and rather random.
Unfortunately atm rocm on windows is not really supportable as, unlike linux rocm, its very inflexible with the target architectures, to support rdna2 you would have to build for all gfx103x variants for instance, despite these being identical in code generation. This is feasible neither in terms of binary size nor in terms of build time.

The only really feasible way to support windows rocm is to have people self build.

slaren · 2025-05-22T15:18:47Z

Thanks for the insight. I wonder how other projects handle this. @YellowRoseCx I believe you distribute koboldcpp packages for rocm, can you share some insights of how you handle the different GPU targets in a single release?

IMbackK · 2025-05-22T17:08:32Z

koboldcpp is built for gfx803, gfx900, gfx906, gfx908, gfx90a, gfx1010, gfx1030, gfx1031, gfx1032, gfx1100, gfx1101 and gfx1102
This list includes various chips you can not actually run on windows anyhow (gfx803, gfx900, gfx906, gfx908, gfx90a) and fails to support various chips you could run on windows like gfx1012, gfx1033, gfx1034, gfx1035, gfx1036, gfx1103, gfx1150, gfx1151, gfx1152, gfx1153, gfx1200 and gfx1201

amd is currenly in the process of introducing generics ie gfx9-generic gfx10.3-generic, gfx11-generic and gfx12-generic that will make this mutch better as then we will only have to build those + gfx1100 and + gfx906, gfx908, gfx90a and gfx942 on linux. They are also in the process of introducing a amdgcn-spirv target to llvm that will allow similar functionality as building the cuda version for ptx

slaren · 2025-05-22T17:30:17Z

Alright, thanks. I am going to give it a try to include all of these targets in a single binary. The rocblas libraries that are included in our HIP releases are already 900MB, so I figure that even if the size of ggml-hip.dll increases by a factor of 10 it shouldn't change too much the size of the download. If that doesn't work then I guess I will just wait until AMD adds the generic targets.

slaren · 2025-05-22T22:21:30Z

Adding all of these targets didn't work very well, so instead I added the supported radeon targets listed in the documentation. This results in a 210MB ggml-hip.dll, which is similar to the size of the CUDA backend binary.

release : fix windows hip release

5bfa534

github-actions bot added the devops improvements to build systems and github actions label May 22, 2025

ggerganov approved these changes May 22, 2025

View reviewed changes

make single hip release with multiple targets

2a7e891

slaren merged commit 3079e9a into master May 22, 2025
2 checks passed

slaren deleted the sl/fix-win-hip-release branch May 22, 2025 22:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

release : fix windows hip release #13707

release : fix windows hip release #13707

Uh oh!

slaren commented May 22, 2025

Uh oh!

slaren commented May 22, 2025 •

edited

Loading

Uh oh!

IMbackK commented May 22, 2025 •

edited

Loading

Uh oh!

slaren commented May 22, 2025

Uh oh!

IMbackK commented May 22, 2025 •

edited

Loading

Uh oh!

slaren commented May 22, 2025

Uh oh!

slaren commented May 22, 2025

Uh oh!

Uh oh!

Uh oh!

release : fix windows hip release #13707

release : fix windows hip release #13707

Uh oh!

Conversation

slaren commented May 22, 2025

Uh oh!

slaren commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

IMbackK commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

slaren commented May 22, 2025

Uh oh!

IMbackK commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

slaren commented May 22, 2025

Uh oh!

slaren commented May 22, 2025

Uh oh!

Uh oh!

Uh oh!

slaren commented May 22, 2025 •

edited

Loading

IMbackK commented May 22, 2025 •

edited

Loading

IMbackK commented May 22, 2025 •

edited

Loading