Closed
Labels
assigned (the issue is assigned to engineer), reproduced (the issue is reproduced by CQE)
Description
The latest release of AMDVLK, included in the 22.20 driver for Ubuntu 22.04, only gives single-rate performance for 16-bit operations when tested with vkpeak. AMDGPU-PRO from the same package supports packed FP16, but not packed int16. RADV recently had support for both packed fp16 and int16 merged. Merge Request
Are there plans to support double-rate 16-bit instructions in the open-source driver?
AMDVLK:
$ ./vkpeak 0
device = AMD Radeon RX 6700 XT
fp32-scalar = 12728.32 GFLOPS
fp32-vec4 = 12413.83 GFLOPS
fp16-scalar = 12912.08 GFLOPS
fp16-vec4 = 12810.49 GFLOPS
fp64-scalar = 826.05 GFLOPS
fp64-vec4 = 824.24 GFLOPS
int32-scalar = 2251.93 GIOPS
int32-vec4 = 2607.93 GIOPS
int16-scalar = 12896.81 GIOPS
int16-vec4 = 12806.51 GIOPS
AMDGPU-Pro:
$ ./vkpeak 0
device = AMD Radeon RX 6700 XT
fp32-scalar = 12839.66 GFLOPS
fp32-vec4 = 12780.54 GFLOPS
fp16-scalar = 12205.66 GFLOPS
fp16-vec4 = 21747.10 GFLOPS
fp64-scalar = 828.87 GFLOPS
fp64-vec4 = 826.11 GFLOPS
int32-scalar = 2611.42 GIOPS
int32-vec4 = 2610.99 GIOPS
int16-scalar = 12142.63 GIOPS
int16-vec4 = 11723.00 GIOPS
RADV (git-f533dff 2022-07-01 jammy-oibaf-ppa):
$ ./vkpeak 0
device = AMD Radeon RX 6700 XT (RADV NAVI22)
fp32-scalar = 12746.13 GFLOPS
fp32-vec4 = 12810.01 GFLOPS
fp16-scalar = 12953.09 GFLOPS
fp16-vec4 = 20505.55 GFLOPS
fp64-scalar = 829.03 GFLOPS
fp64-vec4 = 826.07 GFLOPS
int32-scalar = 2244.89 GIOPS
int32-vec4 = 2610.19 GIOPS
int16-scalar = 12916.61 GIOPS
int16-vec4 = 20409.49 GIOPS
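To make the comparison easier to read, here is a minimal Python sketch that divides each driver's 16-bit vec4 throughput by its fp32-vec4 throughput, using the figures copied from the three vkpeak runs above. The helper names (`ratios`, `looks_packed`) and the 1.5x cutoff are illustrative assumptions, not part of vkpeak or any driver. A ratio near 1.0 suggests single rate, and a ratio well above 1.0 suggests packed (double-rate) 16-bit instructions are being used.

```python
# vec4 figures copied from the vkpeak runs above (GFLOPS for fp, GIOPS for int).
RESULTS = {
    "AMDVLK": {"fp32-vec4": 12413.83, "fp16-vec4": 12810.49, "int16-vec4": 12806.51},
    "AMDGPU-Pro": {"fp32-vec4": 12780.54, "fp16-vec4": 21747.10, "int16-vec4": 11723.00},
    "RADV": {"fp32-vec4": 12810.01, "fp16-vec4": 20505.55, "int16-vec4": 20409.49},
}


def ratios(driver_results):
    """Return each 16-bit vec4 result relative to the same driver's fp32-vec4 result."""
    base = driver_results["fp32-vec4"]
    return {
        name: value / base
        for name, value in driver_results.items()
        if name != "fp32-vec4"
    }


def looks_packed(ratio, threshold=1.5):
    """Heuristic only: the 1.5x cutoff is an arbitrary assumption for this sketch."""
    return ratio >= threshold


if __name__ == "__main__":
    for driver, driver_results in RESULTS.items():
        for name, ratio in ratios(driver_results).items():
            label = "packed?" if looks_packed(ratio) else "single rate?"
            print(f"{driver:11s} {name:11s} {ratio:.2f}x fp32-vec4  ({label})")
```

With these numbers, the heuristic flags fp16-vec4 on AMDGPU-Pro and both fp16-vec4 and int16-vec4 on RADV, while neither AMDVLK result is flagged, which matches the observation in the description.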