cpu: aarch64: Expand ARM SVE support for forward convolution. #1838

kasturedeeksha · 2024-03-22T13:29:59Z

Description

This commit expands ARM SVE support for the FP32 JIT forward convolution so that it works with multiple SVE vector lengths.

Major code changes:

Added common files jit_sve_convolution.hpp, jit_sve_convolution.cpp, jit_sve_conv_kernel.hpp, and jit_sve_conv_kernel.cpp to accommodate the additional ARM SVE vector lengths.
Set data format tags according to the SVE vector length used for forward convolution.
Replaced ldr and str instructions for vector registers with ld1w and st1w to make use of predication.
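
To illustrate the last point: ldr and str on a vector register always transfer the full vector, while ld1w and st1w are governed by a predicate register, so inactive lanes are neither read nor written. The following is a minimal scalar model of that behavior in plain C++, not the JIT code from this PR. The helper names (fp32_lanes, whilelt, ld1w, st1w, scale_fp32) are hypothetical and only mirror the instruction semantics.

```cpp
#include <cstddef>
#include <vector>

// Number of FP32 lanes in one SVE vector of vl_bits bits (e.g. 128, 256, 512).
constexpr std::size_t fp32_lanes(std::size_t vl_bits) { return vl_bits / 32; }

// Model of the whilelt predicate: lane i is active while base + i < n.
std::vector<bool> whilelt(std::size_t base, std::size_t n, std::size_t lanes) {
    std::vector<bool> pred(lanes);
    for (std::size_t i = 0; i < lanes; ++i) pred[i] = (base + i) < n;
    return pred;
}

// Model of a zeroing predicated load (ld1w): inactive lanes are not read
// from memory and are set to zero in the destination vector.
std::vector<float> ld1w(const std::vector<bool> &pred, const float *src) {
    std::vector<float> z(pred.size(), 0.0f);
    for (std::size_t i = 0; i < pred.size(); ++i)
        if (pred[i]) z[i] = src[i];
    return z;
}

// Model of a predicated store (st1w): inactive lanes leave memory untouched.
void st1w(const std::vector<float> &z, const std::vector<bool> &pred, float *dst) {
    for (std::size_t i = 0; i < pred.size(); ++i)
        if (pred[i]) dst[i] = z[i];
}

// Hypothetical example loop: scale n floats by a, one vector at a time.
// The last iteration handles the tail through the predicate alone.
void scale_fp32(const float *src, float *dst, std::size_t n, float a,
        std::size_t vl_bits) {
    const std::size_t lanes = fp32_lanes(vl_bits);
    for (std::size_t base = 0; base < n; base += lanes) {
        const std::vector<bool> pred = whilelt(base, n, lanes);
        std::vector<float> z = ld1w(pred, src + base);
        for (float &v : z) v *= a;
        st1w(z, pred, dst + base);
    }
}
```

With a 256-bit vector length the model processes 8 floats per iteration; for 10 elements, the second iteration has only two active lanes and does not touch memory past the end of the buffers.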

Checklist

General

Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit? Yes
Test output is the same with and without this commit.

make test output:

99% tests passed, 2 tests failed out of 162

Total Test time (real) = 137.49 sec

The following tests FAILED:
        145 - test_graph_unit_dnnl_conv_usm_cpu (Failed)
        150 - test_graph_unit_dnnl_large_partition_usm_cpu (Failed)
Errors while running CTest
Output from these tests are in: /home/deekshak/xybak/oss_pr/forked_oss/oneDNN_Fork/build/Testing/Temporary/LastTest.log
Use "--rerun-failed --output-on-failure" to re-run the failed cases verbosely.
make: *** [Makefile:71: test] Error 8

make test_benchdnn_modeC_conv_ci_cpu/fast output:

tests:30510 passed:16848 skipped:13662 mistrusted:0 unimplemented:0 invalid_arguments:0 failed:0 listed:0
total: 316.16s; fill: 25.08s (8%); compute_ref: 6.38s (2%); compare: 8.97s (3%);

Have you formatted the code using clang-format? Yes
cc : @kawakami-k

igorsafo

Hi @kasturedeeksha ,
Could you please squash all commits into a single one and change the commit message to something like src: cpu: aarch64: conv: forward: add sve_256 support?

abhijain1204fujitsu · 2024-03-29T01:35:00Z

@igorsafo, the requested changes have been made.
Could you please review this PR and share your feedback?

igorsafo

@kasturedeeksha Thank you, the changes LGTM!

abhijain1204fujitsu · 2024-04-08T17:11:38Z

@igorsafo, @vpirogov could you please help with merging this PR? If any update is required on our end, please let us know.

kasturedeeksha marked this pull request as ready for review March 26, 2024 06:43

igorsafo reviewed Mar 26, 2024

igorsafo added this to the v3.5 milestone Mar 26, 2024

src: cpu: aarch64: conv: forward: sve_fp32

e328f23

kasturedeeksha force-pushed the aarch64-sve-jit-convolution branch from 18b184d to e328f23 Compare March 27, 2024 12:49

kasturedeeksha changed the title ~~cpu: aarch64: Expand ARM SVE support for forward Convolution.~~ cpu: aarch64: Expand ARM SVE support for forward convolution. Mar 27, 2024

igorsafo approved these changes Mar 29, 2024

vpirogov added the platform:cpu-aarch64 Codeowner: @oneapi-src/onednn-cpu-aarch64 label Mar 29, 2024

igorsafo mentioned this pull request Apr 2, 2024

cpu: aarch64: Expand ARM SVE support in jit_uni_pool_kernel #1850

Merged

Merge branch 'main' into aarch64-sve-jit-convolution

0251388

densamoilov merged commit 3bb25fd into oneapi-src:main Apr 8, 2024
0 of 10 checks passed
