Porting to Arm #16

LorienLV · 2023-11-07T13:28:51Z

This pull request aims to enhance the compatibility and performance of GenomicsBench on Arm architectures. It introduces several noteworthy changes to these kernels:

BSW: Added SVE intrinsics version as an alternative to SSE, AVX2, and AVX-512.
CHAIN: This kernel is no longer linked to Minimap2, as the dependency is no required. This change ensures compatibility on Arm machines.
FAST-CHAIN: This is a version of CHAIN that removes all the heuristics to vectorize the kernel. It includes a x86-intrinsics version (SSE, AVX2, AVX512), and an SVE-intrinsics version. The x86 version of FAST-CHAIN uses 32-bit elements, which can be insufficient for some inputs. A 64-bit version of the kernel is also provided.
KMER-CNT: We have modified the kernel to allow better performance and parallel scalability at the cost of 2x more memory utilization. This version is not only faster in Arm but also in x86. The high-memory version can be enabled by using the --highmem parameter when executing the kernel.
NN-VARIANT: Ported the kernel from Clair (Tensorflow) to Clair3 (Tensorflow2) due to the predominance of Arm-optimized Tensorflow v2. Migrating to Clair3 means that the inputs and the model of the kernel have also changed. The command line added to the execution scripts uses Oxford Nanopore r941 prom hac g360+g422 pre-trained model, chromosome 20 of HG002 from NITS’s Genome in a Bottle (GIAB) project, and the following input regions: region_for_small_input.txt and region_for_lage_input.txt.
WFA: This is a new kernel that implements the Wavefront Alignment Algorithm (WFA). It uses the input format as BSW and applies the same multithreading scheduling.

…y and base versions

LorienLV added 11 commits September 28, 2023 16:49

BSW Arm-SVE version

c5247db

CHAIN does not need minimap2 to work

fde80f4

CHAIN Arm-specific bug fix

9d834c5

Vectorized version of CHAIN, FAST-CHAIN, added

262de4c

KMER-CNT now uses OpenMP to make thread binding easier

228bf7e

KMER-CNT version for high-memory systems

d4232f1

WFA with support for Arm-SVE added

33e2773

Added command line parameter to KMER-CNT to select between high-memor…

323b429

…y and base versions

Now WFA uses the same inputs as BSW

46c4750

NN-VARIANT migrated to Clair3

bf0e194

New kernels added to execution scripts

ce5e9b3

dslarm mentioned this pull request Feb 26, 2025

bwa-mem2: fix build for aarch64 to pin to not use SVE bioconda/bioconda-recipes#54134

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Porting to Arm #16

Porting to Arm #16

Uh oh!

LorienLV commented Nov 7, 2023

Uh oh!

Uh oh!

Porting to Arm #16

Are you sure you want to change the base?

Porting to Arm #16

Uh oh!

Conversation

LorienLV commented Nov 7, 2023

Uh oh!

Uh oh!