Ascend NPU kernels using pto-isa. Parallel Tile Operation (PTO) is a virtual instruction set architecture designed by Ascend CANN, focusing on tile-level operations.
source /usr/local/Ascend/ascend-toolkit/set_env.sh
pip3 install -r requirements.txt
make build_wheelThe above commands will generate a wheel (i.e., pto_kernels-0.1.0-*.whl) that is pip installable.
pip install --force-reinstall pto_isa_kernels-*.whlmake test