This library is an efficient GPU implemetation of zkSNARK. It contains source code of the paper cuZK: Accelerating Zero-Knowledge Proof with A Faster Parallel Multi-Scalar Multiplication Algorithm on GPUs submitted to TCHES 2023.
This library is licensed under the Apache License Version 2.0 and MIT licenses.
Our experiments can be finished in the following setup.
- Ubuntu 20.04
- CUDA 11.5
- gcc 7.5.0
- Nvidia V100 (32 GB)
First we need to make sure NVIDIA CUDA Toolkit is installed.
Our GPU Implemetation of MSM relies on CUB
, which provides state-of-the-art, reusable software components for every layer of the CUDA programming model.
By default, CUB is included in the CUDA Toolkit. If there is no CUB after installing the CUDA Toolkit, it is no need to build CUB separately. CUB is implemented as a C++ header library. To use CUB primitives in your code, simply:
- Download and unzip the latest CUB distribution
- #include the <cub/cub.cuh> header file in your CUDA C++ sources.
- Compile your program with NVIDIA's nvcc CUDA compiler, specifying a -I include-path flag to reference the location of the CUB header library.
After that, we build the library. (It will take some time to compile)
cd test
make
To run a test of an MSM of 2^20
scale and EC points on the BLS12-381 curve, run:
## (It will take some time to run for the first time.)
./msmtestb 20
To run a test of Groth protocol with 2^20
constraint scales and EC points on the BLS12-381 curve, run:
## (It will take some time to run for the first time.)
./testb 20
For EC points on the ALT_BN128 curve and MNT4, run:
## ALT_BN128
./msmtesta 20
./testb 20
## MNT4
./msmtestm 20
./testm 20
In addition, our BLS12-377 carve implementation has a Rust binding with the template from Sppark developed by Supranational LLC. To install the latest version of Rust, first install rustup. Once rustup is installed, install the Rust toolchain by invoking:
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
rustup install stable
To test BLS12-377 carve implementation.
cd test/BLS377
cargo bench
Here are a selection of the results tested under NVIDIA V100 GPU card with BLS12-381 curve. More results can be found in the paper cuZK.
For MSM computation,
Scale | Bellperson | cuZK | Speedup |
---|---|---|---|
2^19 | 0.23 s | 0.12 | 2.08x |
2^20 | 0.41 s | 0.19 | 2.18x |
2^21 | 0.73 s | 0.33 | 2.20x |
2^22 | 1.30 s | 0.58 | 2.25x |
2^23 | 2.64 s | 1.15 | 2.29x |
For Groth's protocol,
Scale | Bellperson | cuZK | Speedup |
---|---|---|---|
2^19 | 2.62 s | 0.98 | 2.67x |
2^20 | 4.45 s | 1.68 | 2.65x |
2^21 | 7.96 s | 2.76 | 2.88x |
2^22 | 14.20 s | 5.08 | 2.80x |
2^23 | 29.13 s | 9.91 | 2.94x |