MULTIANS - Massively Parallel ANS Decoding on GPUs

An implementation of a novel algorithm for ANS (Asymmetric Numeral Systems) decoding on GPUs.

For a detailed description of the concept, please refer to our conference paper.

The algorithm is capable of decoding raw (unpartitioned) ANS-encoded datastreams of variable size at extremely high throughput rates.

The method does not require any vendor-specific features. Although this implementation uses the CUDA toolkit, porting it to related parallel programming frameworks, such as OpenCL, should be straightforward.

State count and alphabet size are configurable. At its current increment, the decoder supports input data encoded using a single table and a radix of b = 2 (i.e. encoder emits single bits during renormalization), and alphabet sizes of up to 256 symbols. Another implementation supporting multiple tables / multiple states is subject of future work.

The sourcecode also includes a (very basic) single-state tANS encoder for testing, as well as a multicore-based implementation of the method for comparison with the GPU version.

Requirements

CUDA-enabled GPU with compute capability 3.0 or higher
GNU/Linux
CUDA SDK 9 or higher
latest proprietary graphics drivers

Compilation process

Configuration

Please edit the Makefile:

Set ARCH to the compute capability of your GPU, i.e. ARCH = 35 for compute capability 3.5. If you'd like to compile the decoder for multiple generations of GPUs, please edit NVCC_FLAGS accordingly.

Test program

The test program will generate multiple random datasets (256 symbols) of user-specified size. The symbols are exponentially distributed with increasing rate parameters (λ), yielding different compression ratios for different sets.

For each dataset, the program will:

encode the data into a single compressed stream using tANS
copy / decode the compressed data on a specified GPU
decode the compressed data using a specified number of CPU threads
print the time elapsed for each decoding process

Compiling the test program

To compile the test program, configure the Makefile as described above. Run:

make

Running the test program

./bin/demo <compute device index> <size of input in megabytes> <number of CPU threads>

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
bin		bin
include		include
src		src
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MULTIANS - Massively Parallel ANS Decoding on GPUs

Requirements

Compilation process

Configuration

Test program

Compiling the test program

Running the test program

About

Releases

Packages

Languages

License

weissenberger/multians

Folders and files

Latest commit

History

Repository files navigation

MULTIANS - Massively Parallel ANS Decoding on GPUs

Requirements

Compilation process

Configuration

Test program

Compiling the test program

Running the test program

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages