Skip to content

source code and data for comparing performance between atomic operations and graph coloring

Notifications You must be signed in to change notification settings

xizhang-hpc/sbac-atomicGraph

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

# ICPP-atomicGraph
source code and data for comparing performance between atomic operations and graph coloring

1 How to compile
The compiling script is in file makeCUDA for both double and single precision on GPU Tesla V100 and K80 respectively.

2 Data
Mesh connectivity information including only Mesh1 and Mesh2 are supplied in folder data, due to storage limit in Github.
If more mesh information is needed, please contact Xi Zhang by email address zhangx299@mail.sysu.edu.cn

3 How to simulate
Put executable file into Mesh information folder, then execute. e.g.
./makeCUDA
tar -xzvf Mesh1.tgz
cp compareDoubleBest Mesh1
./compareDoubleBest

4 performance
Performance of different algorithms is gain by executing time.
e.g.
Function No: 1, Time: 1.209404e-01 s, Loop No: 1000
It means that kernel of Function 1 (A4-SM) runs 1.209404e-01 seconds by 1000 times.

5 Function No and Algorithms
Function No: 1 for atomic operation on maximum pressure with shared memory, A-MP-SM
Function No: 2 for atomic operation on maximum pressure with no optimization, A-MP-NOPT
Function No: 3 for graph coloring operation on maximum pressure with no optimization, G-MP-NOPT
Function No: 4 for graph coloring operation on maximum pressure with reordered volume and face, G-MP-RVF
Function No: 5 for graph coloring operation on maximum pressure with shared memory, G-MP-SM
Function No: 6 for atomic operation on flux summation with no optimization, A-FS-NOPT
Function No: 7 for atomic operation on flux summation with shared memory, A-FS-SM
Function No: 8 for graph coloring on flux summation with no optimization, G-FS-NOPT
Function No: 9 for graph coloring on flux summation with reorderd flux, G-FS-RF
Function No: 10 for graph coloring on flux summation with reordered volume and face, G-FS-RVF
Function No: 11 for graph coloring on flux summation with shared memory, G-FS-SM

About

source code and data for comparing performance between atomic operations and graph coloring

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published