GPU implementation of SimRank

NOTE: [updated version: testing-beta branch]

SimRank is a general similarity measure based on the idea that "two objects are considered to be similar if they are referenced by similar objects". In this project, SimRank is calculated over a static graph and is defined as "two objects are similar if they reference similar objects". The project aims to improve SimRank's performance with CUDA, using its parallel programming abstractions to reduce computation time, together with some other optimisations that improve the time complexity of the program.
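To make the definition concrete, here is a minimal CPU reference sketch of the iteration in Python. It is not the project's CUDA code: the function name `simrank`, the decay factor of 0.8 and the fixed iteration count are assumptions chosen for illustration. It follows the "reference similar objects" variant, so it averages over out-neighbors.

```python
from itertools import product


def simrank(out_neighbors, decay=0.8, iterations=10):
    """Naive SimRank over out-neighbors (illustrative sketch).

    out_neighbors maps every vertex to the list of vertices it references.
    decay and iterations are assumed defaults, not the project's settings.
    """
    nodes = list(out_neighbors)
    # Start from s(a, b) = 1 if a == b, else 0.
    sim = {(a, b): 1.0 if a == b else 0.0 for a, b in product(nodes, repeat=2)}
    for _ in range(iterations):
        new_sim = {}
        for a, b in product(nodes, repeat=2):
            if a == b:
                new_sim[(a, b)] = 1.0
                continue
            na, nb = out_neighbors[a], out_neighbors[b]
            if not na or not nb:
                # A vertex that references nothing is similar only to itself.
                new_sim[(a, b)] = 0.0
                continue
            # Average similarity over all pairs of referenced vertices.
            total = sum(sim[(x, y)] for x in na for y in nb)
            new_sim[(a, b)] = decay * total / (len(na) * len(nb))
        sim = new_sim
    return sim


# Hypothetical toy graph: "a" and "b" both reference "c".
graph = {"a": ["c"], "b": ["c"], "c": []}
scores = simrank(graph)
print(scores[("a", "b")])  # 0.8
```

Within one iteration every pair (a, b) is computed only from the previous iteration's scores, so the pairs are independent of each other. That independence is what makes the update a natural fit for one-thread-per-pair parallelism on a GPU.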

Compilation Instructions

Compilation

// replace 'xx' with your GPU's compute capability (70 for 7.0, 61 for 6.1, etc.)
nvcc -arch=sm_xx SimRankGPU.cu -o SimRankGPU

Execution

/* To visualize convergence graphs */
Uncomment the last line of the code, then recompile:
    system("python numpy_test.py");

/* To Execute */
./SimRankGPU

Results & Analysis

[The graphs below are Watts-Strogatz graphs, generated randomly in Python; the code for this can be found in the tests/ folder]

// All timings are in seconds, averaged over 10 executions.

#1: For a graph with 17 vertices and 26 edges

- CPU Time : 0.0493
- GPU Time : 0.0024
- Speed Up : 21.5

#2: For a graph with 150 vertices and 900 edges

- CPU Time : 1.0885
- GPU Time : 0.1207
- Speed Up : 9.018

#3: For a graph with 400 vertices and 2400 edges

- CPU Time : 18.6888
- GPU Time : 0.3473
- Speed Up : 51.03

#4: For a graph with 800 vertices and 4800 edges

- CPU Time : 164.4657
- GPU Time : 1.4588
- Speed Up : 112.74

The results above are the average running times of the CPU and GPU implementations of SimRank on each graph. As they show, running the computation in parallel on the GPU gives a considerable speed-up even on small graphs, provided the graph offers a sufficiently large number of parallel computations.
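Speed-up here is the ratio of CPU time to GPU time. The short Python sketch below recomputes it from the averaged timings listed above; the helper name `speed_up` is only illustrative. Ratios recomputed from the rounded averages can differ from the listed figures if those were derived from unrounded or per-run timings.

```python
# (vertices, edges, CPU seconds, GPU seconds), copied from the results above.
runs = [
    (17, 26, 0.0493, 0.0024),
    (150, 900, 1.0885, 0.1207),
    (400, 2400, 18.6888, 0.3473),
    (800, 4800, 164.4657, 1.4588),
]


def speed_up(cpu_seconds, gpu_seconds):
    """Return how many times faster the GPU run is than the CPU run."""
    return cpu_seconds / gpu_seconds


for vertices, edges, cpu, gpu in runs:
    print(f"{vertices} vertices, {edges} edges: {speed_up(cpu, gpu):.2f}x")
```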

Places of Improvement

No code is perfect, and there is room for improvement here too. One optimisation is to send only the data that is strictly needed to the GPU, since GPU memory allocation is one of the most time-consuming operations in a GPU-bound program. Another is to modify the algorithm to work on dynamic graphs rather than static ones.
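As one possible way to send less data, assuming the graph is sparse, the adjacency information could be packed in compressed sparse row (CSR) form before it is copied to the GPU. The sketch below is in Python for brevity; `edges_to_csr` is a hypothetical helper, and this README does not say how the project currently stores the graph.

```python
def edges_to_csr(num_vertices, edges):
    """Pack a directed edge list into CSR arrays (offsets, targets)."""
    # Count the out-degree of every vertex.
    counts = [0] * num_vertices
    for src, _ in edges:
        counts[src] += 1
    # Prefix sums give the start of each vertex's neighbor slice.
    offsets = [0] * (num_vertices + 1)
    for v in range(num_vertices):
        offsets[v + 1] = offsets[v] + counts[v]
    # Fill the neighbor slices in edge-list order.
    targets = [0] * len(edges)
    cursor = offsets[:-1].copy()  # next free slot for each vertex
    for src, dst in edges:
        targets[cursor[src]] = dst
        cursor[src] += 1
    return offsets, targets


# Hypothetical 3-vertex graph: 0 -> 2, 1 -> 2, 2 -> 0.
edges = [(0, 2), (1, 2), (2, 0)]
offsets, targets = edges_to_csr(3, edges)
# Out-neighbors of vertex v are targets[offsets[v]:offsets[v + 1]].
print(offsets, targets)  # [0, 1, 2, 3] [2, 2, 0]
```

For the largest graph above (800 vertices, 4800 edges), this layout holds 801 + 4800 = 5601 integers, compared with 640,000 entries for a dense 800 x 800 adjacency matrix. The similarity matrix itself is still one score per vertex pair, so only the graph input shrinks.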
