- Installation reference
- Compile with nvcc
# Example: after writing test.cu, we can use the following commands to compile and run it
$ nvcc test.cu
$ ./a.out
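A minor variation, assuming the same test.cu: nvcc also accepts -o to name the output binary instead of the default a.out (the name "hello" here is just an example).
$ nvcc -o hello test.cu
$ ./hello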
#include <stdio.h>

// Runs on the GPU (__global__ marks a function that runs on the GPU).
// A __global__ function can be called from the CPU, but a __device__ function
// can only be called from __global__ or __device__ functions.
__global__ void kernel()
{
    printf("Hello World!\n");
}

// Runs on the CPU
int main()
{
    // <<<variable_1, variable_2>>>
    // * The kernel is launched with variable_1 thread blocks
    // * Each thread block contains variable_2 threads
    kernel<<<1, 1>>>();
    // Wait for the kernel to finish so its printf output is flushed before the program exits
    cudaDeviceSynchronize();
    return 0;
}
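To see the launch configuration in action, here is a minimal sketch (not from the original notes) that launches 2 blocks of 4 threads; each thread prints its own blockIdx.x and threadIdx.x.
#include <stdio.h>

// Each of the 2 * 4 = 8 threads runs this function once
__global__ void kernel()
{
    printf("block %u, thread %u\n", blockIdx.x, threadIdx.x);
}

int main()
{
    // 2 thread blocks, 4 threads per block
    kernel<<<2, 4>>>();
    cudaDeviceSynchronize();
    return 0;
}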
- Size: Grid (huge) > Block > Thread (small)
- The execution unit is the Block.
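As a sketch of how this hierarchy is typically used (the formula below is standard CUDA practice, not taken from the notes above), each thread can compute a unique global index from its block and thread coordinates.
#include <stdio.h>

__global__ void print_global_index()
{
    // blockDim.x = threads per block, blockIdx.x = which block, threadIdx.x = which thread within the block
    int global_index = blockIdx.x * blockDim.x + threadIdx.x;
    printf("global thread index: %d\n", global_index);
}

int main()
{
    // 3 blocks * 4 threads per block = 12 threads, global indices 0..11
    print_global_index<<<3, 4>>>();
    cudaDeviceSynchronize();
    return 0;
}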
#include <stdio.h>

// Adds a and b on the GPU and stores the result in device memory pointed to by c
__global__ void add(int a, int b, int *c)
{
    *c = a + b;
}

int main()
{
    int cpu_c;
    int *gpu_c;
    // Allocate memory on the GPU
    // cudaError_t cudaMalloc (void **devPtr, size_t size)
    cudaMalloc((void**)&gpu_c, sizeof(int));
    // Calculate on the GPU
    add<<<1, 1>>>(1, 2, gpu_c);
    // Copy the result from GPU memory to CPU memory
    // cudaError_t cudaMemcpy (void *dst, const void *src, size_t count, cudaMemcpyKind kind)
    cudaMemcpy(&cpu_c, gpu_c, sizeof(int), cudaMemcpyDeviceToHost);
    printf("1 + 2 = %d\n", cpu_c);
    // cudaError_t cudaFree (void* devPtr)
    cudaFree(gpu_c);
    return 0;
}
Flow: cudaMalloc() -> cudaMemcpy() -> cudaFree()
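Since each of these calls returns a cudaError_t (as noted in the comments above), a common pattern is to check the return value. The sketch below is one possible way to do it; the check_cuda helper name is just an illustration, not part of the CUDA API.
#include <stdio.h>
#include <stdlib.h>

// Hypothetical helper: abort with a readable message if a CUDA call failed
static void check_cuda(cudaError_t err, const char *what)
{
    if (err != cudaSuccess) {
        fprintf(stderr, "%s failed: %s\n", what, cudaGetErrorString(err));
        exit(EXIT_FAILURE);
    }
}

__global__ void add(int a, int b, int *c)
{
    *c = a + b;
}

int main()
{
    int cpu_c;
    int *gpu_c;
    check_cuda(cudaMalloc((void**)&gpu_c, sizeof(int)), "cudaMalloc");
    add<<<1, 1>>>(1, 2, gpu_c);
    check_cuda(cudaMemcpy(&cpu_c, gpu_c, sizeof(int), cudaMemcpyDeviceToHost), "cudaMemcpy");
    check_cuda(cudaFree(gpu_c), "cudaFree");
    printf("1 + 2 = %d\n", cpu_c);
    return 0;
}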