Skip to content

Comments

Process outstanding CUDA events in recordEvent#619

Merged
soumith merged 1 commit intotorch:masterfrom
colesbury:cached_pinned_memory_fix
Dec 2, 2016
Merged

Process outstanding CUDA events in recordEvent#619
soumith merged 1 commit intotorch:masterfrom
colesbury:cached_pinned_memory_fix

Conversation

@colesbury
Copy link
Contributor

Without this, the cuda_events could continuously grow from calls to
cudaMemcpyAsync, but would never be processed if there were no new
pinned memory allocations.

For example:

 t1 = cutorch.createCudaHostTensor(10)
 t2 = torch.CudaTensor(10)
 while true do t2:copyAsync(t1) end

Without this, the cuda_events could continuously grow from calls to
cudaMemcpyAsync, but would never be processed if there were no new
pinned memory allocations.

For example:

 t1 = cutorch.createCudaHostTensor(10)
 t2 = torch.CudaTensor(10)
 while true do t2:copyAsync(t1) end
@soumith soumith merged commit bcbb427 into torch:master Dec 2, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants