Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OpenCL error #14

Open
skn123 opened this issue Jul 28, 2018 · 12 comments
Open

OpenCL error #14

skn123 opened this issue Jul 28, 2018 · 12 comments

Comments

@skn123
Copy link

skn123 commented Jul 28, 2018

Is this because of the graphics card ?

naths@naths-HP-Laptop-15-bs1xx:~/build/remora/bin$ ./Benchmark_GPU_Conv2D
performance float
Flops
35 4 8 32 10391.1
67 4 8 32 12546.4
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
131 4 8 32 64964.4
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5

@Ulfgard
Copy link
Member

Ulfgard commented Jul 28, 2018 via email

@skn123
Copy link
Author

skn123 commented Jul 31, 2018

CNugteren/CLBlast#301
Its like it is not being resolved. Tried all options.

@CNugteren
Copy link

CNugteren commented Jul 31, 2018

He reports that all regular CLBlast tests fail. And the OpenCL error -5 indicates CL_OUT_OF_RESOURCES, so I'm suspecting he wants to solve problems too large for his GPU memory. Do smaller problems work well?

@Ulfgard
Copy link
Member

Ulfgard commented Jul 31, 2018

Yeah i think i screwed up with the memory requirements on the Benchmark. Multiplying several small numbers might still lead to one very big one :)

i will fix once I am back from vacation

@skn123
Copy link
Author

skn123 commented Sep 1, 2018

@Ulfgard any fix ?

Ulfgard added a commit that referenced this issue Sep 2, 2018
@Ulfgard
Copy link
Member

Ulfgard commented Sep 6, 2018

should be fixed

@skn123
Copy link
Author

skn123 commented Sep 8, 2018

This is what I get
a.) Fresh build of CLBlast
b.) Fresh build of Remora

naths@naths-HP-Laptop-15-bs1xx:~/build/remora/bin$ ./Benchmark_GPU_Conv2D
performance float
Flops
35 4 8 32 9193.43
67 4 8 32 11677.4
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
131 4 8 32 62958.7
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
^C

and I get this error also now
[ 60%] Built target LinearRegression
/home/naths/srcs/Remora/examples/Benchmarks/opencl_conv2d.cpp:38:26: error: use
of undeclared identifier 'out_gpu'
double storage = double(out_gpu.size1() * out_gpu.size2())/1024/1024;
^
/home/naths/srcs/Remora/examples/Benchmarks/opencl_conv2d.cpp:38:44: error: use
of undeclared identifier 'out_gpu'
double storage = double(out_gpu.size1() * out_gpu.size2())/1024/1024;

@Ulfgard
Copy link
Member

Ulfgard commented Sep 8, 2018

Are you sure you pulled and rebuild the opencl_conv22d test? because this is the old output before my changes (i am now also printing the approximate storage requirements of the output)

@skn123
Copy link
Author

skn123 commented Sep 9, 2018

Now I cant even build it
naths@naths-HP-Laptop-15-bs1xx:~/build/remora$ make
[ 6%] Building CXX object examples/CMakeFiles/Benchmark_GPU_Conv2D.dir/Benchmarks/opencl_conv2d.cpp.o
/home/naths/srcs/Remora/examples/Benchmarks/opencl_conv2d.cpp:38:26: error: use
of undeclared identifier 'out_gpu'
double storage = double(out_gpu.size1() * out_gpu.size2())/1024/1024;
^
/home/naths/srcs/Remora/examples/Benchmarks/opencl_conv2d.cpp:38:44: error: use
of undeclared identifier 'out_gpu'
double storage = double(out_gpu.size1() * out_gpu.size2())/1024/1024;
^
2 errors generated.
examples/CMakeFiles/Benchmark_GPU_Conv2D.dir/build.make:62: recipe for target 'examples/CMakeFiles/Benchmark_GPU_Conv2D.dir/Benchmarks/opencl_conv2d.cpp.o' failed
make[2]: *** [examples/CMakeFiles/Benchmark_GPU_Conv2D.dir/Benchmarks/opencl_conv2d.cpp.o] Error 1
CMakeFiles/Makefile2:121: recipe for target 'examples/CMakeFiles/Benchmark_GPU_Conv2D.dir/all' failed
make[1]: *** [examples/CMakeFiles/Benchmark_GPU_Conv2D.dir/all] Error 2
Makefile:129: recipe for target 'all' failed
make: *** [all] Error 2

@Ulfgard
Copy link
Member

Ulfgard commented Sep 9, 2018

sorry for that. please rename out_gpu->out_opencl, some refactoring tool must have it gotten wrong.

now is the point i have to tell you that i will probably move the convolution out of Remora to shark, Or in a new repository with the other image-processing we implement

@skn123
Copy link
Author

skn123 commented Sep 9, 2018

This is what I get
naths@naths-HP-Laptop-15-bs1xx:~/build/remora/bin$ ./Benchmark_GPU_Conv2D
performance float
im_size filtpx incChan OutChan memOut Flops
19 4 3 16 0.0220337 756.088
35 4 3 16 0.0747681 1200.29
67 4 3 16 0.273987 1178.22
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
131 4 3 16 1.04742 3035.47
19 8 3 16 0.0220337 1504.48
35 8 3 16 0.0747681 2378.29
67 8 3 16 0.273987 2819.58
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
CLBlast: OpenCL error: clEnqueueNDRangeKernel: -5
131 8 3 16 1.04742 5916.42

@Ulfgard
Copy link
Member

Ulfgard commented Sep 9, 2018

I see. that could be the algorithm. we use an explicit matrix approach which can take quite a lot of memory. This is unfortunately not going to change until clBLAST has something better, This is how far we can provide.

I am currently working on hip-support, which should allow me to use cudnn for this on nvidia cards (and MIOpen for AMD) but as is aid: this is moved out of Remora.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants