WIP: Add wrapper functions for setting/getting convolution group count #388

dbadrian · 2019-08-09T12:45:18Z

Based on a discussion over at Flux ( FluxML/Flux.jl#459 and FluxML/Flux.jl#330 ) @avik-pal proposed the inclusion of new functions in libcudnn.jl, namely to set the group count for a group convolution.

Further changes to other functions to actually enable group convolutions (and by that depthwise conv on the gpu), could be added to this PR based on discussion and support.
The latter because this is prolly out of depth at this point in time and will require some more diggin in.

Hopefully @avik-pal can add some more hints on what should be done :).

avik-pal · 2019-08-10T03:24:53Z

Reposting from the original thread

The next thing would be to modify the ConvDesc call here to call the wrapped function first.

You can pass the group count as a keyword argument maybe, but even I am not certain if this is the right way or we should modify NNlib cdims to incorporate it internally. @dhairyagandhi96 thoughts on the best way to do this?

dbadrian · 2019-08-13T12:11:09Z

As avik-pal pointed out, there could be a few potential points of integration, but I don't have the overview to see which are feasible and sensible.

Would be great to get some feedback from the library maintainers and I am happy to try draft up an implementation based on it.

Supporting group convolution is highly relevant to efficiently implement several modern NN architectures.

DhairyaLGandhi · 2019-08-13T12:57:39Z

I believe the correct way would be to capture it in the cdims object.

Pinging @staticfloat for his thoughts

staticfloat · 2019-08-14T22:42:24Z

Yes; we can make the DepthwiseConvDims object instead be a special-case of a GroupConvDims object, where you specify the group number (where depthwise conv is when the number of groups == number of channels).

This will require us to write an implementation of grouped convolution so that we can test all our implementations against eachother, but that shouldn't be too hard.

dbadrian · 2019-08-15T11:23:32Z

True and along the same line of argumentation a regular convolution is simply a special case with groupcount==1 (the default value)?

staticfloat · 2019-08-15T16:07:09Z

Yes, that's true. So really, we could do away with DepthwiseConvDims entirely, add a groupcount parameter to ConvDims, then have optimized implementations for groupcount == 1, groupcount == numchannels and have a fallback implementation for the general case. I'm not entirely sure how to express the groupcount == numchannels case in the type system, but I'm sure we'll figure it out. ;)

DhairyaLGandhi · 2019-08-15T16:41:52Z

I like the sound of that, Avik had a similar thing in mind as well

maleadt · 2019-12-02T07:44:19Z

Concerning this PR: the API wrapper is already in CuArrays, so we only need an API: #523

dbadrian and others added 2 commits August 9, 2019 14:24

Add wrapper functions for setting/getting convolution group count

de83892

Fix typo in function call

a4665f4

maleadt changed the title ~~Add wrapper functions for setting/getting convolution group count~~ WIP: Add wrapper functions for setting/getting convolution group count Aug 23, 2019

arhik mentioned this pull request Nov 30, 2019

Added groupwiseconv and modified depthwise conv for common interface FluxML/Flux.jl#948

Closed

maleadt mentioned this pull request Dec 2, 2019

Adding missing cudnnSetConvolutionGroupCount to pointers.json #521

Closed

maleadt closed this Dec 2, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Add wrapper functions for setting/getting convolution group count #388

WIP: Add wrapper functions for setting/getting convolution group count #388

dbadrian commented Aug 9, 2019

avik-pal commented Aug 10, 2019

dbadrian commented Aug 13, 2019 •

edited

Loading

DhairyaLGandhi commented Aug 13, 2019

staticfloat commented Aug 14, 2019

dbadrian commented Aug 15, 2019 •

edited

Loading

staticfloat commented Aug 15, 2019

DhairyaLGandhi commented Aug 15, 2019

maleadt commented Dec 2, 2019

WIP: Add wrapper functions for setting/getting convolution group count #388

WIP: Add wrapper functions for setting/getting convolution group count #388

Conversation

dbadrian commented Aug 9, 2019

avik-pal commented Aug 10, 2019

dbadrian commented Aug 13, 2019 • edited Loading

DhairyaLGandhi commented Aug 13, 2019

staticfloat commented Aug 14, 2019

dbadrian commented Aug 15, 2019 • edited Loading

staticfloat commented Aug 15, 2019

DhairyaLGandhi commented Aug 15, 2019

maleadt commented Dec 2, 2019

dbadrian commented Aug 13, 2019 •

edited

Loading

dbadrian commented Aug 15, 2019 •

edited

Loading