-
Notifications
You must be signed in to change notification settings - Fork 207
Description
Hello,
I am trying to optimize on a GPU a network with more than 17 millions parameters and performing gradient clipping using the following code:
gradParameters[gradParameters:gt(opt.gradientClipping)]:mul(opt.gradientClipping/ l2norm)
However, I get the following error:
/opt/torch/install/bin/luajit: bad argument #3 to '?' (mask nElements exceeds single-precision float consecutive integer precision size (2^24) at /opt/torch/extra/cutorch/lib/THC/THCTensorMasked.cu:141)
Having a look inside https://github.com/torch/cutorch/blob/master/lib/THC/THCTensorMasked.cu it is very clear (line 142) that it is a limitation with using a FLOAT32.
Hence, I was wondering if there any smart way around this problem?
I am absolutely not expert of GPUs, would using a float64 be possible?
Thank you