Use faster activation functions #1837

Merged (3 commits into FluxML:master, Feb 5, 2022)
Conversation

@mcabbott (Member) commented Jan 16, 2022

This substitutes tanh_fast where possible, since tanh often dominates the forward pass of small networks. Builds on #1761, but in the months that PR sat waiting I have mislaid the benchmarks I ran. Best case roughly 2x faster forwards, worst case no improvement, modulo noise.

Intersects with #1832, which would also do this to conv layers, but not to RNNs.

Closes #1272; this version is significantly faster, and this PR applies it to more cases.
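
For reference, a rough way to reproduce this kind of tanh vs tanh_fast comparison on the CPU (this is not the original, mislaid benchmark; it assumes NNlib and BenchmarkTools are installed):

using NNlib, BenchmarkTools

x = randn(Float32, 100, 100)
@btime tanh.($x)              # Base tanh, broadcast over the array
@btime NNlib.tanh_fast.($x)   # the faster approximation this PR substitutes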

@mcabbott mcabbott mentioned this pull request Jan 16, 2022
@mcabbott mcabbott added this to the v0.13 milestone Jan 16, 2022
Review thread on src/layers/basic.jl (outdated, resolved)
@CarloLucibello (Member) commented Jan 18, 2022

It's a pity to pollute the code a bit, but I guess the performance increase is worth it.

I wonder what a more aesthetically pleasing alternative could be. Pointing NNlib.sigmoid to sigmoid_fast and defining NNlib.tanh as tanh_fast?

Just for reference, the reason this PR switches the activation in the forward pass rather than at construction time is here:

My thinking is that we may later decide it's better to skip tanh_fast on the GPU. I can't measure a difference, so who knows. To do that, we can add a method in NNlibCUDA like fast_act(::typeof(tanh), ::CuArray) = tanh, provided you call this like fast = NNlib.fast_act(fun, x) with an example of the array you plan to broadcast it over.
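
For concreteness, a minimal sketch of the forward-time substitution described above, written against the fields of Flux's Dense layer (weight, bias, σ); dense_forward is an illustrative name, not code from this PR:

using Flux, NNlib

# fast_act picks a faster replacement (e.g. tanh => tanh_fast) based on the
# activation and an example of the array it will be broadcast over; a method
# added in NNlibCUDA could make it a no-op for CuArrays, as noted above.
function dense_forward(d::Dense, x::AbstractVecOrMat)
    σ = NNlib.fast_act(d.σ, x)
    return σ.(d.weight * x .+ d.bias)
end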

@mcabbott (Member, Author) commented

I wouldn't claim this as an aesthetic improvement, but it is a performance one.

I'm against less explicit names like NNlib.tanh; I think it's confusing to read packages that don't work the way they appear to, because some Base name means something different. Flux.rand used to be such a trick, and we ripped it out in favour of an explicit rand32.

@CarloLucibello (Member) commented

Ok. Should we wait for a final tag of v0.12 before merging (#1838)? I wouldn't say this PR is breaking, though.

@mcabbott (Member, Author) commented

Not so breaking as to demand a release, but if there's one nearby, perhaps that's the safe time: after the last 0.12.x tag, before v0.13, as you say.

@ToucheSir (Member) commented

Buildkite is failing with weird errors again 😬

@ToucheSir ToucheSir closed this Feb 5, 2022
@ToucheSir ToucheSir reopened this Feb 5, 2022
@ToucheSir (Member) commented

bors try

bors bot added a commit that referenced this pull request Feb 5, 2022
@bors bot (Contributor) commented Feb 5, 2022

try

Build succeeded:

Review thread on src/layers/basic.jl (outdated, resolved)
Review thread on src/layers/conv.jl (outdated, resolved)
Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>
@codecov-commenter commented Feb 5, 2022

Codecov Report

Merging #1837 (0a8fade) into master (8d3b8d3) will increase coverage by 0.09%.
The diff coverage is 100.00%.


@@            Coverage Diff             @@
##           master    #1837      +/-   ##
==========================================
+ Coverage   73.85%   73.94%   +0.09%     
==========================================
  Files          28       28              
  Lines        1683     1689       +6     
==========================================
+ Hits         1243     1249       +6     
  Misses        440      440              
Impacted Files            Coverage Δ
src/layers/basic.jl       75.00% <100.00%> (+0.19%) ⬆️
src/layers/conv.jl        80.11% <100.00%> (+0.44%) ⬆️
src/layers/recurrent.jl   75.67% <100.00%> (+0.22%) ⬆️


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>
Comment on lines +228 to +229
c′ = @. sigmoid_fast(forget) * c + sigmoid_fast(input) * tanh_fast(cell)
h′ = @. sigmoid_fast(output) * tanh_fast(c′)
A Member commented:

Ought these to get the fast_act treatment too? I'm ok with revisiting too; RNN cell inflexibility is a bit of a long-standing issue.

@mcabbott (Member, Author) replied:

Oh right. If we decide to disable tanh_fast on CuArrays, that will be ignored here. But perhaps revisit if and when... it's a bit more clutter to squeeze that in.
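
For illustration, a hedged sketch (not part of this PR) of what the fast_act treatment could look like on the two lines quoted above; σf and tanhf are local names chosen here, and forget, input, cell, output, c come from the surrounding LSTMCell code:

using NNlib: fast_act, sigmoid

# Choose fast or plain activations from the state array's type, then broadcast.
σf = fast_act(sigmoid, c)
tanhf = fast_act(tanh, c)
c′ = @. σf(forget) * c + σf(input) * tanhf(cell)
h′ = @. σf(output) * tanhf(c′)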

A Member commented:

It may be a blessing in disguise, as currently plain tanh.(...) will hit the likely obsolete https://github.com/FluxML/NNlibCUDA.jl/blob/master/src/cudnn/activations.jl.

@mcabbott mcabbott merged commit 841afe7 into FluxML:master Feb 5, 2022
@mcabbott mcabbott deleted the fastact branch February 5, 2022 18:36