Profile: multithread getdict! (redo of #43805) #43816

IanButterworth · 2022-01-14T22:45:42Z

It was pointed out there were at least 2 problems with #43805 (which is now reverted)

Even preallocated dicts are unsafe to multithread write to due to internal fields that are written to each write
Spawning at once for every unique ip isn't a good idea

This PR moves to a channel-based approach, and uses one @spawn per nthreads().

I didn't use Threads.@threads here to avoid blocking, so this is possible:

julia> @profile using DifferentialEquations

julia> @time Profile.retrieve();
  1.956626 seconds (1.19 M allocations: 95.429 MiB, 0.41% gc time, 13.90% compilation time)

julia> Threads.@spawn while true
           rand(10,10) * rand(10,10)
       end
Task (runnable) @0x00007f803ffd5710

julia> @time Profile.retrieve();
  2.674690 seconds (4.53 M allocations: 3.756 GiB, 39.55% gc time)

julia> @time Profile.retrieve();
  2.276615 seconds (3.71 M allocations: 3.067 GiB, 39.71% gc time)

(I think the allocations and GC of the spawned loop are being picked up by @time here?)

cc. @vtjnash @timholy

jpsamaroo · 2022-01-15T14:26:24Z

stdlib/Profile/src/Profile.jl

-        Threads.@spawn begin
+    unique_ips = unique(has_meta(data) ? strip_meta(data) : data)
+    chnl = Channel{Tuple{UInt64,Vector{StackFrame}}}(length(unique_ips)) do ch
+        Threads.@threads for ip in unique_ips


This will hang if a thread is already running in non-yielding code. Can we at a minimum make this threading optional?

Good point. Is there a tidy way to make a @threads optional?

I have no idea, I was wondering the same thing when I wrote this. Maybe refactor the internals into its own function?

Just pushed what appears to be non-blocking in this case

timholy · 2022-01-15T20:38:16Z

stdlib/Profile/src/Profile.jl

+    # we don't want metadata here as we're just looking up ips
+    unique_ips = unique(has_meta(data) ? strip_meta(data) : data)
+    n_unique_ips = length(unique_ips)
+    n_unique_ips == 0 && return dict


The expensive part is the lookup. Can't you just

iplookups = similar(unique_ips, Vector{StackFrame}) Threads.@threads for i = 1:n_unique_ips iplookups[i] = _lookup_corrected(unique_ips[i]) end

and then stash in the Dict in single-threaded mode?

I didn't use @threads because @jpsamaroo pointed out @threads would get blocked if any thread is doing something that isn't yielding.

Though, regarding just putting it into a buffer, I was thinking that would be memory inefficient, but maybe I'm wrong because the dict is empty to start with..

I'll try removing the channel stuff.

Not a big deal, but you're not using the limited-capacity aspect of the Channel at all, so this is really just a glorified array-store. If you store by the index of the ip you don't have to worry about races.Indeed, you could replace the @threads with @spawned blocks.

stdlib/Profile/src/Profile.jl

tkf · 2022-01-15T23:40:26Z

This PR essentially just needs a parallel map. IMHO it doesn't seem like a good idea to write an ad-hoc inefficient parallel map everywhere that needs it. However, Base simply does not have a facility to implement efficient parallel folds, including parallel maps. How about decomposing Profile API in such a way that threaded_getdict! can be implemented outside the julia distribution?

Alternatively, maybe we can write slightly clumsy parallel folds inside Base that can be used for Base and stdlib but intentionally declared as internal to avoid external packages relying on it.

IanButterworth · 2022-01-16T08:51:14Z

How about decomposing Profile API in such a way that threaded_getdict! can be implemented outside the julia distribution?

I guess a faster getdict! could be provided externally for other consumers, but it would still be nice to have a fast method for the Profile.print() report format.

Regarding how to implement this if it stays in Profile, I've pushed what I think is a decent version of this that could exist in current Base. I'm not pushing for it to stay as is, but as an illustration if it were to be generalized into an internal parallel map func as you suggest.

Note that:

Only threads() tasks are spawned
Base.update_stackframes_callback[] is now executed sequentially to avoid breaking the race-free promise, as you pointed out @tkf
It won't block in the way @threads would, which lines it up for Add profiling of already running tasks via SIGINFO/SIGUSR1 #43179 if that gets approved
I kept the channel given there might be some benefit to operating the two loops in parallel

For a small profile data buffer:

This PR

julia> @btime Profile.retrieve();
  40.233 ms (19765 allocations: 10.06 MiB)

Master

julia> @btime Profile.retrieve();
  331.755 ms (295629 allocations: 26.67 MiB)

IanButterworth · 2022-01-21T05:11:38Z

How does this look @vtjnash ?

IanButterworth · 2022-01-31T22:59:46Z

Bump

tkf

LGTM

IanButterworth · 2022-02-01T22:24:12Z

I was hoping to get your review on this @vtjnash but will merge tomorrow if not

…aLang#43816)" This reverts commit e6fa3ec.

)

oscardssmith · 2022-03-02T01:06:08Z

Sometime between 1.7 and master, Profile printing has gotten way slower. Could this be the cause?

IanButterworth · 2022-03-02T01:30:10Z

Are you comparing both with the same number of threads?

I don't see a regression, and in larger tests saw an improvement with this PR.
The timing isn't precisely reliable because the number of samples won't be exactly the same

1.7.1

julia> Threads.nthreads()
6

julia> using Profile

julia> @profile begin 
       t = time()
       while time() < t + 5
       rand(10,10) * rand(10,10)
       end
       end

julia> @time Profile.print()
Overhead ╎ [+additional indent] Count File:Line; Function
=========================================================
     ╎2126  @Base/client.jl:495; _start()
     ╎ 2126  @Base/client.jl:309; exec_options(opts::Base.JLOpt...
     ╎  2126  @Base/client.jl:379; run_main_repl(interactive::B...
...
  1.470694 seconds (531.68 k allocations: 34.682 MiB, 80.94% compilation time)

julia> @time Profile.print()
Overhead ╎ [+additional indent] Count File:Line; Function
=========================================================
     ╎2126  @Base/client.jl:495; _start()
     ╎ 2126  @Base/client.jl:309; exec_options(opts::Base.JLOpt...
     ╎  2126  @Base/client.jl:379; run_main_repl(interactive::B...
...
  0.187074 seconds (44.49 k allocations: 8.632 MiB)

master

julia> Threads.nthreads()
6

julia> using Profile

julia> @profile begin 
       t = time()
       while time() < t + 5
       rand(10,10) * rand(10,10)
       end
       end

julia> @time Profile.print()
Overhead ╎ [+additional indent] Count File:Line; Function
=========================================================
     ╎2317  @Base/client.jl:522; _start()
     ╎ 2317  @Base/client.jl:318; exec_options(opts::Base.JLOpt...
     ╎  2317  @Base/client.jl:404; run_main_repl(interactive::B...
...
  1.441475 seconds (603.16 k allocations: 41.029 MiB, 0.54% gc time, 75.89% compilation time)

julia> @time Profile.print()
Overhead ╎ [+additional indent] Count File:Line; Function
=========================================================
     ╎2317  @Base/client.jl:522; _start()
     ╎ 2317  @Base/client.jl:318; exec_options(opts::Base.JLOpt...
     ╎  2317  @Base/client.jl:404; run_main_repl(interactive::B...
...
  0.138633 seconds (42.42 k allocations: 11.697 MiB, 0.80% gc time)

)

IanButterworth requested a review from vtjnash January 14, 2022 22:45

IanButterworth changed the title ~~Profile: safer getdict! via channels and limiting spawns (fix #43805)~~ Profile: safer getdict! (fix #43805) Jan 15, 2022

IanButterworth changed the title ~~Profile: safer getdict! (fix #43805)~~ Profile: fix unsafe getdict! (fix #43805) Jan 15, 2022

jpsamaroo suggested changes Jan 15, 2022

View reviewed changes

multithread getdict!

eaf99c0

IanButterworth force-pushed the ib/profile_safer_fetch branch from 12f5582 to eaf99c0 Compare January 15, 2022 17:52

IanButterworth changed the title ~~Profile: fix unsafe getdict! (fix #43805)~~ Profile: multithread getdict! (redo of #43805) Jan 15, 2022

timholy reviewed Jan 15, 2022

View reviewed changes

tkf reviewed Jan 15, 2022

View reviewed changes

stdlib/Profile/src/Profile.jl Outdated Show resolved Hide resolved

move update_stackframes_callback[] to sequential loop

1a360ac

remove channel

d4af208

kshyatt added the profiler label Jan 19, 2022

tkf approved these changes Feb 1, 2022

View reviewed changes

IanButterworth merged commit e6fa3ec into JuliaLang:master Feb 3, 2022

IanButterworth deleted the ib/profile_safer_fetch branch February 3, 2022 01:52

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 4, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

a505b78

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 4, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

ca206df

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 5, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

000cb5d

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 5, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

f9a5242

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 6, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

c9b112d

…aLang#43816)" This reverts commit e6fa3ec.

IanButterworth mentioned this pull request Feb 8, 2022

add progress bar to allocation preparation JuliaPerf/PProf.jl#53

Merged

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 9, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

ad6d049

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 9, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

91de1d4

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 9, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

8e4dbda

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 9, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

f9eb667

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 9, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

8a943b5

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 10, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

f717cd2

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 11, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

20adec5

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 11, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

961ca0c

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 11, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

7084d0d

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 11, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

75263eb

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 12, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

ccb6e55

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 12, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

caaec42

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 13, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

95c58d0

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 13, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

b71e211

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 13, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

8f623fe

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 13, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

d70d056

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 14, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

201f4de

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 14, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

dbcd63d

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 15, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

497607a

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 15, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

43f4512

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 15, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

f2e5028

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 16, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

2223d42

…aLang#43816)" This reverts commit e6fa3ec.

N5N3 added a commit to N5N3/julia that referenced this pull request Feb 17, 2022

Revert "Profile: multithread getdict! (redo of JuliaLang#43805) (Juli…

e30146b

…aLang#43816)" This reverts commit e6fa3ec.

LilithHafner pushed a commit to LilithHafner/julia that referenced this pull request Feb 22, 2022

Profile: multithread getdict! (redo of JuliaLang#43805) (JuliaLang#43816

d5680aa

)

LilithHafner pushed a commit to LilithHafner/julia that referenced this pull request Mar 8, 2022

Profile: multithread getdict! (redo of JuliaLang#43805) (JuliaLang#43816

098f7e6

)

This was referenced May 18, 2022

v1.8-beta3 breaks StatProfilerHTML.jl #45361

Closed

Profile.getdict!: Revert drive-by change-of-behavior in performance patch #45403

Closed

Uh oh!

Profile: multithread getdict! (redo of #43805) #43816

Profile: multithread getdict! (redo of #43805) #43816

Uh oh!

Conversation

IanButterworth commented Jan 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jpsamaroo Jan 15, 2022

Choose a reason for hiding this comment

Uh oh!

IanButterworth Jan 15, 2022

Choose a reason for hiding this comment

Uh oh!

jpsamaroo Jan 15, 2022

Choose a reason for hiding this comment

Uh oh!

IanButterworth Jan 15, 2022

Choose a reason for hiding this comment

Uh oh!

timholy Jan 15, 2022

Choose a reason for hiding this comment

Uh oh!

IanButterworth Jan 15, 2022

Choose a reason for hiding this comment

Uh oh!

timholy Jan 16, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tkf commented Jan 15, 2022

Uh oh!

IanButterworth commented Jan 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

IanButterworth commented Jan 21, 2022

Uh oh!

IanButterworth commented Jan 31, 2022

Uh oh!

tkf left a comment

Choose a reason for hiding this comment

Uh oh!

IanButterworth commented Feb 1, 2022

Uh oh!

oscardssmith commented Mar 2, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

IanButterworth commented Mar 2, 2022

Uh oh!

Uh oh!

IanButterworth commented Jan 14, 2022 •

edited

Loading

IanButterworth commented Jan 16, 2022 •

edited

Loading

oscardssmith commented Mar 2, 2022 •

edited

Loading