Figure out why multithreading gains aren't great #12

clbarnes · 2020-11-16T14:06:28Z

8x threads results in only a ~2x speedup for containment; 1.1x speedup for ray intersections.

Off the top of my head, there's probably some combination of 3 sources:

The python-rust bridge. This would manifest as the query spending a significant portion of its time thrashing a single CPU at the start and end of the query, and possibly the multithreading gains getting worse with the number of queries, but improving with the number of rays cast for each containment query. It would be improved by Use rust-numpy for data IO #1
The constant startup cost of rayon. This would manifest as the multithreading gains improving with the number of queries. Unavoidable, unless there's a lighter runtime available (smol?). Could automatically switch to single-threaded for small number of queries when threads=True?
The N = number of queries cost of the work-stealing job scheduler, if the ray casts are very cheap. Would be improved by chunking the queries so that N = number of chunks.

Some combination of 2 and 3 are certainly certainly already a problem: benchmarked containment checks are ~2.5x faster on 0 threads than on 1.

The text was updated successfully, but these errors were encountered:

clbarnes · 2021-11-09T14:55:49Z

The fact that containment checks (multiple ray casts per task) get a better speedup than ray intersections (1 ray cast per task) implies that 3 is definitely a factor.

clbarnes · 2022-02-22T21:38:48Z

Gains are still not great with #28 , which eliminates the python-rust bridge (although I think there's still a copy involved within the rust side) and at least some of rayon's startup cost (because it uses the global thread pool rather than building a new one every query). So I guess it's the job scheduler sapping our gains.

However, reorganising to use chunks would be a massive faff, unlikely to fix this any time soon.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Figure out why multithreading gains aren't great #12

Figure out why multithreading gains aren't great #12

clbarnes commented Nov 16, 2020 •

edited

Loading

clbarnes commented Nov 9, 2021

clbarnes commented Feb 22, 2022 •

edited

Loading

Figure out why multithreading gains aren't great #12

Figure out why multithreading gains aren't great #12

Comments

clbarnes commented Nov 16, 2020 • edited Loading

clbarnes commented Nov 9, 2021

clbarnes commented Feb 22, 2022 • edited Loading

clbarnes commented Nov 16, 2020 •

edited

Loading

clbarnes commented Feb 22, 2022 •

edited

Loading