Enable warp-per-tree inference in FIL for regression and binary classification #3760

levsnv · 2021-04-17T06:43:23Z

In FIL, this enables using multiple threads to infer a single tree on multiple rows (fewer than, equal to, or more than a warp, but often close to a warpful). It greatly speeds up inference on the first ~6 levels of each tree, which can be significant even for rather deep models. For example, random forests with a max depth of 40 may have an average depth of 13, which is barely double the depth of the greatly accelerated portion.

This PR is marked breaking because it adds a new C API parameter that has two mandatory values: threads_per_tree and n_items.
The Python API is not broken because it defaults to threads_per_tree=1 and n_items=0, and I am currently verifying that these defaults cause no performance regressions.
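
To make the idea of several threads sharing one tree concrete, here is a minimal conceptual sketch in plain Python. It is not FIL's kernel: the tree layout, the function names, and the strided assignment of rows to threads are all assumptions made for illustration. Only the parameter name threads_per_tree comes from this PR.

```python
# Conceptual sketch only: plain Python loops stand in for GPU threads.
# The node layout and helper names below are hypothetical, not FIL's.

def infer_row(tree, row):
    """Walk a single row from the root of `tree` down to a leaf."""
    node = tree[0]
    while "leaf" not in node:
        go_left = row[node["feature"]] < node["threshold"]
        node = tree[node["left"] if go_left else node["right"]]
    return node["leaf"]


def infer_tree_with_thread_group(tree, rows, threads_per_tree):
    """Simulate `threads_per_tree` threads working on the same tree.

    Assumption: simulated thread t handles rows t, t + threads_per_tree, ...
    so all threads in the group traverse the same tree on different rows.
    """
    out = [None] * len(rows)
    for thread_id in range(threads_per_tree):
        for i in range(thread_id, len(rows), threads_per_tree):
            out[i] = infer_row(tree, rows[i])
    return out


# A one-split toy tree: feature 0 < 0.5 goes left, otherwise right.
tree = {
    0: {"feature": 0, "threshold": 0.5, "left": 1, "right": 2},
    1: {"leaf": -1.0},
    2: {"leaf": 1.0},
}
rows = [[0.1], [0.9], [0.4], [0.7], [0.5]]
predictions = infer_tree_with_thread_group(tree, rows, threads_per_tree=4)
```

With threads_per_tree=1 the sketch degenerates to a single thread walking every row, which mirrors the default described above.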

levsnv · 2021-04-19T06:22:42Z

Python tests show a bug that did not show up in C++ tests with similar parameters. I will investigate, but I assume the code will not change much as a result.

cpp/include/cuml/fil/fil.h

cpp/src/fil/fil.cu

cpp/src/fil/infer.cu

cpp/test/sg/fil_test.cu

python/cuml/fil/fil.pyx

dantegd

Approved pending CI

levsnv · 2021-06-09T00:19:13Z

in test_base.py and test_pickle.py
E ImportError: cannot import name 'make_meta' from 'dask.dataframe.core' (/opt/conda/envs/rapids/lib/python3.7/site-packages/dask/dataframe/core.py)

levsnv · 2021-06-09T01:39:11Z

Hopefully, by the time CI finishes, it's a normal FIL-only PR again

levsnv · 2021-06-09T07:50:18Z

This has #3941 merged in, which, in turn, was scheduled for gpucibot to merge.

levsnv · 2021-06-10T07:40:09Z

Somehow, the build system changes did not vanish from "Files changed". Asked Dante :)

dantegd · 2021-06-14T15:01:43Z

rerun tests

codecov-commenter · 2021-06-14T17:31:38Z

Codecov Report

❗ No coverage uploaded for pull request base (branch-21.08@8fe1b05).
The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-21.08    #3760   +/-   ##
===============================================
  Coverage                ?   85.32%           
===============================================
  Files                   ?      230           
  Lines                   ?    18095           
  Branches                ?        0           
===============================================
  Hits                    ?    15439           
  Misses                  ?     2656           
  Partials                ?        0

| Flag | Coverage Δ |
| --- | --- |
| dask | `47.90% <0.00%> (?)` |
| non-dask | `77.67% <0.00%> (?)` |

Flags with carried forward coverage won't be shown.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8fe1b05...cec71e5.

dantegd · 2021-06-14T17:49:16Z

@gpucibot merge

…ification (rapidsai#3760)

In FIL, this enables using multiple threads to infer a single tree on multiple rows (fewer than, equal to, or more than a warp, but often close to a warpful). It greatly speeds up inference on the first ~6 levels of each tree, which can be significant even for rather deep models. For example, random forests with a max depth of 40 may have an average depth of 13, which is barely double the depth of the greatly accelerated portion.

This PR is marked breaking because it adds a new C API parameter that has two mandatory values: `threads_per_tree` and `n_items`.
The Python API is not broken because it defaults to `threads_per_tree=1` and `n_items=0`, and I am currently verifying that these defaults cause no performance regressions.

Authors:
- https://github.com/levsnv
- Dante Gama Dessavre (https://github.com/dantegd)

Approvers:
- Andy Adinets (https://github.com/canonizer)
- Dante Gama Dessavre (https://github.com/dantegd)

URL: rapidsai#3760

try 1

8fedfd7

github-actions bot added CUDA/C++ Cython / Python Cython or Python issue labels Apr 17, 2021

fixed python test

80b919f

levsnv added breaking Breaking change CUDA / C++ CUDA issue Perf Related to runtime performance of the underlying code labels Apr 18, 2021

levsnv added 2 commits April 17, 2021 17:30

copyright year

f54ffa6

enhanced python tests; threaded n_items through python layer; one debug print

18651ce

levsnv added the 4 - Waiting on Author Waiting for author to respond to review label Apr 18, 2021

raydouglass removed the CUDA/C++ label Apr 19, 2021

canonizer suggested changes Apr 23, 2021

levsnv changed the title ~~[WIP] Enable warp-per-tree inference in FIL~~ [WIP] Enable warp-per-tree inference in FIL for regression and binary classification Apr 27, 2021

levsnv added 6 commits April 27, 2021 19:07

refactor

3f4a5d6

style

9e5c5a4

style

8ae9e5f

addressed some review comments

5149604

Merge remote-tracking branch 'levs/refactor-cython-kwargs' into warp-per-tree-PR

cc7e365

fixed all bugs

aa40a64

levsnv changed the title ~~[WIP] Enable warp-per-tree inference in FIL for regression and binary classification~~ Enable warp-per-tree inference in FIL for regression and binary classification May 4, 2021

levsnv requested a review from canonizer May 4, 2021 08:06

levsnv added 3 - Ready for Review Ready for review by team and removed 4 - Waiting on Author Waiting for author to respond to review labels May 4, 2021

levsnv marked this pull request as ready for review May 4, 2021 08:06

levsnv requested review from a team as code owners May 4, 2021 08:06

Merge remote-tracking branch 'rapidsai/branch-0.20' into warp-per-tree-PR

39e05af

github-actions bot added the CUDA/C++ label May 4, 2021

levsnv added the improvement Improvement / enhancement to an existing function label May 4, 2021

dantegd added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Jun 6, 2021

dantegd approved these changes Jun 6, 2021

dantegd and others added 8 commits June 6, 2021 17:16

DBG Playing with using mamba a little bit more

e1bca08

DBG Remove xgboost instead of all of the above

24b3198

DBG correct ucx-py version

9692b43

FIX Change order of commented code to make the script happy

0f28f31

Merge branch 'branch-21.08' into 2108-fix-likerinfo

3440b8e

FIX Merge main and use dask main

8d20def

Merge branch 'branch-21.08' of github.com:rapidsai/cuml into warp-per-tree-PR

3f4d875

FIX add back xgboost now that package is published

71e0685

Merge branch '2108-fix-likerinfo' of github.com:dantegd/cuml into warp-per-tree-PR

2bf79d9

github-actions bot added CMake gpuCI gpuCI issue labels Jun 9, 2021

added void accumulate(..., int num_rows) back since vector leaf will need them

e71ef74

levsnv requested a review from a team as a code owner June 9, 2021 03:00

readability

7b1f2d3

Merge branch 'branch-21.08' of github.com:rapidsai/cuml into warp-per-tree-PR

cec71e5

github-actions bot removed CMake gpuCI gpuCI issue labels Jun 10, 2021

levsnv removed the request for review from a team June 10, 2021 23:29

rapids-bot bot merged commit bcb7b6c into rapidsai:branch-21.08 Jun 14, 2021
