
Switch to native traceback in cuml #6078

Merged
merged 12 commits into rapidsai:branch-25.02 on Dec 18, 2024
Conversation

galipremsagar
Contributor

In cudf we have observed a ~10% speed-up of pytest suite execution by switching the pytest traceback to `--tb=native`:

currently:

102474 passed, 2117 skipped, 902 xfailed in 892.16s (0:14:52)

--tb=short:

102474 passed, 2117 skipped, 902 xfailed in 898.99s (0:14:58)

--tb=no:

102474 passed, 2117 skipped, 902 xfailed in 815.98s (0:13:35)

--tb=native:

102474 passed, 2117 skipped, 902 xfailed in 820.92s (0:13:40)

This PR makes a similar change to cuml.

xref: rapidsai/cudf#16851
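
For concreteness, the change amounts to telling pytest to format failures with Python's stdlib-style traceback instead of its own pretty-printed one. A minimal sketch of the configuration, assuming it lives in a `pytest.ini` as this PR introduces (the PR's actual file may set more options):

```ini
# pytest.ini -- minimal sketch; the real file in this PR may carry more options
[pytest]
addopts = --tb=native
```

The same effect can be had ad hoc on the command line with `pytest --tb=native`.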

@galipremsagar galipremsagar requested a review from a team as a code owner September 23, 2024 20:35
@github-actions github-actions bot added the Cython / Python label Sep 23, 2024
@galipremsagar galipremsagar added the improvement and non-breaking labels Sep 23, 2024
@dantegd
Member

dantegd commented Sep 23, 2024

@galipremsagar are those times for the cuDF pytest suite? It might be worth checking the time savings for cuML; I have a fresh container, so I can check those times in the next couple of days if so.

@galipremsagar
Contributor Author

@dantegd Yes, the timing improvements I posted above are from the cudf pytest suite. cuml seems to see an even better speedup, almost ~20% faster:

This PR:

==== 466 skipped, 9 xfailed, 201 xpassed, 25 warnings in 1432.43s (0:23:52) ====

link: https://github.com/rapidsai/cuml/actions/runs/11002114744/job/30550052027?pr=6078#step:7:1802

The most recently merged PR, for comparison:

==== 466 skipped, 9 xfailed, 201 xpassed, 3 warnings in 1696.39s (0:28:16) =====

link: https://github.com/rapidsai/cuml/actions/runs/10999896809/job/30543198516#step:7:1622

@trivialfis
Member

Super nice! I should try this for XGBoost as well.

@dantegd dantegd changed the base branch from branch-24.10 to branch-24.12 October 6, 2024 18:59
@dantegd
Member

dantegd commented Oct 17, 2024

/merge

@divyegala
Member

@galipremsagar could you please check what's going wrong with the pytests in this PR? I can't quite figure out what the error is exactly.

@bdice bdice changed the base branch from branch-24.12 to branch-25.02 December 11, 2024 14:20
@bdice
Contributor

bdice commented Dec 11, 2024

I merged in the upstream. Dask tests (and maybe others) still appear to be failing without a clear indication of the problem. It will probably be necessary to try a local reproduction of CI with more pytest verbosity.

Maybe we can skip this change to Dask tests if it's just Dask tests failing. Still waiting on CI to find out.

@betatim
Member

betatim commented Dec 12, 2024

I am less excited about saving five minutes out of 30 minutes than everyone else here. Both those times are basically "infinite" in terms of developer experience, which means my workflow is something like "make a change, start tests, context switch to something else, eventually come back to it later; reconstruct context, act". This means you need a lot of information when you return to the test output to be able to reconstruct the context and figure out the failure. I had a quick Google around but couldn't find examples of what the default (long?) vs native tracebacks look like in terms of the information you get.

Do we have some examples for cuml or cudf?

The five minute saving is quickly wiped out if you need to re-run the test suite more often because you are missing information.

@bdice
Contributor

bdice commented Dec 12, 2024

@betatim We had a conversation about this on Slack and there was consensus that there is effectively no loss of information by making this change. It takes a lot of time for pytest to walk through all the traces and pretty-print things. I’ll link you to the discussions in Slack.

See also: rapidsai/cudf#16851 (comment)
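
For anyone who wants to compare the two styles locally, a trivial failing test is enough; the file name and test below are hypothetical, purely for illustration:

```python
# test_tb_demo.py -- hypothetical file for comparing traceback styles.
# Run it twice and compare the output:
#   pytest --tb=long test_tb_demo.py    # pytest's pretty-printed traceback
#   pytest --tb=native test_tb_demo.py  # plain stdlib-style Python traceback
def helper():
    raise ValueError("boom")


def test_fails():
    helper()
```

Both outputs name the same frames and the same exception; the native form simply skips pytest's source-context pretty-printing, which is the per-frame walk-through cost described above.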

@betatim
Member

betatim commented Dec 12, 2024

I guess it is simple enough to switch back to full tracebacks if it turns out to be annoying.

Though it seems the real solution is to not have so many xfailed tests. If anyone wants to tackle those :D

@vyasr
Contributor

vyasr commented Dec 12, 2024

> Both those times are basically "infinite" in terms of developer experience, which means my workflow is something like "Make a change, start tests, context switch to something else, eventually come back to it later. Reconstruct context, act".

Keep in mind that unfortunately this isn't really true when CI is so backed up that you come back and it hasn't run yet, which has historically been a fairly common occurrence. It's especially not true when we're up against a release timeline and can't get things done fast enough, or when we have a global breaking change that requires rerunning CI on all PRs and it takes days to get caught up.

@bdice
Contributor

bdice commented Dec 17, 2024

> Maybe we can skip this change to Dask tests if it's just Dask tests failing.

I made this change in bf65363 and merged in the upstream. Hopefully CI will pass now.

@galipremsagar
Contributor Author

> Maybe we can skip this change to Dask tests if it's just Dask tests failing.
>
> I made this change in bf65363 and merged in the upstream. Hopefully CI will pass now.

I tried reproducing this locally without timeouts, but the tests are still running after 8+ hours. If these tests are designed to run under 2 hours on CI and they finish, we should be good.

@rapids-bot rapids-bot bot merged commit 191f7ef into rapidsai:branch-25.02 Dec 18, 2024
61 of 62 checks passed
rapids-bot bot pushed a commit that referenced this pull request Jan 9, 2025
fixes #6194

Wheel tests in this project are emitting tons of warnings like this:

> test_random_forest.py:1247
  /__w/cuml/cuml/python/cuml/cuml/tests/test_random_forest.py:1247: PytestUnknownMarkWarning: Unknown pytest.mark.memleak - is this a typo?  You can register custom marks to avoid this warning - for details, see https://docs.pytest.org/en/stable/how-to/mark.html
    @pytest.mark.memleak

I think that's because the introduction of a `pytest.ini` file in #6078 resulted in all of the `pytest` options from `pyproject.toml` being ignored.

From https://docs.pytest.org/en/stable/reference/customize.html#pytest-ini

> pytest.ini files take precedence over other files, even when empty.

I think "take precedence" there means that if `pytest` finds a `pytest.ini`, it stops searching for other configuration files.

Authors:
  - James Lamb (https://github.com/jameslamb)
  - Vyas Ramasubramani (https://github.com/vyasr)

Approvers:
  - Tim Head (https://github.com/betatim)
  - Jake Awe (https://github.com/AyodeAwe)
  - Vyas Ramasubramani (https://github.com/vyasr)

URL: #6201