[air] remove fully_executed from Tune. #25750

Merged
merged 6 commits into ray-project:master from xwjiang2010:fully_executed on Jun 15, 2022

Conversation

xwjiang2010
Contributor

@xwjiang2010 xwjiang2010 commented Jun 14, 2022

Why are these changes needed?

Remove fully_executed from the Tune layer.

I tried this on my test bench by adding the following to ProgressReporter (printed every 5 seconds):

from ray.worker import _global_node
if _global_node.address_info:
    from ray.internal.internal_api import memory_summary
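    # Summarize object store (plasma) memory usage; stats_only limits output to the high-level stats.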
    meminfo = memory_summary(
        _global_node.address_info["address"], stats_only=True
    )
    print(meminfo)

Then I ran a Tune job with 10 trials (they should all share the same Dataset object).
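
For reference, here is a minimal sketch of the kind of job described; the dataset, trainable, and trial count are illustrative rather than the actual test bench:

import ray
from ray import tune

# Materialize the Dataset's blocks once, up front, so trials can share them.
ds = ray.data.range(10_000_000).fully_executed()

def train_fn(config, data=None):
    # Every trial receives a reference to the same materialized Dataset.
    tune.report(num_rows=data.count())

# tune.with_parameters puts the Dataset into the object store once and hands
# the same reference to each of the 10 trials.
tune.run(tune.with_parameters(train_fn, data=ds), num_samples=10)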

The console output starts as a few lines of

Plasma memory usage 152 MiB, 1 objects

and then changes to

Plasma memory usage 305 MiB, 2 objects

I cannot quite explain why the usage doubles, but at least it does not grow linearly with the number of trials (which is 10). I take that as evidence that the Dataset is being shared rather than copied per trial.

For check-ingest, I intentionally removed the session around preprocessing, for two reasons:

  1. I would like to focus on the dataset-sharing topic for now, without further complicating the subject.
  2. I am not convinced that the current behavior is what we want or what we should advertise. For example, even if I do

trainer = Trainer(datasets=xxx, preprocessor=yyy)
tuner = Tuner(
    trainer,
    param_space={"learning_rate": tune.grid_search([0.001, 0.005])},
)
tuner.fit()

I still end up with two copies of the preprocessed dataset, which seems very inefficient.

Related issue number

Checks

  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Contributor

@ericl ericl left a comment

Code changes look good, but what is the reason for changing the docs to such a large extent? I had in mind only a slight tweak to the doc:

  1. Say that for an experiment-wide dataset, you need to call ds.fully_executed() so the blocks are materialized before the Tuner is created. This enables dataset sharing. Note that if a preprocessor is used, only the input blocks are shared; preprocessing still happens separately (see the sketch after this list).
  2. Otherwise, the dataset will be read separately in each trial.
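
A minimal sketch of the two cases above, assuming the Dataset API as of this PR (the dataset itself is illustrative):

import ray

# Case 1: experiment-wide dataset. Calling fully_executed() materializes the
# blocks before the Tuner is created, so the blocks can be shared across trials.
shared_ds = ray.data.range(1000).fully_executed()

# Case 2: without fully_executed(), the dataset will be read separately in each trial.
per_trial_ds = ray.data.range(1000)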

@ericl ericl added the @author-action-required label Jun 14, 2022
@xwjiang2010 xwjiang2010 removed the @author-action-required label Jun 14, 2022
@ericl ericl merged commit 88d824d into ray-project:master Jun 15, 2022
@xwjiang2010 xwjiang2010 deleted the fully_executed branch July 26, 2023 19:51