feat: enable ray by default #787

xzdandy · 2023-05-28T06:47:11Z

This pr intends to enable ray by default in EVA.

Ray is enabled by default in eva server, test cases, notebooks
Remove experimental code directory
~~Introduce a new UDF decorator parralizable and uses it in the optimizer to filter out inexpensive functions.~~
- This is not possible now. We need UDF decorator to be associated with the class instead of the setup function, so we can get the UDF properties without initializing the UDF.

review-notebook-app · 2023-05-29T06:14:27Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

xzdandy · 2023-05-30T08:15:29Z

Currently, the optimizer rule for ray does not work for UDFs in the projection's target list. So only the following two notebooks uses ray. Their results look expected.

The reason is that LogicalGetToSeqScan for ray does not make sense, since function expressions are in the LogicalProject instead of Logical Get. We need to create a LogicalProjectToPhysical for ray.
https://github.com/georgia-tech-db/eva/blob/e26c36dd71242de005e57baa3c4a6c2ab818d2d2/eva/experimental/parallel/optimizer/rules/rules.py#L90

xzdandy · 2023-05-31T09:08:13Z

@gaurav274 @jiashenC Any idea, why we have the ray skip tag for chatgpt test cases? Also, this seems to be unit test cases instead of integration test cases.

gaurav274 · 2023-05-31T12:41:44Z

@gaurav274 @jiashenC Any idea, why we have the ray skip tag for chatgpt test cases? Also, this seems to be unit test cases instead of integration test cases.

It is a unit test because of the missing openai_api_key. Not sure about ray.

gaurav274 · 2023-05-31T12:42:19Z

@jiashenC testcases are failing because of the coverage issue, right?

jiashenC · 2023-05-31T15:03:22Z

@jiashenC testcases are failing because of the coverage issue, right?

Yes, we probably still need to have a separate ray testing without generating coverage reports. Otherwise, it will cause timeout issue because of too many coverage trace to parse which are generated by many ray processes.

jiashenC · 2023-05-31T15:04:12Z

eva/experimental/parallel/optimizer/rules/rules.py

@@ -87,45 +86,39 @@ def apply(self, before: LogicalApplyAndMerge, context: OptimizerContext):
        yield exchange_plan


-class LogicalGetToSeqScan(Rule):
+class LogicalProjectToPhysical(Rule):


Just double check, so we won't get any functional expression in the SeqScan operator anymore?

Yes. It seems that the current statement to operator generation always creates a project operator.

jarulraj · 2023-06-01T03:10:30Z

We can get rid of experimental folder?

xzdandy · 2023-06-01T05:14:48Z

@jiashenC testcases are failing because of the coverage issue, right?

Yes, we probably still need to have a separate ray testing without generating coverage reports. Otherwise, it will cause timeout issue because of too many coverage trace to parse which are generated by many ray processes.

It is pending on local also, after all testcases are completed before the coverage report is generated. It seems that there may be some ray related resources are not correctly garbage cleaned.

xzdandy · 2023-06-01T05:15:23Z

We can get rid of experimental folder?

Make sense. We should get rid of it. Will do.

This reverts commit babadd4.

This reverts commit 84ba13e.

xzdandy · 2023-06-01T09:09:19Z

Update:

Rollback the UDF decorator changes for ray. The current UDF decorator design can not support this feature in a proper manner.
After removing the experimental directory, the coverage is dropping, given we are collecting coverage with ray disabled. Need to update .coveragerc and add some unit ray testcases even when ray is disabled.

This pr intends to enable ray by default in EVA. - [x] Ray is enabled by default in eva server, test cases, notebooks - [x] Remove experimental code directory - [ ] ~~Introduce a new UDF decorator `parralizable` and uses it in the optimizer to filter out inexpensive functions.~~ - This is not possible now. We need UDF decorator to be associated with the class instead of the setup function, so we can get the UDF properties without initializing the UDF.

xzdandy added 5 commits May 28, 2023 02:45

Enable ray be default

934e92c

disable ray specific test case

05cde5c

rollback

26480f1

fix typo

84a60be

Check Notebook under ray

6b5a496

jarulraj changed the title ~~Enable ray be default~~ feat: enable ray by default May 29, 2023

jarulraj added High Priority ⚡️ Integrations 🧩 Pull requests that update an integration High Effort 🏋 Difficult solution or problem to solve labels May 29, 2023

xzdandy added 11 commits May 30, 2023 05:38

fix the ray for project plan

c506d5e

fix LINTER

74f73be

fix test_rule

10d99dd

fix typo

961c750

Fix test_mat_executor

5aac2c2

remove unused imports

66ff703

Update all notebook with ray support for logicalproject

a3e5b3a

improve ray test cases

5366c70

fix LINTER

54df732

Merge branch 'master' into ray_default

3529986

fix LINTER

f92a88b

xzdandy marked this pull request as ready for review May 31, 2023 08:42

xzdandy requested review from gaurav274 and jiashenC May 31, 2023 08:42

fix testcases

65d620e

jiashenC reviewed May 31, 2023

View reviewed changes

xzdandy added 8 commits June 1, 2023 02:24

Fix cov setup for ray

1c5e333

Remove experimental directory

85f714a

LINTER

5542099

add parallelizable UDF decorators

84ba13e

fix linter

babadd4

update coverage configuration

2f8ef20

Revert "fix linter"

9340558

This reverts commit babadd4.

Revert "add parallelizable UDF decorators"

537dea9

This reverts commit 84ba13e.

gaurav274 approved these changes Jun 1, 2023

View reviewed changes

jiashenC approved these changes Jun 1, 2023

View reviewed changes

gaurav274 merged commit 22b5160 into master Jun 1, 2023

gaurav274 deleted the ray_default branch June 1, 2023 20:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: enable ray by default #787

feat: enable ray by default #787

xzdandy commented May 28, 2023 •

edited

Loading

review-notebook-app bot commented May 29, 2023

xzdandy commented May 30, 2023 •

edited

Loading

xzdandy commented May 31, 2023

gaurav274 commented May 31, 2023

gaurav274 commented May 31, 2023

jiashenC commented May 31, 2023 •

edited

Loading

jiashenC May 31, 2023

xzdandy Jun 1, 2023

jarulraj commented Jun 1, 2023

xzdandy commented Jun 1, 2023

xzdandy commented Jun 1, 2023

xzdandy commented Jun 1, 2023

feat: enable ray by default #787

feat: enable ray by default #787

Conversation

xzdandy commented May 28, 2023 • edited Loading

review-notebook-app bot commented May 29, 2023

xzdandy commented May 30, 2023 • edited Loading

xzdandy commented May 31, 2023

gaurav274 commented May 31, 2023

gaurav274 commented May 31, 2023

jiashenC commented May 31, 2023 • edited Loading

jiashenC May 31, 2023

Choose a reason for hiding this comment

xzdandy Jun 1, 2023

Choose a reason for hiding this comment

jarulraj commented Jun 1, 2023

xzdandy commented Jun 1, 2023

xzdandy commented Jun 1, 2023

xzdandy commented Jun 1, 2023

xzdandy commented May 28, 2023 •

edited

Loading

xzdandy commented May 30, 2023 •

edited

Loading

jiashenC commented May 31, 2023 •

edited

Loading