Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft proposal to move DataFusion to a new top level apache project #8491

Closed
3 of 4 tasks
Tracked by #8490 ...
alamb opened this issue Dec 11, 2023 · 9 comments
Closed
3 of 4 tasks
Tracked by #8490 ...

Draft proposal to move DataFusion to a new top level apache project #8491

alamb opened this issue Dec 11, 2023 · 9 comments
Assignees
Labels
enhancement New feature or request

Comments

@alamb
Copy link
Contributor

alamb commented Dec 11, 2023

Is your feature request related to a problem or challenge?

In general, the idea of moving DataFusion to a new top level project within the Apache Software Foundation seems to have consensus (see #6475)

Describe the solution you'd like

I would like a clear and concise proposal with a writeup of what it would take to "promote" DataFusion to a new top level project.

Describe alternatives you've considered

Tasks

@alamb alamb added the enhancement New feature or request label Dec 11, 2023
@alamb alamb self-assigned this Dec 11, 2023
@alamb
Copy link
Contributor Author

alamb commented Dec 15, 2023

Update: I spent a while researching the approproate procedure and haven't found one yet. Thus, I sent a note to the incubator email list asking for help: https://lists.apache.org/list.html?general@incubator.apache.org

Update: https://lists.apache.org/thread/r4n73pmms1lv0jbohyx1o1z13d615t99

Except from response

I believe the process is for the Board to create a new PMC whose IP and PMC members are an exact copy of the parent PMC. (Like the Unix “fork” function.) And then both PMCs delete the stuff, and PMC members, they no longer need.

Arrow itself was created via this process (from Drill). You could search the archives for the board resolution that created Arrow. The Incubator is not involved in the process. (Except in some minor ways, such as a name search.)

@alamb
Copy link
Contributor Author

alamb commented Dec 15, 2023

Update: I think I now understand the process required for the propsal and I researched what is necessary (and I updated this ticket with the relevant items).

The first thing will be to ensure the naming is fine. I think it is but we should go through the official naming approval process

@andygrove in order to do a name search / validation on DataFusion, one of the fields is "derivation". Do you have any background / text I can refer to related to how you picked the name DataFusion?

For reference, here is the relevant entry for the Arrow project:
https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-92?jql=project%20%3D%20PODLINGNAMESEARCH%20AND%20description%20~%20Arrow

@alamb
Copy link
Contributor Author

alamb commented Dec 20, 2023

I started a discussion on initial PMC membership on the mailing list: https://lists.apache.org/thread/pymrzcdw4qdptvby85f69rg3pcckl15b

@liurenjie1024
Copy link
Contributor

Just curious, will arrow-ballista be included in the new top project or still remain under arrow?

@alamb
Copy link
Contributor Author

alamb commented Dec 21, 2023

Just curious, will arrow-ballista be included in the new top project or still remain under arrow?

@liurenjie1024 that is an excellent question. I think ballista would naturally belong in the DataFusion project, but I don't know for sure.

I have stared a (new) discussion about that on the mailing list: https://lists.apache.org/thread/ob3n0d9ky0bgrryl3xn39w9k566bq00q

@liurenjie1024
Copy link
Contributor

Just curious, will arrow-ballista be included in the new top project or still remain under arrow?

@liurenjie1024 that is an excellent question. I think ballista would naturally belong in the DataFusion project, but I don't know for sure.

I have stared a (new) discussion about that on the mailing list: https://lists.apache.org/thread/ob3n0d9ky0bgrryl3xn39w9k566bq00q

+1, it would be more nature to move ballista along with datafusion.

@alamb
Copy link
Contributor Author

alamb commented Dec 26, 2023

I have filed a name search request / research item on https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-219

The next step will be to pull the entire proposal together into one coherent document for discussion

@alamb
Copy link
Contributor Author

alamb commented Dec 27, 2023

Here is a draft proposal https://docs.google.com/document/d/11WTNYS8KWScOt3ySTX39WVS6krPhUvHsuJRY9PZQx4g/edit

I also sent a note to the arrow dev list:
https://lists.apache.org/thread/c150t1s1x0kcb3r03cjyx31kqs5oc341

@alamb
Copy link
Contributor Author

alamb commented Jan 5, 2024

The proposal is drafted and under review. ETA on a board vote is April 2024. Details #6475 (comment)

I am claiming the proposal is written so closing this ticket

@alamb alamb closed this as completed Jan 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants