Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[KED-1106] Add Roadmap, Objectives and better actual use cases #114

Closed
arita37 opened this issue Oct 8, 2019 · 8 comments
Closed

[KED-1106] Add Roadmap, Objectives and better actual use cases #114

arita37 opened this issue Oct 8, 2019 · 8 comments
Labels
Issue: Feature Request New feature or improvement to existing feature

Comments

@arita37
Copy link

arita37 commented Oct 8, 2019

Description

Roadmap and objective of the framework seems missing to see which direction it goes.
Expecially, what will be improved/supported and (what Not).
It allows to plan how to "potentiall" integrate into existing framework workflow.

Data Connector seems actively developed.
BUT machine learning model lifefcyle is not handle (and will or will not be).
Objectives and Roadmap will help identifying.

All major frameworks have clear roadmaps : TensorFlow, Pandas, ...

  1. Use case example :
    Example are not well chosen (ie space flight, not really relevant for most people).
    If your targets are : Data Engineer, Data Science, please five example using :
    Specific Kaggle Dataset.
    Specific dataset (embed with the repo) to see the full process.

    It will raise more attention to the community.

@arita37 arita37 added the Issue: Feature Request New feature or improvement to existing feature label Oct 8, 2019
@lorenabalan lorenabalan changed the title Add Roadmap, Objecives and better actual use cases Add Roadmap, Objectives and better actual use cases Oct 9, 2019
@lorenabalan
Copy link
Contributor

Hi @arita37, thank you for your feedback! Please see the responses below.

In terms of public roadmap and objectives, we’re a bit too young and nimble for this. :) There is a list of prioritised goals we adhere to internally, but it is flexible in terms of change, and we quite like it that way at this stage of Kedro. Maybe when we're closer to ruling the world like pandas or TensorFlow this can be revisited. Meanwhile, things that are unlikely to change and easier to grasp can be found here:
https://kedro.readthedocs.io/en/latest/06_resources/02_architecture_overview.html
https://kedro.readthedocs.io/en/latest/06_resources/01_faq.html#how-does-kedro-compare-to-other-projects
https://kedro.readthedocs.io/en/latest/06_resources/01_faq.html#what-is-the-philosophy-behind-kedro

The Spaceflight tutorial is meant to be a simple, easy-to-follow example even for data practitioners of little experience. We also didn’t want to force users to download some large datasets every time they install or upgrade Kedro, or when creating a Kedro project, which is why we keep it in a separate repository. Also, this makes it easier for tutorial-only contributions! You’re welcome to contribute a more advanced tutorial to https://github.com/quantumblacklabs/kedro-examples :)

@arita37
Copy link
Author

arita37 commented Oct 10, 2019 via email

@lorenabalan
Copy link
Contributor

It has never been implied that "there is no roadmap" or "no direction", but a matter of how we choose to share this with the public, in a constructive way. This item is on our backlog and we'll have a think about how to approach this in the future. We would also never open-source if we didn't expect/want the community to contribute and get involved in making this project even better. I believe we've been doing decently well in proving that throughout our engagement with the OS community, and we've been nothing but welcoming and open to contributions & feedback. :)

@yetudada
Copy link
Contributor

Thanks for this feedback @arita37. We've noted your concerns and will look at different ways to raise the themes that we're exploring for Kedro. It might be that we make this public via GitHub Projects so that we can get community input on what we're tackling next.

You're right that this needs to be evidenced. I'll reopen this issue, complete that task and then close it again.

@yetudada yetudada reopened this Oct 10, 2019
@arita37
Copy link
Author

arita37 commented Oct 10, 2019 via email

@lorenabalan lorenabalan changed the title Add Roadmap, Objectives and better actual use cases [KED-1106] Add Roadmap, Objectives and better actual use cases Oct 11, 2019
@yetudada
Copy link
Contributor

Hi @arita37, I'm getting to create some actionable tasks from this issue so that I can close it.

You will see, our uploaded product themes here: https://github.com/quantumblacklabs/kedro/projects/1

And in terms of your thoughts on the extending the AbstractDataSet, that is something that our user have been doing. We have a contrib system, see here that users have been adding new data connecters and in some cases users have built their own model classes like the MlflowDataset, see #113.

What do you have in mind for the AbstractPipeline?

And lastly, have you been able to pick up Kedro at all? Or did you go for Mlflow?

@arita37
Copy link
Author

arita37 commented Oct 31, 2019

Thanks for the start of roadmap.
This is indeed useful to see the visbility over 1 year.
Main point of MLFlow, besides the functionality, is we have confidence that DataBricks will support
the Open Source for some time (ie as they do for Spark) and also integrate their external users concerns.

Kedro being 1/2 proprietary and 1/2 open,
it might end up like GraphLab (bought by Apple)
Check the story of sframe.... https://github.com/turi-code/SFrame

Setting the mid-term ojectives of Kedros will help a lot, especially where it stands between the framework.

Thanks !

@yetudada
Copy link
Contributor

yetudada commented Nov 4, 2019

@arita37 Thanks for the comment. This isn't even the roadmap for the year but rather the quarter with possible extensions into Q1 2020. In terms of support model, the Kedro team is fully staffed to support it because the tool is so heavily used internally and by our clients.

We define the vision of where we see Kedro going in our contribution docs. I'm going to close this ticket for now. Thank you!

@yetudada yetudada closed this as completed Nov 4, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Issue: Feature Request New feature or improvement to existing feature
Projects
None yet
Development

No branches or pull requests

3 participants