Saving plotly figures made inside pipeline and reload in Kedro Viz. #839

lucasjamar · 2021-07-19T11:02:58Z

Description

Many plotly figures are too complex to be defined inside the data catalog (e.g. mixing line and bar charts, dual y-axes, faceted plots). It would be great to be able to build the plot inside a node and then pass the plotly figure to the data catalog, to be saved as JSON and reloaded in Kedro Viz.

Possible Implementation

1. Keep the current implementation: if a pd.DataFrame is passed, build the plotly figure from the save args and save it as JSON. If a plotly go.Figure is passed, save it straight to JSON.
2. Split the dataset into two: rename the current implementation to plotly.PandasDataset and create a plotly.JSONDataSet for saving plotly figures as JSON.

Please let me know your thoughts!

limdauto · 2021-07-19T20:08:04Z

Hi @lucasjamar, thanks for the feedback. I think option 2 is a really good idea. We were considering it for the first release but in the end went with the current implementation, as it was already being used internally by one of our teams. Having a pure JSONDataSet for plotly would be a good, natural next step.


antonymilne · 2021-08-31T11:22:23Z

Thanks for writing this @lucasjamar - I agree with what you say, and in my (personal, not kedro team) opinion our current implementation of the plotly dataset isn't quite right. Here's what we have currently:

This is confusing because the save/load operations aren't symmetric, since the process of going from dataframe to figure is obviously irreversible. We should consider the two different ways of generating a plotly plot:

1. Simple standard plots on a pd.DataFrame that can be defined inside the data catalog.
2. More complex or general plots that are written inside a node. These would most likely be based on pd.DataFrame but shouldn't have to be: I should be able to make a go.Figure object any way I like and then save it so that it works with kedro viz.

As you say, the current implementation doesn't cater for 2.

Here's my proposed solution:

The pandas.PlotlyWriter (name inspired by matplotlib.MatplotlibWriter) enables method 1; plotly.JSONDataSet enables method 2. Method 2 is a general, conventional kedro sort of dataset; method 1 is really just a convenience wrapper to enable you to do plots from the data catalog.

The only issue I see here is that kedro viz would need to load up pandas.PlotlyWriter using something other than the dataset's load method if that's not defined. I would expect the same issue to be true for matplotlib.MatplotlibWriter though (hopefully we should be able to load up pngs on kedro viz, even if you can't load them in a pipeline?).

Also note that if you do want to load up a pandas.PlotlyWriter dataset as part of your pipeline (e.g. to write most of your plot definition in the catalog but then tweak it in a node), you would be able to do so by transcoding and loading it as a plotly.JSONDataSet. This sounds cleaner to me than defining pandas.PlotlyWriter.load in an asymmetric way.
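To make the split concrete, here is a dependency-free sketch of the two roles. It is not Kedro or plotly code: `TableFigureWriter` and `FigureJSONDataSet` are hypothetical stand-ins, the figure is a plain dict with plotly-style `data` and `layout` keys, and "transcoding" is approximated by pointing both classes at the same file.

```python
import json
import tempfile
from pathlib import Path


class FigureJSONDataSet:
    """Hypothetical symmetric dataset: load returns exactly what save wrote."""

    def __init__(self, filepath):
        self._filepath = Path(filepath)

    def save(self, figure: dict) -> None:
        self._filepath.write_text(json.dumps(figure))

    def load(self) -> dict:
        return json.loads(self._filepath.read_text())


class TableFigureWriter:
    """Hypothetical write-only dataset: builds a figure from rows, then saves it."""

    def __init__(self, filepath, plot_args: dict):
        self._filepath = Path(filepath)
        self._plot_args = plot_args

    def save(self, rows: list) -> None:
        args = self._plot_args
        figure = {
            "data": [{
                "type": args["type"],
                "x": [row[args["x"]] for row in rows],
                "y": [row[args["y"]] for row in rows],
            }],
            "layout": {"title": {"text": args.get("title", "")}},
        }
        self._filepath.write_text(json.dumps(figure))

    def load(self):
        # Deliberately undefined: the saved figure is not the rows that went in.
        raise NotImplementedError("write-only dataset")


with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "plot.json"
    rows = [{"month": "Jan", "units": 10}, {"month": "Feb", "units": 14}]
    TableFigureWriter(path, {"type": "bar", "x": "month", "y": "units"}).save(rows)

    # Same file, read through the symmetric dataset and tweaked as a node might.
    figure = FigureJSONDataSet(path).load()
    figure["layout"]["title"]["text"] = "Units per month"
    FigureJSONDataSet(path).save(figure)
    reloaded = FigureJSONDataSet(path).load()
```

The writer never needs a load method: anything that wants the figure back reads the file through the JSON dataset, so every load is the inverse of some save.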

antonymilne · 2021-10-22T15:06:12Z

In #981 we've implemented the new plotly.JSONDataSet as suggested by @lucasjamar (thank you for raising the issue). This will be part of 0.17.6. In the future we might return to renaming plotly.PlotlyDataSet and/or removing its load method but these would both be breaking changes.

Also, possibly in future we should be moving plotly_args to a subkey of save_args rather than a top-level key? Not sure...

lucasjamar added the "Issue: Feature Request" label (New feature or improvement to existing feature) Jul 19, 2021

austospumanto pushed a commit to austospumanto/kedro that referenced this issue Aug 24, 2021

[KED-2188] Drop support for .kedro.yml file in favour of pyproject.to…

d9a7905

…ml (kedro-org#839)

limdauto mentioned this issue Oct 15, 2021

Update PlotlyDataSet save method to accept Figure object #955

Closed

6 tasks

antonymilne mentioned this issue Oct 22, 2021

[KED-2906] Add new plotly.JSONDataSet #981

Merged

6 tasks

antonymilne closed this as completed Oct 22, 2021
