Pipeline AutoMLStep notebook fixed #811
Closed
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
The issues fixed here are:
I think we don't need the RunConfiguration with the explicit Docker IMAGE and pip command. And this is what I think is causing the error above in versions missmatch related to the Docker image+pip. I simplified the notebook not to use the RunConfiguration and it works good for me now.
The AutoMLConfig is using outdated data approach by providing X and y instead of an AML Dataset and the name of the label column. That's why the user gets this warning:
WARNING - The AutoMLConfig inputs you have specified will soon be deprecated. Please use the AutoMLConfig shown in our documentation: https://aka.ms/AutoMLConfigI also fixed that in my notebook by using an AML Dataset and a label name in the AutoMLConfig class.
Warning/Exception from the widget if the dataset doesn’t have a name.
• If
‘dataset = Dataset.Tabular.from_delimited_files(example_data)’doesn’t require a name/Id, why are we raising that exception below?• This is also a bug in the widget, so I created a bug in VSO about it: 615924
However, I created a WORKAROUND in the notebook by saving/registering the Dataset in the Workspace, so after that the dataset has a name and the widget doesn’t raise that exception/warning.