Problem Description
The Azure ML Python SDK documentation offers numerous options for passing data between pipeline steps, but the currently recommended one appears to be `azureml.data.OutputFileDatasetConfig`.
However, `azureml.data.OutputFileDatasetConfig` has a limitation: it is not accepted as a valid value for the `inputs` parameter of the step classes in `azureml.pipeline.steps`, e.g. `PythonScriptStep` and `HyperDriveStep`.
To use an `OutputFileDatasetConfig` as the input of a pipeline step, `as_input()` must be called on the object; no such call is needed when the same object is used as the output of a step.
This is extremely convoluted, and it strongly suggests that `OutputFileDatasetConfig` was originally designed only as an output of a pipeline step.
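The asymmetry described above can be sketched with a small mock. The classes below are hypothetical stand-ins (not the real Azure ML SDK, whose steps cannot run outside a workspace); they only mirror the shape of the API: the config object is accepted directly as a step *output*, but must be wrapped via `as_input()` before a step will accept it as an *input*.

```python
class MockDatasetConsumptionConfig:
    """Stand-in for the object returned by as_input()."""
    def __init__(self, source, name):
        self.source = source
        self.name = name

class MockOutputFileDatasetConfig:
    """Stand-in for azureml.data.OutputFileDatasetConfig."""
    def __init__(self, name):
        self.name = name

    def as_input(self, name=None):
        # Only after this call is the object a valid step *input*.
        return MockDatasetConsumptionConfig(self, name or self.name)

class MockPythonScriptStep:
    """Stand-in for azureml.pipeline.steps.PythonScriptStep."""
    def __init__(self, script_name, inputs=(), outputs=()):
        for item in inputs:
            # Mimics the SDK rejecting a bare OutputFileDatasetConfig as an input.
            if not isinstance(item, MockDatasetConsumptionConfig):
                raise TypeError("inputs must be declared with as_input()")
        self.script_name = script_name
        self.inputs = list(inputs)
        self.outputs = list(outputs)

prepared = MockOutputFileDatasetConfig(name="prepared_data")

# Producing step: the config object is accepted directly as an output.
prep_step = MockPythonScriptStep("prep.py", outputs=[prepared])

# Consuming step: the very same object must first be converted with as_input().
train_step = MockPythonScriptStep("train.py",
                                  inputs=[prepared.as_input("training_data")])
```

Passing `prepared` directly to `inputs` raises `TypeError` in this mock, just as the real SDK rejects a bare `OutputFileDatasetConfig` there.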
Proposed solution
1. The name of the class should be changed. `OutputFileDatasetConfig` suggests that it is meant only as an output, and that it is some kind of config file used by internal classes (which it clearly is not). If the intention is for it to also serve as the input to downstream pipeline steps, the name should reflect that.
2. Allow this class to be used in the `inputs` parameter of all classes in `azureml.pipeline.steps`. The `azureml.pipeline.core.PipelineData` class already allows the user to specify it as both the input and the output of a pipeline step, although it is not the recommended approach. `PipelineData` is also a much better name for a class that transfers data between pipeline steps.
3. Alternatively to point 2, remove the `inputs` and `outputs` parameters from all classes in `azureml.pipeline.steps` and enforce that inputs be declared with `as_input()` and outputs with `as_output()`.
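To make the request concrete, here is a sketch of what point 2 could look like. All names here are invented for illustration (`PipelineFileDataset`, `ScriptStep` are not real SDK classes): a single, neutrally named dataset object is accepted directly on both sides of a step, with no `as_input()` conversion.

```python
class PipelineFileDataset:
    """Hypothetical rename of OutputFileDatasetConfig: neutral with
    respect to input/output direction."""
    def __init__(self, name):
        self.name = name

class ScriptStep:
    """Mock step that accepts the same dataset object in both
    inputs and outputs, symmetrically."""
    def __init__(self, script_name, inputs=(), outputs=()):
        for obj in list(inputs) + list(outputs):
            if not isinstance(obj, PipelineFileDataset):
                raise TypeError("expected a PipelineFileDataset")
        self.script_name = script_name
        self.inputs = list(inputs)
        self.outputs = list(outputs)

data = PipelineFileDataset("prepared_data")

# The same object wires the two steps together, with no conversion call.
prep = ScriptStep("prep.py", outputs=[data])
train = ScriptStep("train.py", inputs=[data])
```

This mirrors how `PipelineData` is already usable today, but under a name that does not imply an output-only role.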