AutoML 2.0: how to set sampling key #6643

@torronen

Is your feature request related to a problem? Please describe.
I am trying to run cross-validation with a sampling key.

However, .SetDataset(data, fold: 10) only accepts the number of folds. I can create the cross-validation splits with a sampling key using ctx.Data.CrossValidationSplit, but .SetDataset has no overload that accepts a list of TrainTestData.

How can I pass my own IReadOnlyList<TrainTestData> cvdata to an experiment using the new AutoML API?

Example:

AutoMLExperiment experiment = ctx.Auto().CreateExperiment();

IReadOnlyList<TrainTestData> cvdata = ctx.Data.CrossValidationSplit(data, numberOfFolds: 10, samplingKeyColumnName: SamplingKeyColumn); // how to use this in the experiment?

// Configure experiment
experiment
    .SetPipeline(pipeline)
    .SetBinaryClassificationMetric(BinaryClassificationMetric.Accuracy, labelColumn: columnInference.ColumnInformation.LabelColumnName)
    .SetTrainingTimeInSeconds(60)
    .SetDataset(data, fold: 10); // how to set sampling key?
