add auto featurizer api #6187

Merged


LittleLittleCloud
Contributor

We are excited to review your PR.

So we can do the best job, please check:

  • There's a descriptive title that will make sense to other developers some time from now.
  • There's associated issues. All PR's should have issue(s) associated - unless a trivial self-evident change such as fixing a typo. You can use the format Fixes #nnnn in your description to cause GitHub to automatically close the issue(s) when your PR is merged.
  • Your change description explains what the change does, why you chose your approach, and anything else that reviewers should know.
  • You have included any necessary tests in the same PR.

This PR adds an API for the auto featurizer.

Usage

// data, context

// infer column information from data
var pipeline = context.Auto().Featurizer(data, excludeColumnNames: new[] { "Label" })
                      .Append(context.Auto().BinaryClassification("Label"));

// alternatively, infer the columns first with Auto().InferColumns
var columnInfo = context.Auto().InferColumns(data);
var pipeline = context.Auto().Featurizer(columnInfo.Columns, excludeColumnNames: new[]{"Label"})
...

@codecov

codecov bot commented May 5, 2022

Codecov Report

Merging #6187 (50facc5) into main (97a920a) will increase coverage by 0.05%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main    #6187      +/-   ##
==========================================
+ Coverage   68.29%   68.35%   +0.05%     
==========================================
  Files        1098     1103       +5     
  Lines      242080   242513     +433     
  Branches    25166    25204      +38     
==========================================
+ Hits       165326   165764     +438     
+ Misses      70187    70178       -9     
- Partials     6567     6571       +4     
Flag         Coverage Δ
Debug        68.35% <ø> (+0.05%) ⬆️
production   62.79% <ø> (+0.03%) ⬆️
test         88.86% <ø> (+0.05%) ⬆️

Flags with carried forward coverage won't be shown.

Impacted Files                                          Coverage Δ
...st/Microsoft.ML.SearchSpace.Tests/ParameterTest.cs   100.00% <0.00%> (ø)
...Microsoft.ML.SearchSpace.Tests/ChoiceOptionTest.cs   100.00% <0.00%> (ø)
src/Microsoft.ML.SearchSpace/Option/NestOption.cs        54.79% <0.00%> (ø)
...t/Microsoft.ML.SearchSpace.Tests/NestOptionTest.cs   100.00% <0.00%> (ø)
.../Microsoft.ML.AutoML.Tests/CostFrugalTunerTests.cs    96.44% <0.00%> (ø)
.../Microsoft.ML.AutoML.Tests/GridSearchTunerTests.cs    84.00% <0.00%> (ø)
...ft.ML.SearchSpace/Attribute/NestOptionAttribute.cs   100.00% <0.00%> (ø)
...ML.Transforms/Text/StopWordsRemovingTransformer.cs    86.38% <0.00%> (+0.14%) ⬆️
src/Microsoft.ML.Core/Utilities/Contracts.cs             45.47% <0.00%> (+0.20%) ⬆️
...soft.ML.SearchSpace/Option/UniformNumericOption.cs    97.46% <0.00%> (+1.26%) ⬆️
... and 7 more

@luisquintanilla luisquintanilla added the AutoML.NET Automating various steps of the machine learning process label May 12, 2022
@michaelgsharp
Member

/azp run

@azure-pipelines

Azure Pipelines successfully started running 2 pipeline(s).

@michaelgsharp michaelgsharp merged commit a2ad53c into dotnet:main May 24, 2022
@ghost ghost locked as resolved and limited conversation to collaborators Jun 23, 2022
3 participants