This repository was archived by the owner on Nov 16, 2023. It is now read-only.

Default value char_feature_extractor is different from ML.NET for NGramFeaturizer #3

Closed
@ganik

Description

By default, NGramFeaturizer in ML.NET uses CharFeatureExtractor = new NgramExtractorTransform.NgramExtractorArguments() { NgramLength = 3, AllLengths = false };
while in NimbusML it is set to null. As a result, running NGramFeaturizer with default values in ML.NET and in NimbusML produces a significant difference in AUC. The NimbusML NGramFeaturizer default needs to be fixed.
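
To illustrate what the two defaults mean, here is a minimal pure-Python sketch. It is not NimbusML or ML.NET code: `char_ngrams`, `featurize`, and `ML_NET_DEFAULT` are hypothetical names made up for this example, and the sketch ignores any special marker characters a real character tokenizer may add. It only shows that a null extractor yields no character features, while NgramLength = 3 with AllLengths = false yields character trigrams only.

```python
def char_ngrams(text, ngram_length=3, all_lengths=False):
    """Return the character n-grams of text, in order of appearance.

    all_lengths=False keeps only n-grams of exactly ngram_length;
    all_lengths=True keeps every length from 1 up to ngram_length.
    """
    lengths = range(1, ngram_length + 1) if all_lengths else [ngram_length]
    grams = []
    for n in lengths:
        grams.extend(text[i:i + n] for i in range(len(text) - n + 1))
    return grams


def featurize(text, char_extractor=None):
    """Hypothetical stand-in for a featurizer's character branch.

    char_extractor=None mimics the null default described in this issue:
    no character n-gram features are produced at all.
    """
    if char_extractor is None:
        return []
    return char_ngrams(text, **char_extractor)


# Assumed mirror of the ML.NET default quoted above (NgramLength = 3, AllLengths = false).
ML_NET_DEFAULT = {"ngram_length": 3, "all_lengths": False}

if __name__ == "__main__":
    print(featurize("nimbus"))                  # null default: no character features
    print(featurize("nimbus", ML_NET_DEFAULT))  # character trigrams only
```

Until the default is aligned, the likely workaround on the NimbusML side is to pass an explicit extractor to `char_feature_extractor` instead of relying on the default; check the exact extractor class and its arguments against the NimbusML documentation.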

Metadata

Assignees

No one assigned

    Labels

    bug (Something isn't working), good first issue (Good for newcomers)
