Skip to content
This repository was archived by the owner on Nov 16, 2023. It is now read-only.
This repository was archived by the owner on Nov 16, 2023. It is now read-only.

Dataprep integration #146

Closed
Closed
@ganik

Description

@ganik

pip install dataprep.
dataprep is a data preparation and cleansing package with internals in .NET CLR
We will need to do changes so the code below is possible:

from nimbusml import Pipeline
from nimbusml.datasets import get_dataset
from nimbusml.ensemble import FastTreesBinaryClassifier
from nimbusml.feature_extraction.text import NGramFeaturizer
 
train_data = dataflow(customer—provided--traindata)
test_data = dataflow(customer—provided-testdata)
 
pipeline = Pipeline([ # nimbusml pipeline
    NGramFeaturizer(columns={'Features': ['Text']}),
    FastTreesBinaryClassifier(feature=['Features'], label='Label')
])
 
# fit and predict
pipeline.fit(train_data)
results = pipeline.predict(test_data)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions