Skip to content
This repository was archived by the owner on Aug 25, 2024. It is now read-only.
This repository was archived by the owner on Aug 25, 2024. It is now read-only.

docs: tutorial: dataflow: Clustering dataflow source filter #654

Open
@johnandersen777

Description

@johnandersen777

We should add a dataflow tutorial where we run a clustering model within the
dataflow preprocessing source and not yield records in the small clusters, we're
going to use this source / dataflow as the input to training a model.

So essentially this demo shows how to use a clustering model to not train on
outliers.

Depends on: #652

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationChanges to documentationkind/dataflowIssues partaining to DataFlowskind/mlIssues partaining to machine learningp2Medium PrioritytSEsitmated Time To Complete: Short

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions