Skip to content
This repository was archived by the owner on Aug 12, 2020. It is now read-only.
This repository was archived by the owner on Aug 12, 2020. It is now read-only.

Preprocessing - What do we need? What do we have? #1

Open
@LukeMathWalker

Description

@LukeMathWalker

Context: see rust-ml/discussion#1.

This is meant to be a list of functionality we want to implement (a roadmap?) - I have refrained from including more sophisticated methods, limiting to what I believe to be a set of "core" routines we should absolutely offer.
For each piece of functionality I'd like to document what is already available in the Rust ecosystem.

This is meant to be a WIP list, so feel free to chip in @jblondin and edit/add things I might have missed.

  • Dimensionality reduction:
    • Principal Component Analysis (PCA)
    • Singular Value Decomposition (SVD)
    • Independent component analysis (ICA)
    • Non-negative matrix factorization (NMF)
  • Scaling:
    • Standard scaling (zero mean and unit variance)
    • Range scaling (specify min and max)
  • Encoding:
    • One hot encoding
    • Ordinal encoding
    • Discretization of continuous variables
  • Missing values:
    • Naive imputation (constant value or using common statistics)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions