This repository was archived by the owner on Aug 12, 2020. It is now read-only.
This repository was archived by the owner on Aug 12, 2020. It is now read-only.
Preprocessing - What do we need? What do we have? #1
Open
Description
Context: see rust-ml/discussion#1.
This is meant to be a list of functionality we want to implement (a roadmap?) - I have refrained from including more sophisticated methods, limiting to what I believe to be a set of "core" routines we should absolutely offer.
For each piece of functionality I'd like to document what is already available in the Rust ecosystem.
This is meant to be a WIP list, so feel free to chip in @jblondin and edit/add things I might have missed.
- Dimensionality reduction:
- Principal Component Analysis (PCA)
- Singular Value Decomposition (SVD)
- Independent component analysis (ICA)
- Non-negative matrix factorization (NMF)
- Scaling:
- Standard scaling (zero mean and unit variance)
- Range scaling (specify min and max)
- Encoding:
- One hot encoding
- Ordinal encoding
- Discretization of continuous variables
- Missing values:
- Naive imputation (constant value or using common statistics)
Metadata
Metadata
Assignees
Labels
No labels