[TASK][FEA] Setting build_algo=auto as default for UMAP #5985

Open
@jinsolp

Description

The default build algorithm for the knn graph in UMAP is set to auto, which chooses the algorithm based on the dataset size (<= 50K uses brute force knn, > 50K uses nn descent). The other options for build_algo are brute_force_knn and nn_descent, each of which uses the named knn building algorithm without looking at the data size.
Related PR

However, to stay consistent for users who expect the same results as the previous release, brute_force_knn is used to build the knn graph in the following case, regardless of the data size:

  • random_state is explicitly set and build_algo = auto
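The selection rules above can be sketched in a few lines of Python. This is only an illustration of the described behavior, not the actual cuML code: the function name `resolve_build_algo` is hypothetical, "dataset size" is assumed to mean the number of rows, and `random_state=None` is assumed to mean "not explicitly set".

```python
from typing import Optional

# Threshold from the description above: <= 50K uses brute force knn.
BRUTE_FORCE_MAX_ROWS = 50_000


def resolve_build_algo(
    build_algo: str, n_rows: int, random_state: Optional[int] = None
) -> str:
    """Hypothetical helper: return the knn build algorithm that would be used."""
    # Explicit choices are honored without looking at the data size.
    if build_algo in ("brute_force_knn", "nn_descent"):
        return build_algo
    if build_algo != "auto":
        raise ValueError(f"unknown build_algo: {build_algo!r}")

    # Consistency exception: auto with an explicit random_state
    # (assumed here to mean "not None") always uses brute force.
    if random_state is not None:
        return "brute_force_knn"

    # Plain auto: decide from the dataset size (assumed to be the row count).
    if n_rows <= BRUTE_FORCE_MAX_ROWS:
        return "brute_force_knn"
    return "nn_descent"
```

For example, under these assumptions `resolve_build_algo("auto", 100_000)` selects nn_descent, while the same call with `random_state=42` selects brute_force_knn.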

TODO

Go through a deprecation cycle so that build_algo = auto always decides the build algorithm at runtime based on the data size, regardless of whether random_state is explicitly given.
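One possible shape for the first step of such a deprecation cycle is to keep the current behavior but warn when it differs from what plain auto would pick. The sketch below is an assumption about how this could look, not a plan taken from the codebase: the function name, the warning text, and the choice of `FutureWarning` are all hypothetical, and `random_state=None` is again assumed to mean "not explicitly set".

```python
import warnings
from typing import Optional

# Threshold from the description above: <= 50K uses brute force knn.
BRUTE_FORCE_MAX_ROWS = 50_000


def resolve_with_deprecation_warning(
    build_algo: str, n_rows: int, random_state: Optional[int] = None
) -> str:
    """Hypothetical helper: current behavior plus a warning about the change."""
    if build_algo != "auto":
        # Explicit choices are unaffected by the planned change.
        return build_algo

    size_based = (
        "brute_force_knn" if n_rows <= BRUTE_FORCE_MAX_ROWS else "nn_descent"
    )

    if random_state is not None:
        # Warn only when the future size-based choice would differ
        # from the brute force fallback used today.
        if size_based != "brute_force_knn":
            warnings.warn(
                "build_algo='auto' with an explicit random_state currently "
                "uses brute_force_knn; in a future release it will be chosen "
                "from the data size. Pass build_algo='brute_force_knn' to "
                "keep the current behavior.",
                FutureWarning,
                stacklevel=2,
            )
        return "brute_force_knn"

    return size_based
```

Warning only for inputs above the threshold keeps the message away from users whose results would not change anyway.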
