Skip to content

[ENH] Support for scattered precomputed distance matrix for MNMG DBSCAN #3615

Open
@Nyrio

Description

@Nyrio

Support for precomputed distance matrices in DBSCAN was added in #3585
The precomputed distance matrix is quadratic in size, which means two things:

  • If you can fit it in memory, your dataset is probably too small to take advantage of MNMG.
  • Sending the complete matrix to all the nodes is long and therefore MNMG is currently extremely slow.

Is there a use case for supporting a distance matrix that is scattered across the nodes (solves both points above)? It will require changing the API a bit, as well as the kernels.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions