Description
Support for precomputed distance matrices in DBSCAN was added in #3585.
The precomputed distance matrix is quadratic in size, which means two things:
- If the matrix fits in memory, the dataset is probably too small to benefit from MNMG.
- Sending the complete matrix to every node takes a long time, so MNMG is currently extremely slow.
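To put rough numbers on the two points above, here is a back-of-the-envelope sketch. The function names are hypothetical and not part of any library, and it assumes a dense `float32` matrix (4 bytes per entry) and an even split of rows between workers:

```python
def dense_matrix_bytes(n_samples: int, itemsize: int = 4) -> int:
    """Bytes needed for a dense n_samples x n_samples distance matrix."""
    return n_samples * n_samples * itemsize


def broadcast_bytes(n_samples: int, n_workers: int, itemsize: int = 4) -> int:
    """Total bytes held across workers if every worker gets the full matrix."""
    return n_workers * dense_matrix_bytes(n_samples, itemsize)


def scattered_bytes_per_worker(n_samples: int, n_workers: int, itemsize: int = 4) -> int:
    """Bytes per worker if each worker only holds a contiguous block of rows."""
    rows_per_worker = -(-n_samples // n_workers)  # ceiling division
    return rows_per_worker * n_samples * itemsize


if __name__ == "__main__":
    n, workers = 100_000, 8
    print("full matrix:", dense_matrix_bytes(n) / 1e9, "GB")
    print("broadcast to all workers:", broadcast_bytes(n, workers) / 1e9, "GB")
    print("scattered, per worker:", scattered_bytes_per_worker(n, workers) / 1e9, "GB")
```

Under these assumptions the full matrix for 100,000 samples is already 40 GB, while a row-scattered layout would leave each of 8 workers with 5 GB.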
Is there a use case for supporting a distance matrix that is scattered across the nodes (which would solve both points above)? It would require changing the API a bit, as well as the kernels.
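For illustration only, one possible layout would be a row-wise split, where each worker holds a contiguous block of rows with all columns. The sketch below is plain Python with hypothetical helper names; it is not the cuML API or its kernels. It assumes the scikit-learn convention that a point counts itself among its neighbors, and it only shows that the core-point test can be done locally on a row block:

```python
def row_block_bounds(n_samples, n_workers):
    """Split row indices [0, n_samples) into n_workers contiguous blocks."""
    base, extra = divmod(n_samples, n_workers)
    bounds = []
    start = 0
    for rank in range(n_workers):
        # The first `extra` workers take one additional row.
        stop = start + base + (1 if rank < extra else 0)
        bounds.append((start, stop))
        start = stop
    return bounds


def local_core_mask(row_block, eps, min_samples):
    """Core-point flags for the rows in one block.

    Each row holds the distances from one sample to all samples, so the
    neighbor count needs no data from other workers.
    """
    return [sum(d <= eps for d in row) >= min_samples for row in row_block]


# Toy 5 x 5 precomputed matrix: two tight pairs and one outlier (last row).
dist = [
    [0, 1, 5, 6, 9],
    [1, 0, 4, 6, 9],
    [5, 4, 0, 1, 9],
    [6, 6, 1, 0, 9],
    [9, 9, 9, 9, 0],
]

bounds = row_block_bounds(len(dist), n_workers=2)
core = []
for start, stop in bounds:
    # In a real MNMG setting only dist[start:stop] would live on this worker.
    core.extend(local_core_mask(dist[start:stop], eps=1.5, min_samples=2))
```

Expanding clusters across block boundaries would still need communication between workers, which is presumably where the kernel changes come in.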