Sparsehc-dm is a Python wrapper for the SparseHC distance matrix clustering algorithm, integrated with [STXXL](http://dx.doi.org/10.1002/spe.v38:6) for on-disk sorting. [SparseHC](http://dx.doi.org/10.1016/j.procs.2014.05.001) is a memory-efficient implementation of hierarchical agglomerative clustering. Its memory complexity is close to linear, which enables clustering of ~900,000 structures/points with 32 GB of RAM.
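
To put the figure of roughly 900,000 points in perspective, the short calculation below estimates how much space all pairwise distances would take if they were stored at once. It is only a back-of-the-envelope sketch: the helper `condensed_matrix_bytes` is hypothetical and not part of sparsehc-dm, and the 4-byte distance value is an assumption, not the library's documented storage format.

```python
def condensed_matrix_bytes(n_points: int, bytes_per_distance: int = 4) -> int:
    """Bytes needed to hold the n*(n-1)/2 pairwise distances of n_points items.

    bytes_per_distance=4 assumes single-precision floats (an assumption,
    not a documented property of sparsehc-dm).
    """
    n_pairs = n_points * (n_points - 1) // 2
    return n_pairs * bytes_per_distance


if __name__ == "__main__":
    n = 900_000
    size = condensed_matrix_bytes(n)
    print(f"{n} points -> {size / 1e12:.2f} TB of pairwise distances")
```

Under these assumptions the full set of distances is on the order of terabytes, far more than 32 GB of RAM, which is why sorting on disk and a clustering step with near-linear memory use matter at this scale.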