-
Thank you @lukegre for the suggestion. This looks interesting indeed! As you said, the clustering step isn't quite the biggest bottleneck. If you feel like it, you are welcome to open a PR adding this method.
-
I know that clustering probably isn't the biggest bottleneck in the process, but I've used FAISS in the past, and its k-means is a lot faster than scikit-learn's.
This article provides a few more details: https://www.kdnuggets.com/2021/01/k-means-faster-lower-error-scikit-learn.html
This might make searching for the optimal number of clusters quite a bit quicker (20x, if the article is anything to go by).
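
To make the idea concrete, here is a rough sketch of what the scan over cluster counts could look like with FAISS. It is not taken from this project's code: the helper names `kmeans_inertia` and `scan_cluster_counts`, the synthetic data, and the iteration count are all placeholders I made up for illustration. It assumes `faiss` (for example the `faiss-cpu` package) is installed and falls back to scikit-learn's `KMeans` if it is not, so the two can be swapped and timed against each other.

```python
import numpy as np

try:
    import faiss  # assumption: faiss-cpu or faiss-gpu is installed
    HAVE_FAISS = True
except ImportError:
    from sklearn.cluster import KMeans  # fallback for comparison
    HAVE_FAISS = False


def kmeans_inertia(x, k, n_iter=20, seed=0):
    """Fit k-means with k clusters and return (labels, inertia).

    Hypothetical helper: inertia is the sum of squared distances
    from each point to its nearest centroid.
    """
    # FAISS expects C-contiguous float32 input.
    x = np.ascontiguousarray(x, dtype=np.float32)
    if HAVE_FAISS:
        km = faiss.Kmeans(x.shape[1], k, niter=n_iter, seed=seed)
        km.train(x)
        # After training, km.index holds the centroids; searching for the
        # single nearest centroid gives squared L2 distances and labels.
        dist, idx = km.index.search(x, 1)
        return idx.ravel(), float(dist.sum())
    km = KMeans(n_clusters=k, max_iter=n_iter, n_init=1, random_state=seed).fit(x)
    return km.labels_, float(km.inertia_)


def scan_cluster_counts(x, k_values):
    """Return {k: inertia} for every candidate number of clusters."""
    return {k: kmeans_inertia(x, k)[1] for k in k_values}


# Synthetic stand-in data: three well-separated 2-D blobs, 500 points each.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
x = np.concatenate([c + rng.normal(size=(500, 2)) for c in centers])

inertias = scan_cluster_counts(x, range(2, 7))
for k, inertia in inertias.items():
    print(k, round(inertia, 1))
```

The inertia curve can then be fed into whatever criterion is used to pick the number of clusters (elbow, silhouette, and so on). Whether the speed-up matters in practice depends on how much of the total runtime the clustering step really takes.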