Open
Description
Hi, I'm loving KeyBERT and am using it for a project. However, performance is very slow at scale when using MMR. Running the embedding model on GPU speeds things up, but the bottleneck now appears to be the MMR computation on CPU. Does KeyBERT natively support multiprocessing for that step?
My plan was to break this all out: compute my own n-grams, embed the n-grams and documents directly, and then pass the embeddings to KeyBERT in a multiprocessing setup (i.e. map a large list of embeddings across multiple KeyBERT processes that perform the MMR). Before I go down that road, I want to double-check that this isn't already supported natively in KeyBERT.
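To make the plan concrete, here is a minimal sketch of the fan-out I have in mind. It assumes the document and n-gram embeddings are already computed, and it uses a standalone NumPy version of MMR rather than KeyBERT's own code. The names `mmr_select`, `mmr_parallel`, `workers`, and `chunksize` are hypothetical and are not part of KeyBERT.

```python
from concurrent.futures import ProcessPoolExecutor

import numpy as np


def _normalize(x):
    # Scale vectors to unit length so dot products equal cosine similarity.
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.clip(norms, 1e-12, None)


def mmr_select(doc_emb, cand_embs, top_n=5, diversity=0.5):
    """Return indices of up to top_n candidates chosen by MMR.

    Standalone illustration; not KeyBERT's implementation.
    """
    cand = np.asarray(cand_embs, dtype=float)
    if cand.shape[0] == 0 or top_n <= 0:
        return []
    cand = _normalize(cand)
    doc = _normalize(np.asarray(doc_emb, dtype=float))

    doc_sim = cand @ doc      # relevance of each candidate to the document
    cand_sim = cand @ cand.T  # pairwise similarity between candidates

    # Start with the most relevant candidate, then add one at a time.
    selected = [int(np.argmax(doc_sim))]
    remaining = [i for i in range(len(cand)) if i != selected[0]]
    while remaining and len(selected) < top_n:
        # Redundancy: highest similarity to anything already selected.
        redundancy = cand_sim[np.ix_(remaining, selected)].max(axis=1)
        scores = (1 - diversity) * doc_sim[remaining] - diversity * redundancy
        best = remaining[int(np.argmax(scores))]
        selected.append(best)
        remaining.remove(best)
    return selected


def _worker(task):
    # Module-level function so it can be pickled and sent to worker processes.
    doc_emb, cand_embs, top_n, diversity = task
    return mmr_select(doc_emb, cand_embs, top_n, diversity)


def mmr_parallel(doc_embs, cand_embs_per_doc, top_n=5, diversity=0.5,
                 workers=4, chunksize=64):
    """Run MMR for many documents across several processes (hypothetical helper)."""
    tasks = [(d, c, top_n, diversity)
             for d, c in zip(doc_embs, cand_embs_per_doc)]
    with ProcessPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(_worker, tasks, chunksize=chunksize))
```

`mmr_parallel` would need to be called from under an `if __name__ == "__main__":` guard on platforms that spawn worker processes. Each task pickles its embeddings to send them to a worker, so I'd expect this to pay off only when each document has enough candidates for the MMR loop to outweigh that transfer cost.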