
Adds support for stella_en_v5 embedding model (400M variant) #2608

Draft: iskng wants to merge 1 commit into main

Conversation

@iskng commented Nov 9, 2024

stella_en_400M_v5 is ranked #6 on the MTEB leaderboard as of 9 Nov 2024.

Model Card

This PR adds support for the model along with some examples.

License: the model is MIT-licensed.

The authors' example from the model card has been added and its output reproduced.
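
Usage looks roughly like this (a minimal sketch: the module path, `Config` constructor, and forward signature shown here are assumptions, not necessarily what the PR exposes):

```rust
use candle_core::{DType, Device, Tensor};
use candle_nn::VarBuilder;
// Hypothetical module path for this PR's model; the real path may differ.
use candle_transformers::models::stella_en_v5::{Config, EmbeddingModel};

fn main() -> anyhow::Result<()> {
    let device = Device::cuda_if_available(0)?;
    // Weights downloaded separately, e.g. from the stella_en_400M_v5 repo on the Hub.
    let vb = unsafe {
        VarBuilder::from_mmaped_safetensors(&["model.safetensors"], DType::F32, &device)?
    };
    // `Config::new_400m` and `EmbeddingModel::new` are illustrative names.
    let model = EmbeddingModel::new(&Config::new_400m(), vb)?;
    // Token ids would come from the model's tokenizer; dummy input shown here.
    let input_ids = Tensor::zeros((1, 8), DType::U32, &device)?;
    let mask = Tensor::ones((1, 8), DType::U8, &device)?;
    let embeddings = model.forward(&input_ids, &mask)?; // (batch, hidden)
    println!("{:?}", embeddings.shape());
    Ok(())
}
```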

@AnubhabB (Contributor)
@iskng let's try and figure out if we can have one single stella_en_v5 module instead of stella_en_v5 and stella_en_v5_400m. Allow me some time to go through this and discuss possible ways of merging this.

I guess that way, it'll be easier for end users and maintainers.
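
For concreteness, one shape such a merge could take is a single module keyed on a variant enum, so the two checkpoints share one code path and differ only in config. A sketch with illustrative names only, not the actual candle API:

```rust
// Sketch: one module serving both checkpoints via a variant enum.
#[derive(Debug, Clone, Copy)]
pub enum ModelVariant {
    Large1_5B, // stella_en_1.5B_v5
    Small400M, // stella_en_400M_v5
}

#[derive(Debug, Clone)]
pub struct Config {
    pub variant: ModelVariant,
    pub hidden_size: usize,
    pub num_hidden_layers: usize,
    pub num_attention_heads: usize,
}

impl Config {
    pub fn new(variant: ModelVariant) -> Self {
        match variant {
            // Values below are placeholders; the real ones come from
            // each checkpoint's config.json.
            ModelVariant::Large1_5B => Self {
                variant,
                hidden_size: 1536,
                num_hidden_layers: 28,
                num_attention_heads: 12,
            },
            ModelVariant::Small400M => Self {
                variant,
                hidden_size: 1024,
                num_hidden_layers: 24,
                num_attention_heads: 16,
            },
        }
    }
}
```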

It would be great if you could mark this as a draft PR for the time being until we sort this out.

Thanks

@LaurentMazare (Collaborator)

> @iskng let's try and figure out if we can have one single stella_en_v5 module instead of stella_en_v5 and stella_en_v5_400m. Allow me some time to go through this and discuss possible ways of merging this.

+1 to this. If it's easy to add support for the 400M model in the existing one, that would make it simpler to maintain over time (though there is already a lot of duplication among models, so if merging the two is a significant effort, I'm happy with the separate file).

@iskng marked this pull request as draft on November 10, 2024 at 18:40
@iskng (Author) commented Nov 12, 2024

I should have mentioned that I only really tested this for inference on Metal and CPU, so I'm not sure whether the CUDA implementation is right. I had to disable use_efficient_memory because trying to get xformers working on a Mac was rough.
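
For reference, the usual fallback when memory-efficient attention is unavailable is plain scaled dot-product attention. A sketch in candle terms (function and variable names here are illustrative, not taken from the PR):

```rust
use candle_core::{Result, Tensor, D};

/// Plain scaled dot-product attention: softmax(q k^T / sqrt(d)) v.
/// q, k, v: (batch, heads, seq, head_dim); mask broadcastable to (batch, heads, seq, seq).
fn standard_attention(
    q: &Tensor,
    k: &Tensor,
    v: &Tensor,
    mask: Option<&Tensor>,
) -> Result<Tensor> {
    let head_dim = q.dim(D::Minus1)? as f64;
    let mut scores = (q.matmul(&k.transpose(D::Minus2, D::Minus1)?)? / head_dim.sqrt())?;
    if let Some(mask) = mask {
        // Additive mask: 0.0 to keep a position, -inf to drop it.
        scores = scores.broadcast_add(mask)?;
    }
    let weights = candle_nn::ops::softmax_last_dim(&scores)?;
    weights.matmul(v)
}
```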

I'm also curious whether it's just my implementation, but it's about 3x slower than sentence-transformers for the same model.
I'd love to learn how to make this faster; if you know of any resources, please point me to them. I'm just starting to dig around candle. Thanks
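
One thing worth ruling out when comparing against sentence-transformers is timing without device synchronization, since GPU work is queued asynchronously. A hedged micro-benchmark sketch, continuing from the usage sketch above (it assumes `device.synchronize()` is available on the backend in use):

```rust
use std::time::Instant;

// Warm up once so one-time kernel compilation/allocation isn't counted.
let _ = model.forward(&input_ids, &mask)?;
device.synchronize()?; // flush queued GPU work before starting the clock

let iters = 50u32;
let start = Instant::now();
for _ in 0..iters {
    let _ = model.forward(&input_ids, &mask)?;
}
device.synchronize()?; // make sure all iterations actually finished
println!("avg forward: {:?}", start.elapsed() / iters);
```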
