
Embedding model and Engine?? #62

Open
muhtalhakhan opened this issue Oct 31, 2023 · 6 comments

Comments

@muhtalhakhan

Hey guys,

I am moving from GPT to Mistral, and I'm running into one problem: I haven't been able to find an embedding model and engine for Mistral yet.

I am using the service from DeepInfra.

Here's the code snippet I wrote for GPT:

import openai

def get_embedding(text, model="text-embedding-ada-002"):
  # Collapse newlines; the embeddings endpoint expects a non-empty string.
  text = text.replace("\n", " ")
  if not text:
    text = "this is blank"
  return openai.Embedding.create(
          input=[text], model=model)['data'][0]['embedding']


if __name__ == '__main__':
#   gpt_parameter = {"engine": "text-davinci-003", "max_tokens": 50, 
#                    "temperature": 0, "top_p": 1, "stream": False,
#                    "frequency_penalty": 0, "presence_penalty": 0, 
#                    "stop": ['"']}
  gpt_parameter = {"max_tokens": 50, 
                   "temperature": 0, "top_p": 1, "stream": False,
                   "frequency_penalty": 0, "presence_penalty": 0, 
                   "stop": ['"']}

All I want to know is: which embedding model and engine should I use?
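For concreteness, a minimal sketch of the direction I'm considering: pointing the same helper at DeepInfra's OpenAI-compatible endpoint. The base URL and the embedding model name below are assumptions I haven't verified against DeepInfra's docs:

import openai

# Assumption: DeepInfra exposes an OpenAI-compatible API at this base URL
# and hosts BAAI/bge-base-en-v1.5 as an embedding model. Check DeepInfra's
# model list before relying on either.
openai.api_base = "https://api.deepinfra.com/v1/openai"
openai.api_key = "YOUR_DEEPINFRA_API_KEY"

def get_embedding_deepinfra(text, model="BAAI/bge-base-en-v1.5"):
  text = text.replace("\n", " ")
  if not text:
    text = "this is blank"
  return openai.Embedding.create(
          input=[text], model=model)['data'][0]['embedding']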

Thank you 🙂

@praveen555

There is no dedicated embedding model as such.

For each input sentence, you have to tokenize it using the tokenizer provided by Mistral and then pass those tokens to the model.

Check out the example below, taken from the Mistral code:

import numpy as np
import torch
import tqdm

# `data`, `tokenizer`, and `model` are assumed to be set up as in the
# Mistral tutorial this snippet comes from.
with torch.no_grad():
    featurized_x = []
    # compute an embedding for each sentence
    for i, (x, y) in tqdm.tqdm(enumerate(data)):
        tokens = tokenizer.encode(x, bos=True)
        tensor = torch.tensor(tokens).to(model.device)
        features = model.forward_partial(tensor, [len(tokens)])  # (n_tokens, model_dim)
        featurized_x.append(features.float().mean(0).cpu().detach().numpy())

# concatenate sentence embeddings
X = np.concatenate([x[None] for x in featurized_x], axis=0)  # (n_points, model_dim)
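If you'd rather not wire up the reference implementation, a roughly equivalent self-contained sketch using Hugging Face transformers (my substitution, not the Mistral tutorial's code) does the same mean-pooling over the last hidden states:

import torch
from transformers import AutoModel, AutoTokenizer

# Assumption: the Hugging Face checkpoint "mistralai/Mistral-7B-v0.1".
# AutoModel returns the bare transformer without the LM head, so
# last_hidden_state plays the role of forward_partial's output above.
model_name = "mistralai/Mistral-7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name, torch_dtype=torch.float16, device_map="auto")
model.eval()

def embed(text):
    inputs = tokenizer(text, return_tensors="pt").to(model.device)
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (1, n_tokens, model_dim)
    # mean-pool over tokens, as in the snippet above
    return hidden.float().mean(dim=1).squeeze(0).cpu().numpy()  # (model_dim,)

Stacking embed(s) over your sentences with np.stack reproduces the (n_points, model_dim) matrix X from above.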

@muhtalhakhan
Author

Is there any working example that could help me better understand the code?

I am getting some lines back from Mistral as prompt output, and I want them to be embedded.

@praveen555

Check the tutorial example provided in the Mistral repository folder; the code I gave earlier is from there.

@muhtalhakhan
Author

> Check the tutorial example provided in the Mistral repository folder; the code I gave earlier is from there.

Thanks, but I didn't find anything useful there. I was just experimenting with prompts and then passing the outputs to another function to be embedded.

@zhzfight

Hi dude, have you solved the problem?

@muhtalhakhan
Author

> Hi dude, have you solved the problem?

Hey, I tried, but I did not get a good enough response from the model.
