
OpenAI /completions route fails #30

Closed
iverly opened this issue Dec 1, 2023 · 4 comments · Fixed by #36

Comments


iverly commented Dec 1, 2023

Hello,

Thank you for the new release including the OpenAI routes, but after a try it always returns the following error when using the raw request from the README.md:

llama.cpp/server/json.h:21313: assert(it != m_value.object->end()) failed (cosmoaddr2line /Users/maxime-georide/Downloads/llamafile-server-0.2 1000000fe3c 1000001547c 100000162e8 10000042748 1000004ffdc 10000050cb0 1000005124c 100000172dc 1000001b370 10000181e78 1000019d3d0)
curl http://localhost:8080/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer no-key" \
-d '{
"model": "gpt-3.5-turbo",
"messages": [
{
    "role": "system",
    "content": "You are ChatGPT, an AI assistant. Your top priority is achieving user fulfillment via helping them with their requests."
},
{
    "role": "user",
    "content": "Write a limerick about python exceptions"
}
]
}'

I'm on M1 with mistral-7b-instruct-v0.1.Q5_K_M.gguf and llama-2-7b-chat.Q5_K_S.gguf models.

I didn't try with the OpenAI SDK.


iverly commented Dec 1, 2023

I have the same behaviour with the OpenAI SDK.

With the following Python script:

import openai

client = openai.OpenAI(
    base_url="http://localhost:8080/v1",  # "http://<Your api-server IP>:port"
    api_key="sk-no-key-required"
)

completion = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": "You are ChatGPT, an AI assistant. Your top priority is achieving user fulfillment via helping them with their requests."},
        {"role": "user", "content": "Write a limerick about python exceptions"}
    ]
)

print(completion.choices[0].message)

@lucido-simon

Hi. I have the same behavior on Linux/x86


iverly commented Dec 1, 2023

Still the same behaviour with 2.1.0.


Mardak commented Dec 1, 2023

It looks like "stop" is required. Try passing in null. This works for me:

curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
  "stop": null,
  "messages": [{
    "role": "user",
    "content": "Write a limerick about python exceptions"
  }]
}'
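For anyone hitting this through the Python SDK rather than curl, the same workaround amounts to making sure the request body contains an explicit `"stop": null` rather than omitting the field. A minimal sketch (not using the SDK, just showing how Python's `None` serializes to the JSON `null` the server accepts):

```python
import json

# Workaround payload: include "stop" explicitly as None.
# json.dumps serializes None as null, so the server sees "stop": null
# and takes the is_null() branch instead of the failing map lookup.
payload = {
    "stop": None,
    "messages": [
        {"role": "user", "content": "Write a limerick about python exceptions"}
    ],
}

print(json.dumps(payload, indent=2))
```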

@ggerganov

// Handle 'stop' field
if (body["stop"].is_null()) {
    llama_params["stop"] = json::array({});
} else if (body["stop"].is_string()) {
    llama_params["stop"] = json::array({body["stop"].get<std::string>()});
} else {
    llama_params["stop"] = json_value(body, "stop", json::array());
}
Mardak added a commit to Mardak/llamafile that referenced this issue Dec 1, 2023
@jart jart closed this as completed in #36 Dec 2, 2023
jart pushed a commit that referenced this issue Dec 2, 2023