bug: responses from /chat/completions endpoint contain a leading space in the content #2548

Propheticus · 2024-03-31T12:22:44Z

Jan's API server responds with a leading space. This leads to broken output (markdown tables don't render right) and illegal file names when the output is used to generate note titles which are in turn used as the .md filename.

Call:

POST http://127.0.0.1:1337/v1/chat/completions
**headers**
content-type: application/json
**body**
{
  "messages": [
    {
      "content": "You are a helpful assistant.",
      "role": "system"
    },
    {
      "content": "Hello!",
      "role": "user"
    }
  ],
  "model": "mistral-ins-7b-q5",
  "stream": true,
  "max_tokens": 4096,
  "stop": [
    "</s>"
  ],
  "frequency_penalty": 0,
  "presence_penalty": 0,
  "temperature": 0.7,
  "top_p": 0.95
}

Response:

data: {"choices":[{"delta":{"content":" Hello"},"finish_reason":null,"index":0}],"created":1711886550,"id":"K75WwlMq7nBjqPW4FGlR","model":"_","object":"chat.completion.chunk"}
data: {"choices":[{"delta":{"content":" there"},"finish_reason":null,"index":0}],"created":1711886550,"id":"ADqnjtzqwx1zbUiWVXVm","model":"_","object":"chat.completion.chunk"}
...etc

Expected output:

data: {"choices":[{"delta":{"content":"Hello"},"finish_reason":null,"index":0}],"created":1711886550,"id":"K75WwlMq7nBjqPW4FGlR","model":"_","object":"chat.completion.chunk"}
data: {"choices":[{"delta":{"content":" there"},"finish_reason":null,"index":0}],"created":1711886550,"id":"ADqnjtzqwx1zbUiWVXVm","model":"_","object":"chat.completion.chunk"}
...etc

Tested with "stream": false as well and the same is true for un-chunked chat.completion objects.

The text was updated successfully, but these errors were encountered:

Propheticus · 2024-03-31T12:28:27Z

causes longy2k/obsidian-bmo-chatbot#66 and longy2k/obsidian-bmo-chatbot#67

Propheticus · 2024-03-31T15:59:38Z

ggerganov/llama.cpp#3664 might be related? (would mean it's in nitro.exe which uses llama.cpp)
also: ggerganov/llama.cpp#367 (comment)

Propheticus · 2024-04-01T17:11:16Z

Reading the 2 issues above plus ggerganov/llama.cpp#4081 the leading space appears to be added during tokenization on purpose and is even needed for some models to work correctly.
I'm still unsure how/if tokenization, of what I thought was done to the input to be processed by a model, relates to the generation of a response.
Is the same done/needed for the output? Perhaps because of threads of messages where previous replies become context/input for the next prompt?

Propheticus · 2024-04-02T09:17:15Z

Without going further into the rabbit hole of how tokenization works internally and whether it applies to completion....
The OpenAI API spec (and Mistral AI API as well) gives examples of expected response, without a leading space.

Van-QA · 2024-04-17T09:58:14Z

hi @Propheticus, dev team resolved the issue, would you mind retrying it? many thanks 🙏

Propheticus · 2024-04-17T11:48:47Z

Looks good to me @Van-QA 👍
tested on Jan v0.4.11-386 nightly.

Propheticus added the type: bug Something isn't working label Mar 31, 2024

This was referenced Mar 31, 2024

Leading space in output is not trimmed longy2k/obsidian-bmo-chatbot#66

Open

Rename note title can output illegal characters (edit: caused by leading space) longy2k/obsidian-bmo-chatbot#67

Open

Propheticus changed the title ~~bug: responses from /chat/completions endpoint contain a leading space in the first chunk~~ bug: responses from /chat/completions endpoint contain a leading space in the content Mar 31, 2024

Van-QA added this to the v0.4.11 milestone Apr 1, 2024

Van-QA assigned louis-jan Apr 1, 2024

Van-QA added the P2: nice to have Nice to have feature label Apr 1, 2024

louis-jan assigned CameronNg and unassigned louis-jan Apr 5, 2024

tikikun assigned vansangpfiev and unassigned CameronNg Apr 5, 2024

Van-QA modified the milestones: v0.4.11, v0.4.12 Apr 10, 2024

vansangpfiev mentioned this issue Apr 11, 2024

fix: responses from /chat/completions endpoint contain a leading space in the content janhq/cortex.cpp#488

Merged

Van-QA closed this as completed Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug: responses from /chat/completions endpoint contain a leading space in the content #2548

bug: responses from /chat/completions endpoint contain a leading space in the content #2548

Propheticus commented Mar 31, 2024 •

edited

Loading

Propheticus commented Mar 31, 2024

Propheticus commented Mar 31, 2024 •

edited

Loading

Propheticus commented Apr 1, 2024

Propheticus commented Apr 2, 2024

Van-QA commented Apr 17, 2024

Propheticus commented Apr 17, 2024 •

edited

Loading

bug: responses from /chat/completions endpoint contain a leading space in the content #2548

bug: responses from /chat/completions endpoint contain a leading space in the content #2548

Comments

Propheticus commented Mar 31, 2024 • edited Loading

Propheticus commented Mar 31, 2024

Propheticus commented Mar 31, 2024 • edited Loading

Propheticus commented Apr 1, 2024

Propheticus commented Apr 2, 2024

Van-QA commented Apr 17, 2024

Propheticus commented Apr 17, 2024 • edited Loading

Propheticus commented Mar 31, 2024 •

edited

Loading

Propheticus commented Mar 31, 2024 •

edited

Loading

Propheticus commented Apr 17, 2024 •

edited

Loading