
server: Retrieve prompt template in /props #8337

Merged
merged 5 commits into ggerganov:master on Jul 7, 2024

Conversation

@bviksoe (Contributor) commented Jul 6, 2024

This PR adds the following:

  • Expose the model's actual Jinja2 prompt template in the /props endpoint.
  • Change the log level from Error to Warning for the template-mismatch message.

With the raw template available, the front-end stands a better chance of rendering the Jinja template correctly; the server is currently just guessing the format.

Ideally this would sit inside a JSON block that exposes the same key/value pairs as are listed during startup in the llm_load_print_meta function, allowing the front-end to read a plethora of model properties.
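
For illustration, a minimal TypeScript sketch of how a front-end might read the new field, assuming a llama.cpp server on localhost:8080 and the chat_template key name adopted later in this PR:

```ts
// Fetch the model's raw Jinja2 chat template from the server's /props
// endpoint. The "chat_template" field name follows the naming-convention
// commit in this PR; the base URL is an assumption for this sketch.
async function getChatTemplate(baseUrl = "http://localhost:8080"): Promise<string> {
  const res = await fetch(`${baseUrl}/props`);
  if (!res.ok) {
    throw new Error(`GET /props failed with status ${res.status}`);
  }
  const props = await res.json();
  return props.chat_template; // the raw Jinja2 template string
}
```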

@ngxson (Collaborator) commented Jul 6, 2024

I'm not very confident about this change. Even if the frontend can get the Jinja template, how will it be used? I suppose it requires a Jinja parser on the frontend?

Another way I can think of is that the backend could format the chat template with some pre-defined placeholders, like {{user_message}}, and the JS code could do simple string manipulation to produce the correct prompt.
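
For illustration, a minimal TypeScript sketch of this placeholder idea; the pre-rendered string, placeholder names, and helper are hypothetical, not anything the server currently provides:

```ts
// Hypothetical placeholder approach: the backend would render the chat
// template once with marker strings, and the frontend would substitute
// the actual message text with plain string manipulation.
const preRendered = "<|user|>\n{{user_message}}<|end|>\n<|assistant|>\n";

function fillTemplate(template: string, values: Record<string, string>): string {
  // Replace each {{name}} marker with its value; leave unknown markers intact.
  return template.replace(/\{\{(\w+)\}\}/g, (match, name) => values[name] ?? match);
}

// Example: build the prompt up to the assistant turn.
const prompt = fillTemplate(preRendered, { user_message: "Hello!" });
```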

Review thread on examples/server/server.cpp (resolved)
@bviksoe (Contributor, Author) commented Jul 6, 2024

> Even if the frontend can get the Jinja template, how will it be used? I suppose it requires a Jinja parser on the frontend?

Yes. Adding a real Jinja parser in C++ has already been discussed, I think, and it's not likely to be added. But plenty of Jinja implementations exist for Python and JavaScript.

The /chat/completions endpoint will surely help standard usage of the server, and it makes its own assumptions about the template content. But those using the /completion endpoint will want access to all model properties, including the template, so they can ensure any model's prompt is formatted correctly.
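
For illustration, a minimal TypeScript sketch of the front-end side using @huggingface/jinja, one such JavaScript Jinja implementation; the variables passed to the template (messages, bos_token, eos_token, add_generation_prompt) follow the Hugging Face chat-template convention, and the token strings here are model-specific assumptions:

```ts
import { Template } from "@huggingface/jinja";

// Render the template retrieved from /props into a prompt string.
function renderPrompt(
  chatTemplate: string,
  messages: { role: string; content: string }[],
): string {
  const template = new Template(chatTemplate);
  return template.render({
    messages,
    bos_token: "<s>",   // assumption: varies per model
    eos_token: "</s>",  // assumption: varies per model
    add_generation_prompt: true,
  });
}
```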

Review thread on examples/server/server.cpp (outdated, resolved)
@ngxson merged commit cb4d86c into ggerganov:master on Jul 7, 2024
53 checks passed
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jul 8, 2024
* server: Retrieve prompt template in /props

This PR adds the following:
- Expose the model's Jinja2 prompt template in the /props endpoint.
- Change the log level from Error to Warning for the template-mismatch message.

The front-end then stands a better chance of rendering the Jinja template correctly; the server is currently just guessing the format.

Ideally this would sit inside a JSON block that exposes the same key/value pairs as are listed during startup in the "llm_load_print_meta" function.

* Make string buffer dynamic

* Add doc and better string handling

* Using chat_template naming convention

* Use intermediate vector for string assignment
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jul 8, 2024
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jul 11, 2024
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jul 11, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 13, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 13, 2024