
server : add bad input handling in embeddings #10866

Closed
wants to merge 1 commit

Conversation

@krystiancha (Contributor) commented Dec 17, 2024

This patch improves how the embeddings endpoint handles bad input (both failure modes are sketched below):

  • fixed an unhandled exception when "content" is not a string
  • fixed a crash when "content" or "input" is an empty string
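
For reference, a minimal sketch of the two failure modes, using nlohmann::json as the server does; validate_content is a hypothetical helper for illustration, not this PR's exact code:

#include <nlohmann/json.hpp>
#include <string>

using json = nlohmann::json;

// {"content": 42} -> prompt.get<std::string>() throws json::type_error
// {"content": ""} -> tokenizes to zero tokens, which crashed downstream
static bool validate_content(const json & body, std::string & err) {
    const json & prompt = body.at("content");
    if (!prompt.is_string()) {
        err = "\"content\" must be a string";
        return false;
    }
    if (prompt.get<std::string>().empty()) {
        err = "\"content\" must not be empty";
        return false;
    }
    return true;
}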

@krystiancha requested a review from ngxson as a code owner on December 17, 2024 12:17
@github-actions bot added the examples, python, and server labels on Dec 17, 2024
Comment on lines +3658 to +3663
// with "content", we only support single prompt
if (!oaicompat && prompt.type() != json::value_t::string) {
res_error(res, format_error_response("\"content\" must be a string", ERROR_TYPE_INVALID_REQUEST));
return;
}

@ggerganov (Owner) commented Dec 17, 2024

I wonder what the reason is for applying this restriction to the "content" field. Why don't we treat it as an alias of "input" and allow the same inputs for it? cc @ngxson

Collaborator commented:

Hmm, I have no idea why; it seems like a remnant from the past. It makes more sense to consider content as an alias of input, as you said.
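
A minimal sketch of that alias idea (get_prompt is a hypothetical helper, not code from this PR): accept either field and send both through the same prompt-handling path.

#include <nlohmann/json.hpp>

using json = nlohmann::json;

// Treat "content" purely as an alias of "input": whichever field is present
// is returned unchanged, so both accept the same shapes (a string, an array
// of strings, or an array of token arrays).
static json get_prompt(const json & body) {
    if (body.contains("input"))   return body.at("input");
    if (body.contains("content")) return body.at("content");
    return json(); // null -> caller responds with ERROR_TYPE_INVALID_REQUEST
}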

// create and queue the task
json responses = json::array();
bool error = false;
{
std::vector<server_task> tasks;
std::vector<llama_tokens> tokenized_prompts = tokenize_input_prompts(ctx_server.ctx, prompt, /* add_special */ false, true);
for (size_t i = 0; i < tokenized_prompts.size(); i++) {
if (tokenized_prompts[i].size() == 0) {
Collaborator commented:

This test should not be here. Also, an empty string is still a valid input.

Please remove this check.

Collaborator commented:

Hmm, ok, sorry: it seems the OAI embeddings API does not accept an empty string either. So this check is relevant; it's just not in the correct place.

Ref: https://platform.openai.com/docs/api-reference/embeddings/create
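
A sketch of what "the correct place" could look like: rejecting empty strings while parsing the request, before tokenization, mirroring the OpenAI behavior linked above. res_error and format_error_response are from the server code shown earlier; the placement and loop are illustrative, not this PR's final code.

// during request parsing, before tokenize_input_prompts() is called
for (const auto & p : prompts) {
    // OpenAI's /v1/embeddings rejects "", so fail fast here instead of
    // letting an empty token sequence reach the embedding path
    if (p.is_string() && p.get<std::string>().empty()) {
        res_error(res, format_error_response(
            "\"input\" must not be an empty string", ERROR_TYPE_INVALID_REQUEST));
        return;
    }
}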

Collaborator commented:

This should be fixed in my PR; it won't crash if the input is empty, since we now add the BOS token to the sequence.
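
A sketch of that BOS behavior through the llama.cpp C API (signature as of around this PR; it may differ in other versions): with add_special = true, tokenizing an empty string still yields at least the BOS token, so downstream code never sees an empty sequence.

#include <vector>
#include "llama.h"

// assumes an initialized model whose vocab uses a BOS token
static bool empty_input_still_has_bos(const llama_model * model) {
    std::vector<llama_token> toks(4);
    const int n = llama_tokenize(model, "", 0, toks.data(), (int) toks.size(),
                                 /* add_special   */ true,
                                 /* parse_special */ true);
    // with add_special = true, "" yields at least BOS:
    // n == 1 && toks[0] == llama_token_bos(model)
    return n >= 1;
}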
