
Add support for stopwords in huggingface handler #1118

Merged
lanking520 merged 1 commit into deepjavalibrary:master on Oct 4, 2023

Conversation

ydm-amazon (Contributor)

Description

This adds support for stopwords (also known as stop sequences) to the HuggingFace handler.
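
For background, stop sequences are commonly wired into Hugging Face generation through a stopping criterion that inspects the decoded output after each generated token. The sketch below is illustrative only and is not the handler's implementation; StopOnSequence is a hypothetical name:

# Illustrative sketch (not the handler code): stop generation once the decoded
# text ends with a given stop sequence, using transformers' StoppingCriteria.
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          StoppingCriteria, StoppingCriteriaList)


class StopOnSequence(StoppingCriteria):  # hypothetical helper name

    def __init__(self, stop_sequence: str, tokenizer):
        self.stop_sequence = stop_sequence
        self.tokenizer = tokenizer

    def __call__(self, input_ids, scores, **kwargs) -> bool:
        # Decode what has been generated so far and check for the stop sequence.
        text = self.tokenizer.decode(input_ids[0], skip_special_tokens=True)
        return text.endswith(self.stop_sequence)


tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom-560m")
model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")
inputs = tokenizer("<Assistant>: Hi! What can I do for you? <User>:",
                   return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=50,
    stopping_criteria=StoppingCriteriaList([StopOnSequence("<User>", tokenizer)]),
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))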

ydm-amazon requested review from zachgk, frankfliu and a team as code owners on September 27, 2023, 20:02
lanking520 (Contributor) commented Sep 29, 2023

import re


class StopSequenceCriteria:

    def __init__(self, stop_sequence: str):
        # Escape the stop sequence so regex metacharacters are treated literally,
        # then match any output that ends with it.
        stop_sequence = re.escape(stop_sequence)
        self.regex = re.compile(f".*{stop_sequence}$")

    def __call__(self, output: str) -> bool:
        # True once the generated text ends with the stop sequence.
        return bool(self.regex.findall(output))

This is the reference implementation in LMI_Dist.
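
For illustration only (hypothetical usage, not code from LMI_Dist), the criteria returns True once the accumulated output ends with the stop sequence:

criteria = StopSequenceCriteria("<User>")
criteria("<Assistant>: I have a problem with the apple. <User>")  # True: output ends with the stop sequence
criteria("<Assistant>: I have a problem with the apple.")         # False: stop sequence not reached yet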

lanking520 (Contributor)

@KexinFeng, could you review?

KexinFeng (Contributor)

Is there any unittest code that tests this feature?

KexinFeng (Contributor)

Need to rebase onto master to get the patch that helps the unit tests pass.

ydm-amazon (Contributor, Author) commented Oct 3, 2023

Need to rebase onto master to get the patch that helps the unit tests pass.

Done

Is there any unittest code that tests this feature?

No, but I am starting to work on it; I have added a subtask to Asana. For now, here is an example of how I am testing (a rough unit-test sketch follows the examples below):

serving.properties:

engine=MPI
option.entryPoint=huggingface.py
option.model_id=bigscience/bloom-560m
option.tensor_parallel_degree=1
option.task=text-generation
option.paged_attention=true
option.dtype=fp16
option.stop_sequence=["<User>", "See you later"]

Model server:

docker run -it --runtime=nvidia --gpus "device=0" --shm-size 4g -v /home/ubuntu/rds:/opt/ml/model  -v /home/ubuntu/model_server_logs:/opt/djl/logs  -p 8080:8080  deepjavalibrary/djl-serving:0.23.0-deepspeed

Example request:

curl http://127.0.0.1:8080/invocations -X POST -d '{"inputs":"<Assistant>: Hi! What can I do for you? <User>: Why apple is red?","parameters":{"max_new_tokens":50}}' -H "Content-type: application/json"

Result:

[
  {
    "generated_text":"<Assistant>: Hi! What can I do for you? <User>: Why apple is red? <Assistant>: I have a problem with the apple. <User>:"
  }
]

Another example request:

curl http://127.0.0.1:8080/invocations -X POST -d '{"inputs":"When User says See you tomorrow, Assistant replies See you later. Assistant: Hi! What can I do for you? User: See you tomorrow","parameters":{"max_new_tokens":50}}' -H "Content-type: application/json"

Result:

[
  {
    "generated_text":"When User says See you tomorrow, Assistant replies See you later. Assistant: Hi! What can I do for you? User: See you tomorrow. User: See you later"
  }
]
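
A minimal unittest sketch for the stop-sequence check, assuming a StopSequenceCriteria class like the reference implementation quoted above; the eventual tests in the repository may differ:

import re
import unittest


class StopSequenceCriteria:
    # Mirrors the reference implementation quoted earlier in this thread.

    def __init__(self, stop_sequence: str):
        stop_sequence = re.escape(stop_sequence)
        self.regex = re.compile(f".*{stop_sequence}$")

    def __call__(self, output: str) -> bool:
        return bool(self.regex.findall(output))


class TestStopSequenceCriteria(unittest.TestCase):

    def test_stops_at_user_tag(self):
        criteria = StopSequenceCriteria("<User>")
        self.assertTrue(criteria("<Assistant>: I have a problem with the apple. <User>"))
        self.assertFalse(criteria("<Assistant>: I have a problem with the apple."))

    def test_stops_at_phrase(self):
        criteria = StopSequenceCriteria("See you later")
        self.assertTrue(criteria("User: See you tomorrow. User: See you later"))
        self.assertFalse(criteria("User: See you tomorrow."))


if __name__ == "__main__":
    unittest.main()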

lanking520 merged commit 7c9ea81 into deepjavalibrary:master on Oct 4, 2023
8 checks passed