openchat/openchat-3.5-1210 · Hugging Face #418
Labels
base-model
llm base models not finetuned for chat
chat-templates
llm prompt templates for chat models
llm
Large Language Models
llm-inference-engines
Software to run inference on large language models
ml-inference
Running and serving ML models.
Models
LLM and ML model repos and links
openai
OpenAI APIs, LLMs, Recipes and Evals
technical-writing
Links to deep technical writing and books
Using the OpenChat Model
We highly recommend installing the OpenChat package and using the OpenChat OpenAI-compatible API server for an optimal experience. The server is optimized for high-throughput deployment using vLLM and can run on a consumer GPU with 24GB RAM.
Installation Guide: Follow the installation guide in our repository.
Serving: Start the OpenChat OpenAI-compatible API server by running the serving command below. To enable tensor parallelism, append
--tensor-parallel-size N
to the command.
python -m ochat.serving.openai_api_server --model openchat/openchat-3.5-1210 --engine-use-ray --worker-use-ray
API Usage: Once started, the server listens at
localhost:18888
for requests and is compatible with the OpenAI ChatCompletion API specification.
Web UI: Use the OpenChat Web UI for a user-friendly experience.
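As a sketch of such a request using only the Python standard library (the host and port come from the serving section above; the "openchat_3.5" model name is an assumption, so check the server's /v1/models endpoint for the exact name):

```python
import json
import urllib.request

# ChatCompletion-style payload; "openchat_3.5" is an assumed model name.
payload = {
    "model": "openchat_3.5",
    "messages": [{"role": "user", "content": "Write a haiku about GPUs."}],
}
print(json.dumps(payload, indent=2))

# With the server running, uncomment to send the request:
# req = urllib.request.Request(
#     "http://localhost:18888/v1/chat/completions",
#     data=json.dumps(payload).encode("utf-8"),
#     headers={"Content-Type": "application/json"},
# )
# print(urllib.request.urlopen(req).read().decode("utf-8"))
```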
Online Deployment
If you want to deploy the server as an online service, use the following options:
--api-keys sk-KEY1 sk-KEY2 ...
to specify allowed API keys
--disable-log-requests --disable-log-stats --log-file openchat.log
to log only to a file.
For security purposes, we recommend using an HTTPS gateway in front of the server.
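Putting the options above together, a full online-serving invocation might look like this (a sketch only; sk-KEY1 and sk-KEY2 are placeholder keys, and the base command is the one from the Serving section):

```shell
# Sketch: serve online with restricted API keys and file-only logging.
python -m ochat.serving.openai_api_server \
    --model openchat/openchat-3.5-1210 \
    --engine-use-ray --worker-use-ray \
    --api-keys sk-KEY1 sk-KEY2 \
    --disable-log-requests --disable-log-stats --log-file openchat.log
```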
Mathematical Reasoning Mode
The OpenChat model also supports a mathematical reasoning mode. To use this mode, include
condition: "Math Correct"
in your request.
Conversation Templates
We provide several pre-built conversation templates to help you get started.
Default Mode (GPT4 Correct):
Mathematical Reasoning Mode:
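The two modes above can be sketched as a small prompt builder. This is an assumption based on the turn format implied by the end-of-turn note that follows; build_prompt is a hypothetical helper, not part of the OpenChat package:

```python
def build_prompt(messages, mode="GPT4 Correct"):
    """Assemble an OpenChat-style prompt in the given mode.

    mode is "GPT4 Correct" (default) or "Math Correct" (mathematical
    reasoning); each turn is closed with the <|end_of_turn|> token.
    """
    parts = []
    for m in messages:
        role = "User" if m["role"] == "user" else "Assistant"
        parts.append(f"{mode} {role}: {m['content']}<|end_of_turn|>")
    # A trailing assistant tag prompts the model to generate its reply.
    parts.append(f"{mode} Assistant:")
    return "".join(parts)

print(build_prompt([{"role": "user", "content": "Hello"}]))
# → GPT4 Correct User: Hello<|end_of_turn|>GPT4 Correct Assistant:
print(build_prompt([{"role": "user", "content": "10.3 - 7.2 = ?"}],
                   mode="Math Correct"))
# → Math Correct User: 10.3 - 7.2 = ?<|end_of_turn|>Math Correct Assistant:
```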
NOTE: Remember to set
<|end_of_turn|>
as the end-of-generation token.
Integrated Tokenizer: The default (GPT4 Correct) template is also available as the integrated tokenizer.chat_template, which can be used instead of manually specifying the template.
Suggested labels
{ "label": "chat-templates", "description": "Pre-defined conversation structures for specific modes of interaction." }