
Implementation of Positional Interpolation (PI) Feature #690


Closed · andy-yang-1 wants to merge 8 commits

Conversation

andy-yang-1

This Pull Request introduces the Positional Interpolation (PI) feature to the vllm library. Positional Interpolation extends the effective context window of models that use Rotary Position Embedding (RoPE): position indices are linearly down-scaled so that long sequences fall within the position range the model was trained on. With this change, the code supports long-context models.
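For intuition, here is a minimal sketch of linear Positional Interpolation, not the actual implementation in this PR: before computing the RoPE rotation angles, the position indices are multiplied by trained_len / seq_len, so a 16K sequence is squeezed into the 4K position range the model was trained on. The helper name pi_rope_cache and its parameters are illustrative, not part of vllm.

import torch

def pi_rope_cache(seq_len, head_dim, trained_len, base=10000.0):
    # Standard RoPE inverse frequencies, one per pair of head dimensions.
    inv_freq = 1.0 / (base ** (torch.arange(0, head_dim, 2).float() / head_dim))
    # Positional Interpolation: shrink position indices so that positions
    # up to seq_len land inside the trained range [0, trained_len).
    scale = trained_len / seq_len if seq_len > trained_len else 1.0
    positions = torch.arange(seq_len).float() * scale
    angles = torch.outer(positions, inv_freq)
    # cos/sin tables, applied to query/key vectors as in standard RoPE.
    return angles.cos(), angles.sin()

# Example: extend a model trained on 4K positions to a 16K context (scale = 0.25).
cos, sin = pi_rope_cache(seq_len=16384, head_dim=128, trained_len=4096)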

Try with:

from vllm import LLM, SamplingParams

# Sample prompts.
prompts = [
    "Hello, my name is",
    "The president of the United States is",
    "The capital of France is",
    "The future of AI is",
]
# Create a sampling params object.
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

# Create an LLM with a long-context (16K) model.
llm = LLM(model="lmsys/vicuna-7b-v1.5-16k")
# Generate texts from the prompts. The output is a list of RequestOutput objects
# that contain the prompt, generated text, and other information.
outputs = llm.generate(prompts, sampling_params)
# Print the outputs.
for output in outputs:
    prompt = output.prompt
    generated_text = output.outputs[0].text
    print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")

@zhuohan123
Member

@andy-yang-1 Does this PR overlap with #555? Can you take a look at that PR as well?

@andy-yang-1
Author

@zhuohan123 Yes, it overlaps with #555.

@WoosukKwon
Collaborator

Hi @andy-yang-1, we recently merged #555, so this feature is now supported. Thanks for the contribution!

WoosukKwon closed this Sep 28, 2023
kzawora-intel pushed a commit to kzawora-intel/vllm-fork that referenced this pull request Jan 20, 2025
1. This PR updates the habana_main README_GAUDI to the Technical Writer-reviewed version as seen in v1.19.0. (The habana_main and v1.19.0 versions of README_GAUDI had diverged.)
2. It also fixes URLs broken by the recent restructuring of the upstream vllm examples folder.
3. It adds notes in the examples folder for new users, redirecting them to the Gaudi-specific examples in README_GAUDI.md.
Labels
new-model: Requests for new models