add internlm model #528
Conversation
Add support for LLaMA-2 (vllm-project#505)
Thank you for your contribution! Can you add your models to README.md and docs/source/models/supported_models.rst? Specifically, have you made sure that your implementation matches the official implementation? For example, do the greedy sampling results from this PR match the official implementation?
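A minimal sketch of how such a greedy-sampling parity check could look. The model name comes from this PR; the prompt, generation length, and the use of HF transformers as the reference are illustrative assumptions, not something prescribed in the review:

```python
# Hypothetical parity check: compare greedy outputs from vLLM and the official HF implementation.
from transformers import AutoModelForCausalLM, AutoTokenizer
from vllm import LLM, SamplingParams

model_id = "internlm/internlm-chat-7b"  # model added in this PR
prompt = "The capital of France is"     # example prompt

# Greedy decoding with vLLM (temperature=0 picks the argmax token at every step).
llm = LLM(model=model_id, trust_remote_code=True)
vllm_text = llm.generate([prompt], SamplingParams(temperature=0, max_tokens=32))[0].outputs[0].text

# Greedy decoding with the official implementation via transformers.
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True).cuda()
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
hf_ids = model.generate(**inputs, do_sample=False, max_new_tokens=32)
hf_text = tokenizer.decode(hf_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)

print("vLLM:", vllm_text)
print("HF:  ", hf_text)
```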
Thank you for your contribution! I tested internlm/internlm-chat-7b and it works pretty well!
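For reference, a minimal example of running this model through vLLM; the prompt and sampling settings below are placeholders, not the exact script used by the reviewer:

```python
from vllm import LLM, SamplingParams

# The internlm checkpoints ship custom code, so trust_remote_code=True is needed.
llm = LLM(model="internlm/internlm-chat-7b", trust_remote_code=True)
outputs = llm.generate(
    ["Hello, my name is"],
    SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64),
)
print(outputs[0].outputs[0].text)
```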
Hi, why do I still get this error? vllm version: 0.1.3, script:
NOTE: This includes a couple of import-order changes, because I moved the vllm.anyscale packages to the bottom to avoid merge conflicts.
- Allow building via pip install -e .
- Basic integration with an env var ANYSCALE_USE_SCRATCH=1
- Working with Llama 7B
- Basic testing
- Batching working (but Scratch only allows a small number of batched requests for now, and Scratch doesn't have efficient batching yet)
- Works with both the Scratch sampler and the vLLM sampler
- Sessions are cleaned based on an LRU cache. It will be fixed in a couple of weeks.
- Support prompt logprobs and some sampler features (except beam search)
- Async execution like torch kernels
- Llama 3 + Llama 2 work
- Do input config validation
- More thorough testing

Future TODO:
- Preemption not working (future work)
- It currently doesn't use the KV cache allocated from vLLM (not a strict requirement)
- The PR needs cleanup before merging
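A rough sketch of how the flag described in the note above might be exercised. The env var name ANYSCALE_USE_SCRATCH=1 and the Llama 7B model are taken from the note; everything else is ordinary vLLM usage and may not match what the actual branch does:

```python
import os

# Opt into the Scratch backend described above; the env var must be set
# before vLLM is imported so the backend selection can see it (assumption).
os.environ["ANYSCALE_USE_SCRATCH"] = "1"

from vllm import LLM, SamplingParams

# The note reports Llama 7B working; model name here is illustrative.
llm = LLM(model="meta-llama/Llama-2-7b-hf")
outputs = llm.generate(
    ["The quick brown fox"],
    SamplingParams(temperature=0, max_tokens=32),
)
print(outputs[0].outputs[0].text)
```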
No description provided.