Merge branch 'main' into patch-1

jie-wu5 · Sep 13, 2024 · 20284ae · 20284ae
2 parents 976cb1e + 2649959
commit 20284ae
Showing 1 changed file with 3 additions and 5 deletions.
diff --git a/README.md b/README.md
@@ -1072,19 +1072,17 @@ Note:
 
 ## Deployment
 
-We recommend using vLLM for fast Qwen2-VL deployment and inference. You can use [this fork](https://github.com/fyabc/vllm/tree/add_qwen2_vl_new) (we are working on merging this PR into vLLM main repository). You can also use our [official docker image](#-docker).
+We recommend using vLLM for fast Qwen2-VL deployment and inference. You need to use `vllm>=0.6.1` to enable Qwen2-VL support. You can also use our [official docker image](#-docker).
 
 ### Installation
 ```bash
 pip install git+https://github.com/huggingface/transformers@21fac7abba2a37fae86106f87fcf9974fd1e3830
 pip install accelerate
 pip install qwen-vl-utils
-git clone https://github.com/fyabc/vllm
-cd vllm
-git checkout add_qwen2_vl_new
 # Change to your CUDA version
 CUDA_VERSION=cu121
-pip install . --extra-index-url https://download.pytorch.org/whl/${CUDA_VERSION}
+pip install 'vllm==0.6.1' --extra-index-url https://download.pytorch.org/whl/${CUDA_VERSION}
+
 ```
 ### Start an OpenAI API Service