-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Core][Frontend][Doc] Initial support for LLaVA-NeXT and GPT-4V Chat Completions API #3978
Commits on Apr 10, 2024
-
Add basic support for OpenAI image input API
- Refactor `OpenAIServingChat` and add function for loading image - Move `pillow` dev dependency to common - Add example chat template for LLaVA model
Configuration menu - View commit details
-
Copy full SHA for 874a581 - Browse repository at this point
Copy the full SHA 874a581View commit details -
- Add general guide for using VLMs - Add LLavA to list of supported models
Configuration menu - View commit details
-
Copy full SHA for 607434e - Browse repository at this point
Copy the full SHA 607434eView commit details -
Add tests for OpenAI image input API and image loader
- Move `ServerRunner` to common file
Configuration menu - View commit details
-
Copy full SHA for aaa6bfe - Browse repository at this point
Copy the full SHA aaa6bfeView commit details
Commits on Apr 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 26e7b2a - Browse repository at this point
Copy the full SHA 26e7b2aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 44829b5 - Browse repository at this point
Copy the full SHA 44829b5View commit details -
Configuration menu - View commit details
-
Copy full SHA for bccb367 - Browse repository at this point
Copy the full SHA bccb367View commit details -
Configuration menu - View commit details
-
Copy full SHA for b9302e8 - Browse repository at this point
Copy the full SHA b9302e8View commit details -
- Incorrect loading of config (also rename `openai_api` to `image_openai`) - Incorrect await of stream generator
Configuration menu - View commit details
-
Copy full SHA for a44d7d1 - Browse repository at this point
Copy the full SHA a44d7d1View commit details
Commits on Apr 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 561ad49 - Browse repository at this point
Copy the full SHA 561ad49View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4479605 - Browse repository at this point
Copy the full SHA 4479605View commit details -
Improve async behaviour of loading images
- Also, use the type definitions from `openai` directly
Configuration menu - View commit details
-
Copy full SHA for 20852d9 - Browse repository at this point
Copy the full SHA 20852d9View commit details -
Configuration menu - View commit details
-
Copy full SHA for ce770f4 - Browse repository at this point
Copy the full SHA ce770f4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6b016bc - Browse repository at this point
Copy the full SHA 6b016bcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7620354 - Browse repository at this point
Copy the full SHA 7620354View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7c3e6d9 - Browse repository at this point
Copy the full SHA 7c3e6d9View commit details -
Configuration menu - View commit details
-
Copy full SHA for e74b0a7 - Browse repository at this point
Copy the full SHA e74b0a7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9925dcb - Browse repository at this point
Copy the full SHA 9925dcbView commit details -
Configuration menu - View commit details
-
Copy full SHA for ceb4e35 - Browse repository at this point
Copy the full SHA ceb4e35View commit details -
Refactor prompt parsing so that it can be shared between Chat Complet…
…ions API and legacy Completions API
Configuration menu - View commit details
-
Copy full SHA for 7bdc84e - Browse repository at this point
Copy the full SHA 7bdc84eView commit details -
Configuration menu - View commit details
-
Copy full SHA for a7d1098 - Browse repository at this point
Copy the full SHA a7d1098View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b9d636 - Browse repository at this point
Copy the full SHA 8b9d636View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9754142 - Browse repository at this point
Copy the full SHA 9754142View commit details -
Configuration menu - View commit details
-
Copy full SHA for c48c13a - Browse repository at this point
Copy the full SHA c48c13aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3530362 - Browse repository at this point
Copy the full SHA 3530362View commit details -
Configuration menu - View commit details
-
Copy full SHA for b8feec9 - Browse repository at this point
Copy the full SHA b8feec9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9cae113 - Browse repository at this point
Copy the full SHA 9cae113View commit details
Commits on Apr 13, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 89d9086 - Browse repository at this point
Copy the full SHA 89d9086View commit details -
Configuration menu - View commit details
-
Copy full SHA for cc1a5b3 - Browse repository at this point
Copy the full SHA cc1a5b3View commit details -
Configuration menu - View commit details
-
Copy full SHA for f9c1135 - Browse repository at this point
Copy the full SHA f9c1135View commit details
Commits on Apr 14, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ecc2d50 - Browse repository at this point
Copy the full SHA ecc2d50View commit details -
Configuration menu - View commit details
-
Copy full SHA for f2e8180 - Browse repository at this point
Copy the full SHA f2e8180View commit details -
Configuration menu - View commit details
-
Copy full SHA for ce04842 - Browse repository at this point
Copy the full SHA ce04842View commit details -
Load image processor from HuggingFace
- Note that multi modal processing logic has been moved from `LLM` to `LLMEngine`
Configuration menu - View commit details
-
Copy full SHA for cdbf08a - Browse repository at this point
Copy the full SHA cdbf08aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9a336ec - Browse repository at this point
Copy the full SHA 9a336ecView commit details -
Allow disabling image processor
- Also fix missing arguments to config in `test_llava.py`
Configuration menu - View commit details
-
Copy full SHA for 5722dd8 - Browse repository at this point
Copy the full SHA 5722dd8View commit details
Commits on Apr 15, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 6e1fa67 - Browse repository at this point
Copy the full SHA 6e1fa67View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ce44da - Browse repository at this point
Copy the full SHA 7ce44daView commit details
Commits on Apr 16, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 9804604 - Browse repository at this point
Copy the full SHA 9804604View commit details -
Configuration menu - View commit details
-
Copy full SHA for 21434df - Browse repository at this point
Copy the full SHA 21434dfView commit details -
Configuration menu - View commit details
-
Copy full SHA for a5907b0 - Browse repository at this point
Copy the full SHA a5907b0View commit details
Commits on Apr 17, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f08ff10 - Browse repository at this point
Copy the full SHA f08ff10View commit details -
Configuration menu - View commit details
-
Copy full SHA for c126646 - Browse repository at this point
Copy the full SHA c126646View commit details
Commits on Apr 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 49ba216 - Browse repository at this point
Copy the full SHA 49ba216View commit details -
Configuration menu - View commit details
-
Copy full SHA for 11e9921 - Browse repository at this point
Copy the full SHA 11e9921View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ae80a2 - Browse repository at this point
Copy the full SHA 7ae80a2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2610bea - Browse repository at this point
Copy the full SHA 2610beaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ad2b67 - Browse repository at this point
Copy the full SHA 5ad2b67View commit details -
Refactor image processing,
MultiModalData
and LLaVA model- Remove channel conversion and resizing from OpenAI server preprocessing since the image processor in HuggingFace should be able to handle that - `MultiModalData` is now an abstract class that outputs additional kwargs to be input into the model. This was intially done to support LLaVA-NeXT's `image_size` parameter but can be extended to other models as well. - The application of image processor is now defined inside `MultiModalData` so that there is no need to extensively edit the engine to support other types of data - New `MultiModalData` subclasses: `ImagePixelData` and `ImageFeatureData` to better differentiate the two cases of image input - Refactored LLaVA-1.5 model to make it easier to inherit for defining LLaVA-NeXT model
Configuration menu - View commit details
-
Copy full SHA for 696357b - Browse repository at this point
Copy the full SHA 696357bView commit details -
Fix image processing not working directly, due to tensor being passed
- Now, `ImagePixelData` only accepts `PIL.Image` input - Also move `torch` import out of `TYPE_CHECKING` as it is loaded anyways when importing `SamplingParams`
Configuration menu - View commit details
-
Copy full SHA for 483b190 - Browse repository at this point
Copy the full SHA 483b190View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e22017 - Browse repository at this point
Copy the full SHA 3e22017View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0b6af35 - Browse repository at this point
Copy the full SHA 0b6af35View commit details -
Get LLaVA-Next to work with fixed-size images
- Note the patch in `ImagePixelData`. To fully leverage the potential of LLaVA-Next, we should allow image of any size, but the feature size would then be variable.
Configuration menu - View commit details
-
Copy full SHA for e4c3502 - Browse repository at this point
Copy the full SHA e4c3502View commit details -
Configuration menu - View commit details
-
Copy full SHA for 21aaf3d - Browse repository at this point
Copy the full SHA 21aaf3dView commit details -
Configuration menu - View commit details
-
Copy full SHA for ac95b79 - Browse repository at this point
Copy the full SHA ac95b79View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9a9a4e7 - Browse repository at this point
Copy the full SHA 9a9a4e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 176ad2c - Browse repository at this point
Copy the full SHA 176ad2cView commit details
Commits on Apr 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 91ea044 - Browse repository at this point
Copy the full SHA 91ea044View commit details -
Fix LLaVA example and test w.r.t. image processing refactor
- Note that we now load the images directly instead of from `.pt` files
Configuration menu - View commit details
-
Copy full SHA for cb19743 - Browse repository at this point
Copy the full SHA cb19743View commit details -
Configuration menu - View commit details
-
Copy full SHA for 019f473 - Browse repository at this point
Copy the full SHA 019f473View commit details -
Fix circular import and set return type
- These changes are propagated to the child PRs
Configuration menu - View commit details
-
Copy full SHA for f882d99 - Browse repository at this point
Copy the full SHA f882d99View commit details