-
Notifications
You must be signed in to change notification settings - Fork 1.2k
[Feat] Multi model support #931
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+1,037
−788
Merged
Changes from 26 commits
Commits
Show all changes
53 commits
Select commit
Hold shift + click to select a range
3ec00d0
Update Llama class to handle chat_format & caching
D4ve-R 6e68a4b
Add settings.py
D4ve-R e63cffb
Add util.py & update __main__.py
D4ve-R 55e33ab
multimodel
D4ve-R 39a07d6
Merge branch 'multimodel'
D4ve-R 5ab0010
update settings.py
D4ve-R 45bfa07
cleanup
D4ve-R 76c0168
delete util.py
D4ve-R 97a6a21
Fix /v1/models endpoint
D4ve-R fb2a1e7
MultiLlama now iterable, app check-alive on "/"
D4ve-R 3f150ac
instant model init if file is given
D4ve-R e71946c
backward compability
D4ve-R 55a9767
revert model param mandatory
D4ve-R bb1857a
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python
D4ve-R 3c4b526
fix error
D4ve-R 10a2d32
handle individual model config json
D4ve-R ee71f20
refactor
D4ve-R ea0fcca
revert chathandler/clip_model changes
D4ve-R 6f5e60a
handle chat_handler in MulitLlama()
D4ve-R d9d696d
split settings into server/llama
D4ve-R e71fc92
reduce global vars
D4ve-R 522f0bd
Update LlamaProxy to handle config files
D4ve-R a67e779
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python
D4ve-R 6e0ab3e
Add free method to LlamaProxy
D4ve-R 9ed047c
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python
D4ve-R ec9a9db
update arg parsers & install server alias
D4ve-R fd731d7
refactor cache settings
D4ve-R 288fa85
change server executable name
D4ve-R b64742b
better var name
D4ve-R bc5cf51
whitespace
D4ve-R 02a83f1
Merge branch 'abetlen:main' into main
D4ve-R b780c6e
Merge branch 'main' of github.com:D4ve-R/llama-cpp-python
D4ve-R 5fd9892
Revert "whitespace"
D4ve-R 7b1c17b
remove exe_name
D4ve-R a94b0de
Merge branch 'main' into D4ve-R/main
abetlen ec8265a
Merge branch 'main' of github.com:abetlen/llama_cpp_python into D4ve-…
abetlen ba36629
Fix merge bugs
abetlen 315a82f
Fix type annotations
abetlen c5051be
Fix type annotations
abetlen 7a3e11a
Fix uvicorn app factory
abetlen 4f99ac6
Fix settings
abetlen 3f2e6c1
Refactor server
abetlen 3472b6f
Remove formatting fix
abetlen 310e2e6
Format
abetlen 5c9c35e
Use default model if not found in model settings
abetlen 950f721
Fix
abetlen 3d6c479
Merge branch 'main' into D4ve-R/main
abetlen 8347a78
Cleanup
abetlen 02ab0e2
Fix
abetlen fd1bf64
Fix
abetlen ecd8434
Remove unnused CommandLineSettings
abetlen 5286146
Cleanup
abetlen 1b322b4
Support default name for copilot-codex models
abetlen File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.