Skip to content

Engineering work for Shale-Serve API. Fork and addition work to LMSYS' FastChat project.

License

Notifications You must be signed in to change notification settings

shaleprotocol/Shale-Serve-API

 
 

Repository files navigation

Shale Protocol Fork

The LMSYS team created the initial version of FastChat. The Shale Protocol team continuously re-engineers it in order to build the infrastructure that supports a production-ready inference API for open-source LLMs.

See more at https://shaleprotocol.com

Thanks to LMSYS team❤️

Coverage of models

We are focused to support Llama2 at scale now. If you want any other models, please contact.

  • OpenHermes-2.5-Mistral-7B
  • Gemma-7b-it

Dev Log

2024-02

Sync upstream changes

2023-09

Sync upstream changes

2023-08

Support llama2 at scale.

2023-07-26

Support "Llama-2-13b-chat-hf" and make it the default for API.

2023-07

  • Fixed issues working with AutoGPT and gpt-engineer etc.
  • Added support for longchat-7b-16k.

2023-06

  • Added support for CodeT5p and Falcon-7b models.

2023-05

  • API key database and rate limit enforcement
  • Deployable on Kubernetes

About

Engineering work for Shale-Serve API. Fork and addition work to LMSYS' FastChat project.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 88.8%
  • Jupyter Notebook 10.2%
  • Other 1.0%