Tags from serve

TorchServe v0.12.0 Release Notes

2024-09-30T22:46:39Z

TorchServe v0.11.1 Release Notes

2024-07-23T16:55:51Z

Bug fix for kserve build issue and fixing nightly tests (#3251)

* Testing kserve build

* Update kserve_cpu_tests.yml

TorchServe v0.11.0 Release Notes

2024-05-17T16:38:42Z

IPEX LLM serving example (#3068)

* adding the files for ipex int8 serving of llms

* Update README.md

Fixed some markdowns

* Fix handler name

* Adding default PyTorch support

* Fixing some issues with handler, added test to verify smooth-quant

* adding auto_mixed_precision flag to config

* Removing min_new_tokens from generation config

* fix lint

* lint

* Fixing unit tests with different model that doesn't require license

* Fix lint error

* Fix lint error in test

* Adding requirements.txt

* adding datasets to the requirements

* upgrading the ipex version to 2.3.0 to match that of pytorch

* Skipping ipex llm tests if accelerate is not present

---------

Co-authored-by: Ubuntu <ubuntu@ip-172-31-51-123.us-west-2.compute.internal>
Co-authored-by: lxning <23464292+lxning@users.noreply.github.com>
Co-authored-by: lxning <lninga@amazon.com>
Co-authored-by: Matthias Reso <13337103+mreso@users.noreply.github.com>

TorchServe v0.10.0 Release Notes

2024-03-15T19:05:27Z

TorchServe v0.9.0 Release Notes

2023-10-17T19:14:02Z

TorchServe v0.8.2 Release Notes

2023-08-28T23:52:56Z

llama2 70b chat accelerate example (#2494)

* llam2 accelerate example

* add readme

* fmt

* fixing the padding and prompt

* update steps

* Updated readme with more details

* changed to inheriting from basehandler

* add model_path

* change to int8

* add download cmd

* update download path

* minor edit for model_path

---------

Co-authored-by: Geeta Chauhan <4461127+chauhang@users.noreply.github.com>
Co-authored-by: Mark Saroufim <marksaroufim@fb.com>
Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu>
Co-authored-by: Hamid Shojanazeri <hamid.nazeri2010@gmail.com>

TorchServe v0.8.1 Release Notes

2023-06-20T19:48:59Z

Revert "Update Telemetry env variable. (#2356)" (#2413)

This reverts commit 7270447.

Co-authored-by: lxning <23464292+lxning@users.noreply.github.com>
Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu>

TorchServe v0.8.0 Release Notes

2023-05-17T22:46:36Z

Issues/fix docker dependencies (#2340)

* re-run kserve build because of runner issue

* Fixes missing dependencies

* added depth 1 to reduce clone size by 50 MB

TorchServe v0.7.1 Release Notes

2023-02-09T00:00:52Z

TorchServe v0.7.0 Release Notes

2023-07-18T21:36:21Z

Fix docker build script to build with CUDA 11.7 (#2032)

* updated docker to build with CUDA 11.7 as default

* temp workflow for building and pushing docker images

* reverting tmp workflow