tag:github.com,2008:https://github.com/pytorch/serve/releases
Tags from serve
2024-09-24T16:41:17Z
tag:github.com,2008:Repository/212488700/v0.12.0
2024-09-30T22:46:39Z
TorchServe v0.12.0 Release Notes
agunapal
tag:github.com,2008:Repository/212488700/v0.11.1
2024-07-23T16:55:51Z
TorchServe v0.11.1 Release Notes
<p>Bug fix for kserve build issue and fixing nightly tests (<a class="issue-link js-issue-link" href="https://github.com/pytorch/serve/pull/3251">#3251</a>)</p>
<p>* Testing kserve build</p>
<p>* Update kserve_cpu_tests.yml</p>
agunapal
tag:github.com,2008:Repository/212488700/v0.11.0
2024-05-17T16:38:42Z
TorchServe v0.11.0 Release Notes
<p>IPEX LLM serving example (<a class="issue-link js-issue-link" href="https://github.com/pytorch/serve/pull/3068">#3068</a>)</p>
<p>* adding the files for ipex int8 serving of llms</p>
<p>* Update README.md</p>
<p>Fixed some markdowns</p>
<p>* Fix handler name</p>
<p>* Adding default PyTorch support</p>
<p>* Fixing some issues with handler, added test to verify smooth-quant</p>
<p>* adding auto_mixed_precision flag to config</p>
<p>* Removing min_new_tokens from generation config</p>
<p>* fix lint</p>
<p>* lint</p>
<p>* lint</p>
<p>* Fixing unit tests with different model that doesn't require license</p>
<p>* Fix lint error</p>
<p>* Fix lint error in test</p>
<p>* Adding requirements.txt</p>
<p>* adding datasets to the requirements</p>
<p>* upgrading the ipex version to 2.3.0 to match that of pytorch</p>
<p>* Skipping ipex llm tests if accelerate is not present</p>
<p>---------</p>
<p>Co-authored-by: Ubuntu <ubuntu@ip-172-31-51-123.us-west-2.compute.internal>
<br />Co-authored-by: lxning <23464292+lxning@users.noreply.github.com>
<br />Co-authored-by: lxning <lninga@amazon.com>
<br />Co-authored-by: Matthias Reso <13337103+mreso@users.noreply.github.com></p>
lxning
tag:github.com,2008:Repository/212488700/v0.10.0
2024-03-15T19:05:27Z
TorchServe v0.10.0 Release Notes
lxning
tag:github.com,2008:Repository/212488700/v0.9.0
2023-10-17T19:14:02Z
TorchServe v0.9.0 Release Notes
lxning
tag:github.com,2008:Repository/212488700/v0.8.2
2023-08-28T23:52:56Z
TorchServe v0.8.2 Release Notes
<p>llama2 70b chat accelerate example (<a class="issue-link js-issue-link" href="https://github.com/pytorch/serve/pull/2494">#2494</a>)</p>
<p>* llam2 accelerate example</p>
<p>* add readme</p>
<p>* fmt</p>
<p>* fixing the padding and prompt</p>
<p>* update steps</p>
<p>* Updated readme with more details</p>
<p>* changed to inheriting from basehandler</p>
<p>* add model_path</p>
<p>* change to int8</p>
<p>* add download cmd</p>
<p>* update download path</p>
<p>* minor edit for model_path</p>
<p>---------</p>
<p>Co-authored-by: Geeta Chauhan <4461127+chauhang@users.noreply.github.com>
<br />Co-authored-by: Mark Saroufim <marksaroufim@fb.com>
<br />Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu>
<br />Co-authored-by: Hamid Shojanazeri <hamid.nazeri2010@gmail.com></p>
lxning
tag:github.com,2008:Repository/212488700/v0.8.1
2023-06-20T19:48:59Z
TorchServe v0.8.1 Release Notes
<p>Revert "Update Telemetry env variable. (<a class="issue-link js-issue-link" href="https://github.com/pytorch/serve/pull/2356">#2356</a>)" (<a class="issue-link js-issue-link" href="https://github.com/pytorch/serve/pull/2413">#2413</a>)</p>
<p>This reverts commit <a class="commit-link" href="https://github.com/pytorch/serve/commit/727044754523e80ea991adbbd8e15634842a8b2c"><tt>7270447</tt></a>.</p>
<p>Co-authored-by: lxning <23464292+lxning@users.noreply.github.com>
<br />Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu></p>
lxning
tag:github.com,2008:Repository/212488700/v0.8.0
2023-05-17T22:46:36Z
TorchServe v0.8.0 Release Notes
<p>Issues/fix docker dependencies (<a class="issue-link js-issue-link" href="https://github.com/pytorch/serve/pull/2340">#2340</a>)</p>
<p>* re-run kserve build because of runner issue</p>
<p>* Fixes missing dependencies</p>
<p>* added depth 1 to reduce clone size by 50 MB</p>
lxning
tag:github.com,2008:Repository/212488700/v0.7.1
2023-02-09T00:00:52Z
TorchServe v0.7.1 Release Notes
lxning
tag:github.com,2008:Repository/212488700/v0.7.0
2023-07-18T21:36:21Z
TorchServe v0.7.0 Release Notes
<p>Fix docker build script to build with CUDA 11.7 (<a class="issue-link js-issue-link" href="https://github.com/pytorch/serve/pull/2032">#2032</a>)</p>
<p>* updated docker to build with CUDA 11.7 as default</p>
<p>* temp workflow for building and pushing docker images</p>
<p>* reverting tmp workflow</p>
lxning