Actions: huggingface/text-generation-inference

Server Tests

2,377 workflow runs

Local gptq support.
Server Tests #766: Pull request #738 opened by Narsil
July 31, 2023 07:52 4m 24s add_local_gptq_support
chore: fix typo in mpt_modeling.py
Server Tests #765: Pull request #737 opened by eltociear
July 31, 2023 02:54 4m 22s eltociear:patch-1
Fix typing in Model.generate_token
Server Tests #764: Pull request #733 opened by jaywonchung
July 28, 2023 21:34 4m 11s jaywonchung:main
v1.0.0
Server Tests #763: Pull request #727 synchronize by OlivierDehaene
July 28, 2023 14:55 10m 49s v1.0.0
v1.0.0
Server Tests #761: Pull request #727 synchronize by OlivierDehaene
July 28, 2023 14:31 16m 46s v1.0.0
v1.0.0
Server Tests #760: Pull request #727 synchronize by OlivierDehaene
July 28, 2023 14:25 5m 43s v1.0.0
v1.0.0
Server Tests #759: Pull request #727 opened by OlivierDehaene
July 28, 2023 14:17 8m 43s v1.0.0
feat(server): update vllm version
Server Tests #758: Pull request #723 opened by OlivierDehaene
July 28, 2023 12:49 20m 39s feat/vllm_update
v0.9.4
Server Tests #756: Pull request #713 opened by OlivierDehaene
July 27, 2023 16:50 20m 52s v0.9.4
feat(server): support new falcon config
Server Tests #755: Pull request #712 opened by OlivierDehaene
July 27, 2023 15:52 21m 52s feat/falcon_compat
fix(server): fix quantization python requirements
Server Tests #754: Pull request #708 opened by OlivierDehaene
July 27, 2023 10:04 23m 51s fix/quant_requirements
Add WIP support for returning top tokens
Server Tests #750: Pull request #617 synchronize by Vinno97
July 25, 2023 14:57 13m 10s Vinno97:feat/return-top-tokens
fix(server): fix exllama buffers
Server Tests #740: Pull request #689 synchronize by OlivierDehaene
July 24, 2023 11:59 12m 15s fix/exllama_buffers
fix(server): fix exllama buffers
Server Tests #738: Pull request #689 opened by OlivierDehaene
July 24, 2023 08:41 20m 28s fix/exllama_buffers
feat: add cuda memory fraction
Server Tests #737: Pull request #659 synchronize by OlivierDehaene
July 24, 2023 08:25 14m 30s feat/memory_fraction
feat(server): Add exllama GPTQ CUDA kernel support #553
Server Tests #732: Pull request #666 synchronize by Narsil
July 21, 2023 08:15 11m 29s gptq-cuda-kernels2
feat(server): Add bitsandbytes 4bit quantization (#626)
Server Tests #731: Pull request #670 opened by OlivierDehaene
July 21, 2023 07:53 12m 48s dev
feat(server): Add exllama GPTQ CUDA kernel support #553
Server Tests #730: Pull request #666 synchronize by Narsil
July 21, 2023 07:29 11m 54s gptq-cuda-kernels2
feat(server): Add exllama GPTQ CUDA kernel support #553
Server Tests #729: Pull request #666 synchronize by Narsil
July 21, 2023 06:27 12m 58s gptq-cuda-kernels2
Add exllama GPTQ CUDA kernel support
Server Tests #728: Pull request #553 synchronize by Narsil
July 21, 2023 06:16 12m 8s fxmarty:gptq-cuda-kernels
feat(server): Add exllama GPTQ CUDA kernel support #553
Server Tests #727: Pull request #666 synchronize by Narsil
July 21, 2023 06:00 15m 23s gptq-cuda-kernels2
feat(server): Add exllama GPTQ CUDA kernel support #553
Server Tests #726: Pull request #666 synchronize by Narsil
July 20, 2023 22:09 11m 23s gptq-cuda-kernels2