forked from huggingface/text-generation-inference
-
Notifications
You must be signed in to change notification settings - Fork 48
Issues: huggingface/tgi-gaudi
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
warmup error when MAX_TOTAL_TOKENS and max_input_length are not power of 2 numbers
#256
opened Dec 17, 2024 by
rbrugaro
2 of 4 tasks
tgi-gaudi server error with long inputs sent to chat_completion api using openai python sdk
#248
opened Nov 22, 2024 by
minmin-intel
2 of 4 tasks
Incorrect answer with openai compatible penalty parameters
#238
opened Oct 17, 2024 by
Spycsh
2 of 4 tasks
Generation stopped too early without hitting stop condition
#223
opened Sep 18, 2024 by
minmin-intel
2 of 4 tasks
llama3.1-70B-instruct 422 error Template error: unknown test: test iterable is unknown (in <string>:99)
#218
opened Sep 3, 2024 by
minmin-intel
2 of 4 tasks
Best Performance for a single card for Llama-2-7b-chat-hf
#196
opened Jul 29, 2024 by
AdityaKulshrestha
setting token flags still results in console warning
#195
opened Jul 28, 2024 by
endomorphosis
2 of 4 tasks
low throughput while using TGI-Gaudi on bigcode/starcoderbase-3b on Gaudi2
#166
opened Jun 22, 2024 by
vishnumadhu365
3 of 4 tasks
ProTip!
Add no:assignee to see everything that’s not assigned.