Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add 4-bit quantized inference to run BLOOM-176B on 2 A100 GPUs #2526
base: master
Are you sure you want to change the base?
Add 4-bit quantized inference to run BLOOM-176B on 2 A100 GPUs #2526
Changes from 13 commits
4114bea
2ce22d7
d2997bf
5341811
e2f6fe9
410faf7
dd03ae6
44184ca
efab0aa
bc1d63e
741e80e
92f7aab
803bf2a
5917d5a
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing