[Kernel] Initial Activation Quantization Support #4525
The name compressed_tensors is a bit unclear to me. I suppose this is the official method name of SparseML? If so, can we just use sparseml here?
compressed-tensors is the name of the package responsible for saving quantized and sparse models.
So the flow is:
safetensors with a compressed-tensors config