Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Apply Retie Weights Fix Regardless of Transformers and TRL version for AutoGPTQ #94

Merged
merged 2 commits into from
Oct 25, 2024

Conversation

fabianlim
Copy link
Contributor

In the previous PR #90 we applied a fix for retying the weights, however it was incorrectly done for AutoGPTQ.

  • for BNB, we apply the fix only for certain transformers and trl versions, because the meta device low_cpu_mem_mode was enabled only for those versions
  • however for GPTQ, this is not the case, since we handle low_cpu_mem_mode seperately from HF. Hence the fix must be applied immediately. This PR fixes that.

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
@fabianlim fabianlim merged commit cb8235d into main Oct 25, 2024
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant