Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Fix] Export TORCH_CUDA_ARCH_LIST in install.sh #133

Merged
merged 6 commits into from
Jul 1, 2024

Conversation

LeiWang1999
Copy link
Contributor

@LeiWang1999 LeiWang1999 commented Jul 1, 2024

This pull request includes a change to the install.sh script to export the TORCH_CUDA_ARCH_LIST variables in install.sh scripts. Otherwise, the build pipeline of cuda backend will crash.

Also remove importlib from requirements.txt ref to #132

@Qubitium
Copy link
Contributor

Qubitium commented Jul 1, 2024

@LeiWang1999 Thanks! LGTM. I removed PTX flag since we don't want JIT of the cuda codes for now. We observed slightly faster code with PTX disabled.

@Qubitium Qubitium merged commit ac666be into ModelCloud:main Jul 1, 2024
DeJoker pushed a commit to DeJoker/GPTQModel that referenced this pull request Jul 19, 2024
* chore: update TORCH_CUDA_ARCH_LIST in install.sh

* chore: remove importlib from requirements.txt

* chore: Fix multi-gpu environment check in bitblas_target_detector.py

* chore: Update logger.info message in qlinear_bitblas.py

* Update install.sh

We like to not use PTX compilation for now.

---------

Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants