Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error: libnvrtc.so.12: cannot open shared object file: No such file or directory #14

Open
kazakovaanastasia opened this issue Jan 12, 2025 · 2 comments

Comments

@kazakovaanastasia
Copy link

kazakovaanastasia commented Jan 12, 2025

Steps to reproduce

Terminal:

  1. mlspace environments create --env name_test --python 3.11 --cuda 12.4
  2. conda activate name_test

Code: (go through Plan)

  1. Install from PyPI
    pip3 install 'torch==2.5.0'
    pip3 install para-attn
  2. Local Installation
    git clone https://github.com/chengzeyi/ParaAttention.git
    cd ParaAttention
    git submodule update --init --recursive

pip3 install 'torch==2.5.0'
pip3 install 'setuptools>=64' 'setuptools_scm>=8'

'# 'Pass --no-use-pep517 to speed up rebuild by using the legacy build system
'# 'which doesn't use a one-time tmp directory for the build
pip3 install -e '.[dev]' --no-build-isolation
'#' Or:
'# 'python3 setup.py develop

'#'Code formatting and linting
pip3 install pre-commit
pre-commit install
pre-commit run --all-files

3.torchrun --nproc_per_node=2 parallel_examples/run_cogvideox.py

What I got:
from .base import clone # noqa: E402
^^^^^^^^^^^^^^^^^^^^^^^
...
ModuleNotFoundError: No module named 'joblib'

After installing joblib i get more errors, that I can not fix. They might be related to cuda.

What I expected to get:
video in output

@kazakovaanastasia
Copy link
Author

after changing python to 3.10, error fixed. But now i get Could not load library libnvrtc.so.12. Error: libnvrtc.so.12: cannot open shared object file: No such file or directory
Could not load library libnvrtc.so. Error: libnvrtc.so: cannot open shared object file: No such file or directory
Could not load library libnvrtc.so. Error: libnvrtc.so: cannot open shared object file: No such file or directory . what version uf cuda should I use? mlspace environments create --env name_test --python 3.10 --cuda *

@kazakovaanastasia kazakovaanastasia changed the title ModuleNotFoundError: No module named 'joblib' Error: libnvrtc.so.12: cannot open shared object file: No such file or directory Jan 12, 2025
@chengzeyi
Copy link
Owner

chengzeyi commented Jan 12, 2025

after changing python to 3.10, error fixed. But now i get Could not load library libnvrtc.so.12. Error: libnvrtc.so.12: cannot open shared object file: No such file or directory

Could not load library libnvrtc.so. Error: libnvrtc.so: cannot open shared object file: No such file or directory

Could not load library libnvrtc.so. Error: libnvrtc.so: cannot open shared object file: No such file or directory . what version uf cuda should I use? mlspace environments create --env name_test --python 3.10 --cuda *

You could try vanilla Python venv instead of third party venv manager like conda or mlspace which you use. These providers could have compatibility problems with torch and cuda runtime.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants