Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exception: >- DeepSpeed Op Builder: Installed CUDA version 12.1 does not match the version torch was compiled with 11.8 #107

Open
arvind-27 opened this issue Jun 13, 2023 · 1 comment

Comments

@arvind-27
Copy link

I was running valle.train for ar.yml and an error came up saying

_AttributeError: 'NoneType' object has no attribute 'optimizer_name'_

I installed deepseed version 0.8.3 as suggested by solutions from previous issues. However I am posed with another error saying

_Exception: >- DeepSpeed Op Builder: Installed CUDA version 12.1 does not match the version torch was compiled with 11.8, unable to compile cuda/cpp extensions without a matching cuda version._

I tried several things to resolve this including:-

DS_BUILD_OPS=1 pip install deepspeed==0.8.3
Downgrading pytorch versions to meet the cuda versions but none of them seems to be working. Downgrading pytorch version gives this error when I run the command for training autoregressive model

_OSError: /opt/conda/lib/python3.10/site-packages/torchaudio/lib/libtorchaudio.so: undefined symbol: _ZNK5torch8autograd4Node4nameB5cxx11Ev_

Can someone suggest me what to do to resolve the issue

@JonathanColetti
Copy link

JonathanColetti commented Jun 16, 2023

for this error _Exception: >- DeepSpeed Op Builder: Installed CUDA version 12.1 does not match the version torch was compiled with 11.8, unable to compile cuda/cpp extensions without a matching cuda version._

perform pip uninstall torch torchaudio then
pip install torch==1.9.0+cu121 -f https://download.pytorch.org/whl/torch_stable.html

for more info on this error check out this stack overflow post https://stackoverflow.com/questions/66116155/how-to-tell-pytorch-which-cuda-version-to-take

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants