Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA_VISIBLE_DEVICES=0,1 python test.py F0223 08:15:30.722712 2135 syncedmem.cpp:19] Check failed: error == cudaSuccess (29 vs. 0) driver shutting down *** Check failure stack trace: *** Aborted (core dumped) #23

Open
li0128 opened this issue Feb 23, 2021 · 3 comments

Comments

@li0128
Copy link

li0128 commented Feb 23, 2021

CUDA_VISIBLE_DEVICES=0,1 python test.py

F0223 08:15:30.722712 2135 syncedmem.cpp:19] Check failed: error == cudaSuccess (29 vs. 0) driver shutting down
*** Check failure stack trace: ***
Aborted (core dumped)

@fyangneil
Copy link
Owner

fyangneil commented Feb 26, 2021

@li0128 might be the problem of cuda version, please check the open or closed issue, there are someone have the same problem

@bolongpa
Copy link

bolongpa commented Mar 3, 2021

I meet the same problem. I essentially just increased the base learning rate in solver_fphb_crack.prototxt and the system throws out the error above. In my experiments, the error appears only when base learning rate is higher than 1e-6, while it works well when I try to use 1e-7 to 1e-9. I searched for solutions but I didn't find a feasible one.

@bolongpa
Copy link

bolongpa commented Mar 3, 2021

could you please clarify what this mean in solver_fphb_crack.prototxt?

lr for fine-tuning should be lower than when starting from scratch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants