Running the GPT2 example fails with RuntimeError: Could not find 'SLURM_PROCID'. Is a SLURM environment required? #1371
Unanswered
ZXM1063694570
asked this question in
Community | Q&A
Replies: 1 comment
-
Adding '--from_torch' after train_gpt.py would be helpful.
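To illustrate why the flag suggested above may matter: the error indicates that the script looked for `SLURM_PROCID`, a variable that `srun` exports for each task, whereas `torchrun` exports `RANK`, `LOCAL_RANK`, and `WORLD_SIZE` instead. Presumably `--from_torch` makes the example initialize from the torch-style variables (Colossal-AI provides both `colossalai.launch_from_torch` and `colossalai.launch_from_slurm`). The helper below is a hypothetical sketch, not part of Colossal-AI or the example script; it only shows how the two launch environments can be told apart.

```python
import os


def detect_launcher(env=None):
    """Guess which launcher started this process from its environment.

    Hypothetical helper for illustration only; it is not a Colossal-AI API.
    """
    env = os.environ if env is None else env
    # torchrun / torch.distributed.launch export these for every worker.
    if all(key in env for key in ("RANK", "LOCAL_RANK", "WORLD_SIZE")):
        return "torch"
    # srun exports these for every task of a SLURM job step.
    if all(key in env for key in ("SLURM_PROCID", "SLURM_NPROCS")):
        return "slurm"
    return "unknown"


if __name__ == "__main__":
    print(detect_launcher())
```

Inside a container with no SLURM installation, workers started through a torch-style launcher would report `torch` here, which is consistent with SLURM itself not being required once the script is told to read the torch-style variables.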
-
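I used the 0.1.7 image provided on Docker Hub, but running the GPT example fails with RuntimeError: Could not find 'SLURM_PROCID'. The same error occurs with the 0.1.8 image.
This is my run script:
Switching my gpt2_configs setting to other configs produces the same error.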
Environment:
docker pull hpcaitech/colossalai:0.1.7 & 0.1.8
pip install transformers
pip install titans
8 A100 GPUs