Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training : Killed #10

Open
schnellerblitz99 opened this issue Oct 27, 2021 · 1 comment
Open

Training : Killed #10

schnellerblitz99 opened this issue Oct 27, 2021 · 1 comment

Comments

@schnellerblitz99
Copy link

schnellerblitz99 commented Oct 27, 2021

Hey,

python contact_graspnet/train.py --ckpt_dir checkpoints/custom1 --data_path acronym/

I am trying to get the training pipeline running, but after showing "**** EPOCH 000 ****" my training is Killed. I could pin down the error to this line sess.run(ops['iterator'].initializer) in train.py line 109. Do you have an idea what might be wrong? I am using a conda env, when I run sh compile_pointnet_tfops.sh I get:

[ RUN ] GroupPointTest.test
[ OK ] GroupPointTest.test
[ RUN ] GroupPointTest.test_grad

1.6927719116210938e-05
[ OK ] GroupPointTest.test_grad
[ RUN ] GroupPointTest.test_session
[ SKIPPED ] GroupPointTest.test_session

3.540515899658203e-05
[ OK ] GroupPointTest.test_grad
[ RUN ] GroupPointTest.test_session
[ SKIPPED ] GroupPointTest.test_session

Might this be the reason why it is not working or is this fine?

My training data folder looks like this (no meshes folder):

acronym
--grasps
--scene_contacts
--splits

Thank you very much!

@MartinSmeyer
Copy link
Contributor

First, you definitely need the meshes for training, please follow the instructions in the README.

Concerning your error, how much RAM do you have on your machine? Could you monitor it while running train.py? All grasp annotations are loaded into RAM at the start of training for efficiency reasons, but if you have less than 100GB the program might be killed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants