Error when running train.py #1

Open
zhenwwang opened this issue Aug 10, 2019 · 4 comments


[ERROR 01]
File "./classifier\local_classifier.py", line 10, in
from backward_hooks import *
ModuleNotFoundError: No module named 'backward_hooks'

I just commented out this line, and then the second error occurred:

[ERROR 02]
File "C:\Users\Wang\Desktop\8_12\layer_augmentation-master\data.py", line 251, in getitem
char1 = self.char_idx[all_source.contiguous()].view(batch_l, source_l, token_l)
IndexError: tensors used as indices must be long, byte or bool tensors

I fixed this by adding .type(torch.uint8), like this:
"char1 = self.char_idx[all_source.contiguous().view(-1).type(torch.uint8)].view(batch_l, source_l, token_l)"

but errors continued:

[ERROR 03]
File "C:\Users\Wang\Desktop\8_12\layer_augmentation-master\data.py", line 252, in getitem
char1 = self.char_idx[all_source.contiguous().view(-1).type(torch.uint8)].view(batch_l, source_l, token_l)
IndexError: The shape of the mask [320] at index 0 does not match the shape of the indexed tensor [34292, 16] at index 0

What should I do?

t-li (Member) commented Aug 10, 2019

Hi,

Thanks for reporting the issues.

I have just pushed a commit that fixes some minor issues that occurred on my end. Please give it a shot. [ERROR 1] should be gone now.

[ERROR 2] did not happen on my end. I cleaned up all the preprocessed data and restarted from scratch, and still got no such error. It seems that when saving the variable [all_source] in preprocess.py, the format is already np.int (at line 152 of preprocess.py), so when loading in data.py it should stay int by default. Maybe you are using a newer version of PyTorch that changed the default behavior? I am using pytorch 1.0.1.post2.

You might want to try something like this at lines 29-33 in data.py:
self.all_source = torch.from_numpy(self.all_source.astype(np.int32))
self.all_target = torch.from_numpy(self.all_target.astype(np.int32))
self.source = torch.from_numpy(self.source.astype(np.int32))
self.target = torch.from_numpy(self.target.astype(np.int32))
self.label = torch.from_numpy(self.label.astype(np.int32))

As for your workaround, I suggest avoiding torch.uint8, which is too small to hold token indices.
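
To illustrate why the uint8 cast leads to [ERROR 03], here is a minimal sketch with made-up shapes that mirror the error message (not the repo's actual data):

import torch

char_idx = torch.randint(0, 100, (34292, 16))   # stand-in for self.char_idx
flat = torch.randint(0, 34292, (320,))          # stand-in for the flattened token indices

# A long tensor is treated as a list of row indices: one row of char_idx per value.
chars = char_idx[flat.long()]                    # shape [320, 16]

# A uint8 tensor is instead treated as a boolean mask over dim 0, so its length
# must equal char_idx.size(0). With only 320 entries PyTorch raises
# "The shape of the mask [320] ... does not match the shape of the indexed tensor [34292, 16] ...".
# bad = char_idx[flat.type(torch.uint8)]         # IndexError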

t-li (Member) commented Aug 10, 2019

Just made another push that forces indices to be long format.

My hypothesis is that my numpy int defaults to int64, so it worked on my end. On some platforms np.int defaults to int32, and PyTorch wants indices to be int64/long.
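
For reference, a minimal sketch of that point (the arrays are illustrative, not the repo's actual variables):

import numpy as np
import torch

# numpy's default integer is platform-dependent: int32 on Windows builds,
# int64 on most 64-bit Linux/macOS builds.
idx_np = np.array([3, 1, 4], dtype=np.int_)

# PyTorch wants index tensors to be int64/long, so force the dtype explicitly
# before (or after) converting to a tensor.
idx = torch.from_numpy(idx_np.astype(np.int64))   # or: torch.from_numpy(idx_np).long()

table = torch.randn(10, 5)
print(table[idx].shape)                            # torch.Size([3, 5])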

Please pull again and give it a shot.

zhenwwang (Author)

@t-li
Thanks for your reply.

I just pulled again and that solved these problems.

It works well now.

Thanks.

rndn123 commented Jul 29, 2020

I got this error in your program and I don't understand it. Please help; thanks in advance!
Traceback (most recent call last):
File "train.py", line 305, in
sys.exit(main(sys.argv[1:]))
File "train.py", line 300, in main
train(opt, shared, m, optim, ema, train_data, val_data)
File "train.py", line 178, in train
train_perf, extra_train_perf, loss, num_ex = train_epoch(opt, shared, m, optim, ema, train_data, i, train_idx)
File "train.py", line 113, in train_epoch
output = m.forward(wv_idx1, wv_idx2, cv_idx1, cv_idx2)
File "/content/drive/My Drive/layer_augmentation-master/layer_augmentation-master/pipeline.py", line 104, in forward
att1, att2 = self.attention(input_enc1, input_enc2)
File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 550, in call
result = self.forward(*input, **kwargs)
File "./attention/local_attention.py", line 36, in forward
self.shared.att_soft1, self.shared.att_soft2)
File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 550, in call
result = self.forward(*input, **kwargs)
File "/content/drive/My Drive/layer_augmentation-master/layer_augmentation-master/within_layer.py", line 86, in forward
datt1_ls.append(layer(att1.transpose(1,2)).transpose(1,2).contiguous().view(1, batch_l, sent_l1, sent_l2))
File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 550, in call
result = self.forward(*input, **kwargs)
File "./constraint/n1.py", line 57, in forward
d = self.logic(att)
File "./constraint/n1.py", line 37, in logic
p_content_selector = get_p_selector(self.opt, self.shared, 'content_word', with_nul=False).view(self.shared.batch_l, 1, self.shared.sent_l1)
File "./constraint/constr_utils.py", line 58, in get_p_selector
mask[ex][p_contents] = 1.0
IndexError: index 15 is out of bounds for dimension 0 with size 14
