Question about the negative label #1

Open · Z-ZHHH opened this issue Sep 25, 2022 · 2 comments

Comments


Z-ZHHH commented Sep 25, 2022

Great work!
Can the loss value become negative during training if we use negative labels? When the features collapse to the class prototype, the logits become strictly one-hot, so it seems that the loss value goes to -infinity.

weijiaheng (Collaborator) commented

Yes, the loss can go negative when learning with negative labels under the cross-entropy loss. This is simply because the CE loss multiplies each per-class term -log(p_i) by the corresponding soft label. For the irrelevant classes (those not equal to the training label), the soft label is negative, so those terms contribute negative values and the total loss can become negative (see here).
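To make the sign flip concrete, here is a minimal sketch (not the repository's code) of cross-entropy with a generalized smoothing rate r, where r < 0 corresponds to negative labels. The function name and the toy logits are illustrative assumptions; with a near one-hot prediction, the negative soft labels on the wrong classes dominate and the loss drops below zero.

```python
import torch
import torch.nn.functional as F

def gls_cross_entropy(logits, targets, smooth_rate):
    """Cross-entropy with a generalized smoothing rate.
    smooth_rate > 0: positive label smoothing; smooth_rate < 0: negative label smoothing (NLS)."""
    num_classes = logits.size(-1)
    log_probs = F.log_softmax(logits, dim=-1)
    one_hot = F.one_hot(targets, num_classes).float()
    # Soft label: (1 - r) * one_hot + r / K. For r < 0, the wrong-class entries are negative.
    soft_targets = (1.0 - smooth_rate) * one_hot + smooth_rate / num_classes
    return -(soft_targets * log_probs).sum(dim=-1).mean()

# A confident, near one-hot prediction on the correct class:
logits = torch.tensor([[10.0, -10.0, -10.0]])
target = torch.tensor([0])
print(gls_cross_entropy(logits, target, smooth_rate=-0.4))  # roughly -5.3: negative loss
```

As the prediction gets ever more confident, -log(p_i) for the wrong classes grows without bound while its weight stays negative, which is exactly the loss going toward minus infinity described in the question.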

In our paper, we discuss how to address this issue in practice (Appendix D.2). Briefly, negative labels rely on a relatively well-pre-trained model, since the mechanism works by enhancing the model's confidence in its own predictions. If we train with negative labels from the very beginning of the training procedure, the model may become overly confident in a bad representation (the learned representation is likely to be poor early in training).
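As a hypothetical illustration of that advice (not the paper's exact recipe), one could warm up with plain cross-entropy and only switch to a negative smoothing rate once the representation is reasonably trained. The epoch split, the rate of -0.4, and the toy model below are all assumptions; it reuses gls_cross_entropy from the sketch above.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
X = torch.randn(256, 20)                 # toy inputs, purely illustrative
y = torch.randint(0, 3, (256,))          # toy labels over 3 classes
model = nn.Linear(20, 3)
opt = torch.optim.SGD(model.parameters(), lr=0.1)

total_epochs, warmup_epochs = 20, 10     # assumption: the split is dataset/model dependent
for epoch in range(total_epochs):
    # Plain CE (smooth_rate = 0) while the representation is still poor,
    # then switch to a negative smoothing rate (NLS) for the rest of training.
    smooth_rate = 0.0 if epoch < warmup_epochs else -0.4
    opt.zero_grad()
    loss = gls_cross_entropy(model(X), y, smooth_rate)
    loss.backward()
    opt.step()
```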


Z-ZHHH commented Sep 26, 2022

Thanks a lot!
I just tried NLS for the whole training process and it didn't work. Thanks for the details.
