TensorFlow: Interpretation of Weight in Weighted Cross Entropy

Asked 19/11, 2016 at 22:41 Answered 20/11, 2016 at 9:41

The TensorFlow function tf.nn.weighted_cross_entropy_with_logits() takes the argument pos_weight. The documentation defines pos_weight as "A coefficient to use on the positive examples." I assume this means that increasing pos_weight increases the loss from false positives and decreases the loss from false negatives. Or do I have that backwards?

Fairhaired answered 19/11, 2016 at 22:41 Comment(0)

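Actually, it's the other way around: increasing pos_weight increases the loss from false negatives and leaves the loss from false positives unchanged. Citing the documentation: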

The argument pos_weight is used as a multiplier for the positive targets.

So, assuming you have 5 positive examples in your dataset and 7 negative, if you set the pos_weight=2, then your loss would be as if you had 10 positive examples and 7 negative.
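To make the "as if you had 10 positive examples" reading concrete, here is a minimal NumPy sketch. It assumes the per-example formula given in the TensorFlow documentation, pos_weight * y * -log(sigmoid(x)) + (1 - y) * -log(1 - sigmoid(x)); the helper name weighted_bce and the random logits are made up for illustration and are not part of TensorFlow. Note the equivalence holds for the summed loss; a mean would differ because the duplicated dataset has more rows.

```python
import numpy as np

def weighted_bce(labels, logits, pos_weight):
    """Per-example weighted sigmoid cross entropy (hypothetical helper).

    Assumed formula, taken from the TensorFlow docs:
    pos_weight * y * -log(sigmoid(x)) + (1 - y) * -log(1 - sigmoid(x))
    """
    labels = np.asarray(labels, dtype=float)
    logits = np.asarray(logits, dtype=float)
    # -log(sigmoid(x)) == log(1 + exp(-x)), computed stably
    neg_log_p = np.logaddexp(0.0, -logits)
    # -log(1 - sigmoid(x)) == log(1 + exp(x))
    neg_log_not_p = np.logaddexp(0.0, logits)
    return pos_weight * labels * neg_log_p + (1.0 - labels) * neg_log_not_p

rng = np.random.default_rng(0)
labels = np.array([1.0] * 5 + [0.0] * 7)   # 5 positives, 7 negatives
logits = rng.normal(size=12)               # arbitrary model outputs

# Summed loss with pos_weight=2 on the original 12 examples
weighted_total = weighted_bce(labels, logits, pos_weight=2.0).sum()

# Summed unweighted loss after duplicating the 5 positive examples
dup_labels = np.concatenate([labels, labels[:5]])
dup_logits = np.concatenate([logits, logits[:5]])
duplicated_total = weighted_bce(dup_labels, dup_logits, pos_weight=1.0).sum()

print(np.isclose(weighted_total, duplicated_total))
```
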

Assume you got all of the positive examples wrong and all of the negative ones right. You would then have 5 false negatives and 0 false positives. Increasing pos_weight does not change those counts; it scales the loss contributed by each false negative, so with pos_weight=2 the 5 false negatives cost as much as 10 would. Note that the loss coming from false positives doesn't change.
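A quick way to check this for a single example, again as a sketch that assumes the documented formula rather than calling TensorFlow itself (weighted_loss and the logit values of -3 and 3 are illustrative choices):

```python
import math

def weighted_loss(label, logit, pos_weight):
    """Scalar weighted sigmoid cross entropy (hypothetical helper)."""
    p = 1.0 / (1.0 + math.exp(-logit))  # predicted probability of the positive class
    return pos_weight * label * -math.log(p) + (1 - label) * -math.log(1 - p)

# False negative: the label is positive, but the model is confident it is negative.
fn_loss_w1 = weighted_loss(label=1, logit=-3.0, pos_weight=1.0)
fn_loss_w2 = weighted_loss(label=1, logit=-3.0, pos_weight=2.0)

# False positive: the label is negative, but the model is confident it is positive.
fp_loss_w1 = weighted_loss(label=0, logit=3.0, pos_weight=1.0)
fp_loss_w2 = weighted_loss(label=0, logit=3.0, pos_weight=2.0)

print(fn_loss_w2 / fn_loss_w1)   # ratio of false-negative losses
print(fp_loss_w2 - fp_loss_w1)   # change in false-positive loss
```

The false-negative loss scales with pos_weight because only the label-1 term is multiplied by it; the label-0 term, which is where false positives are penalized, never sees the weight.
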

Brotherson answered 20/11, 2016 at 9:41 Comment(2)

Thanks. So if using a mutually exclusive classifier with more than 2 classes and 1-hot truth labels, increasing pos_weight has the effect of amplifying the losses in all cases with wrong estimates, and cases with correct estimates are unchanged (because the loss in the correct-estimate cases is zero)? – Fairhaired 20/11, 2016 at 16:30

amplifying the losses in all cases with false negatives, but yes, I think so. – Brotherson 20/11, 2016 at 16:56
