I was about to post this:
I finished going through the benchmark blog post and noticed that the metric we are monitoring is `val_loss`, yet the monitor is in `mode="max"`. Shouldn't it be `"min"`, since a lower `XEDiceLoss` score indicates better performance?
But then I realized that `val_loss` is actually being logged as `epoch_iou`, not `xe_dice_loss`. That's a mistake, right? We shouldn't be calling our validation performance metric the validation loss. It ends up working out in the code, since we do want to maximize `iou`, but it was a bit confusing to me when I saw it being called the loss.
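For context, the `mode` argument in question controls whether the monitored quantity is treated as better-when-higher or better-when-lower. Here is a minimal stdlib sketch of that comparison logic (names are illustrative, not the benchmark's actual code):

```python
def is_improvement(current: float, best: float, mode: str) -> bool:
    """Return True if `current` beats `best` under the given mode."""
    if mode == "max":  # higher is better, e.g. IoU
        return current > best
    if mode == "min":  # lower is better, e.g. a loss such as XEDiceLoss
        return current < best
    raise ValueError(f"unknown mode: {mode!r}")

# A loss should be monitored with mode="min" ...
assert is_improvement(0.30, 0.45, mode="min")   # loss dropped: improvement
# ... while a metric like IoU should use mode="max".
assert is_improvement(0.82, 0.75, mode="max")   # IoU rose: improvement
```

So `mode="max"` only makes sense here if the monitored value really is a metric like IoU rather than a loss, which is exactly the source of the confusion above.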
@jacquesthibs Thanks for your note. You are correct that the learning rate scheduler is conditioned on the validation metric (IoU) logged at the end of each epoch, which we seek to maximize. However, PyTorch Lightning used to require that the name of the monitored metric be prefixed with `val_`, and conventionally expected `val_loss` (see the docs). We use the name `val_loss` here to avoid bugs, but acknowledge that IoU is not a loss function.
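To make the naming concrete, here is a hedged, stdlib-only sketch (hypothetical class and variable names, not the benchmark's actual code) of plateau-style scheduling conditioned on a maximized metric, in the spirit of `ReduceLROnPlateau` with `mode="max"`:

```python
class PlateauChecker:
    """Toy stand-in for a plateau-based LR scheduler in mode="max":
    flag a plateau when the monitored value stops increasing."""

    def __init__(self, patience: int = 2):
        self.patience = patience
        self.best = float("-inf")
        self.bad_epochs = 0

    def step(self, value: float) -> bool:
        """Record one epoch's monitored value; return True on plateau."""
        if value > self.best:       # "max" mode: bigger is better
            self.best = value
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience

# Even though the logged name is "val_loss", the value is the epoch IoU,
# so the scheduler must treat bigger as better (illustrative numbers).
logs = {"val_loss": [0.60, 0.70, 0.71, 0.71, 0.70]}
checker = PlateauChecker(patience=2)
plateaued = [checker.step(v) for v in logs["val_loss"]]
# Improvement stalls after epoch 3, so the plateau fires at epoch 5.
```

The point of the sketch is the mismatch being discussed: the key `"val_loss"` is just a label required by the framework convention, while the comparison logic must match what the value actually is (IoU, maximized).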
Ok, good to know. I do have a follow-up question, then: why are we using the metric for monitoring rather than the loss function? Typically we would use the loss, right?