The current rules and evaluation methods unfortunately do not allow a fair and equal competition for all participants.
Firstly, only the best-performing model from the public leaderboard should be selected for evaluation on the private test set, rather than allowing multiple submissions, or worse, all of them. Evaluating many submissions lets participants effectively probe the private test set and pick whichever submission happens to land at the low end of the log loss distribution, gaining an unfair advantage over others. Restricting selection to a single submission would keep the competition fair and ensure that the final model is chosen based on its performance on the public test set.
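A minimal sketch of why this matters, under hypothetical assumptions (a 1,000-example binary private test set, and 100 submissions whose probabilities are pure noise with no real signal): even with zero modelling skill, the best of many private-set scores looks noticeably better than a typical one, purely by chance.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: binary private labels and many submissions
# whose predicted probabilities carry no real information.
n_private = 1_000
y_private = rng.integers(0, 2, n_private)

def log_loss(y_true, p):
    """Average binary cross-entropy; probabilities clipped to avoid log(0)."""
    p = np.clip(p, 1e-15, 1 - 1e-15)
    return float(-np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p)))

# Score 100 equally uninformative submissions on the private labels.
scores = [log_loss(y_private, rng.uniform(0.3, 0.7, n_private))
          for _ in range(100)]

# The minimum over many submissions beats the typical score by chance
# alone: selecting it rewards probing the private set, not modelling.
print(f"typical private score: {np.mean(scores):.4f}")
print(f"best of 100 scores:    {min(scores):.4f}")
```

The gap between "typical" and "best of 100" here is exactly the unearned advantage that single-submission selection would remove.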
Additionally, I would like to point out that using log loss as the evaluation metric puts some participants at a disadvantage, because it does not favor otherwise scientifically meaningful submissions optimized for metrics such as accuracy or ROC AUC. It may still be accepted that log loss was, for some reason, determined to be the best-suited metric for this challenge; but in that case it is essential that the distribution of both the public and private test sets be published and made available to everyone, to ensure fairness and transparency.
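To illustrate the disadvantage, here is a small hypothetical example (all labels and probability values are made up): two submissions that classify every example identically, and therefore have the same accuracy, can receive very different log loss scores, because log loss also grades the confidence of the predicted probabilities.

```python
import numpy as np

def log_loss(y_true, p):
    """Average binary cross-entropy; probabilities clipped to avoid log(0)."""
    p = np.clip(np.asarray(p, dtype=float), 1e-15, 1 - 1e-15)
    y = np.asarray(y_true, dtype=float)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))

def accuracy(y_true, p):
    """Fraction of examples classified correctly at a 0.5 threshold."""
    return float(np.mean((np.asarray(p) >= 0.5) == np.asarray(y_true)))

# Two hypothetical submissions with identical class decisions
# but different confidence levels.
y = [1, 1, 0, 0]
confident = [0.95, 0.95, 0.05, 0.05]
cautious = [0.60, 0.60, 0.40, 0.40]

print(accuracy(y, confident), accuracy(y, cautious))  # same accuracy
print(log_loss(y, confident), log_loss(y, cautious))  # very different log loss
```

This is why knowing the test sets' class distribution matters under log loss: participants who cannot calibrate their probabilities to it are penalized even when their classifications are just as good.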