1-fold confusion matrix

edumotya · December 14, 2019, 11:29am

We would like to encourage other teams to share their results, so that we can all have fruitful discussions and learn from each other. Therefore, here you have the confusion matrix for an image model trained on just one fold of our local validation set.

This model achieves a loss of 0.28 in our local validation set and a loss of 0.6272 on the leaderboard.
Also, we are still open to find new teammates, so if you are interested drop us a line!

sanket10 · December 14, 2019, 11:08pm

How come there is huge difference between local(CV) loss and leaderboard loss.
I am also facing the same issue, but still this difference seems to be huge.

For me it is 0.44 in local CV and 0.67 on leaderboard. I would like to team up to explore more and see if we can get something new.

bwarner · December 15, 2019, 1:04am

I’ve found that with a well constructed training-validation split local cv loss matches up quite well with the leaderboard loss.

SamSepiol · December 15, 2019, 2:27am

can u share a sample of your CV score and LB score ?

florpi · December 15, 2019, 8:53am

Do you mean using stratify to have the same proportion of different classes?

edumotya · December 15, 2019, 12:41pm

Great! we have sent an invitation to you

bwarner · December 17, 2019, 6:33pm

My best single fold score was ~0.42 for both CV and LB.

Yeah, a good validation set is representative of the data the model will see in the future, which in our case is the test set. Rachel Thomas has a good article on items to consider when creating validation sets.

Topic		Replies	Views
How are you guys validating? Tick Tick Bloom Challenge	9	486	February 7, 2023
Large difference in validation loss and test loss Pri-matrix Factorization	6	1261	November 12, 2017
Sharing CV scores and LB scores VisioMel Challenge	6	665	May 11, 2023
No discussion for this competition? Mapping Disaster Risk from Aerial Imagery	7	651	December 16, 2019
Different results on personal test set and competition test set Mapping Disaster Risk from Aerial Imagery	3	640	December 11, 2019

1-fold confusion matrix

Related topics