Scoring differences

vriveraq · October 7, 2019, 5:33pm

Hi!
To test the improvement of my models and have used
from sklearn.metrics import log_loss

which I believe is the measure also used for the submission and get different results. For example, using log_loss I get 0.11 and my submission score is 0.38 roughly. Can anyone explain why that is? Am I missing something?

Thank you!

talrejanikhil · October 8, 2019, 10:35am

Are you using train_test_split to test?

This would explain why the log loss score is different when you test v/s when you submit.

During submission, the score is computed against actual results whereas when you test, you would be using training data.

vriveraq · October 10, 2019, 8:23pm

Thank you! Is it usual that the difference would be large between using train_test_split and the actual data?

talrejanikhil · October 11, 2019, 9:53am

This would really depend on your model. If you have a very accurate one, it will perform well on the actual data as well.

Topic		Replies	Views
I'm having scoring issues Warm Up: Machine Learning with a Heart	2	640	May 2, 2019
About submission score Warm Up: Machine Learning with a Heart	1	575	September 13, 2019
Leaderboard performance Warm Up: Machine Learning with a Heart	3	988	July 28, 2019
Calculating the score Warm Up: Predict Blood Donations	2	744	April 14, 2023
Gap between prediction and results	1	539	January 5, 2020

Scoring differences

Related topics