How are you guys validating?

Loki_K · February 2, 2023, 6:56pm

I know this is pretty late in the competition, but how are you guys locally validating metadata, especially those at the top of the leaderboard? My local and LB scores do not correlate at all.

BrandenKMurray · February 3, 2023, 12:39am

Just doing Stratified K-Fold. CV (~0.67) is below LB (~0.76).

kwetstone · February 3, 2023, 2:44pm

@Loki_K The test data used for final scores does have a similar distribution to the train data, but scores won’t be exactly the same. If you are evaluating your model on data that was used in training, that may also be a reason why it performs differently on new, unseen data.

I hope that’s helpful, feel free to follow up with any other questions!

Loki_K · February 6, 2023, 4:13pm

Ohh, thank you for sharing.
Sorry if this is a really dumb question or if I’m missing anything, I am still a student.
But as mentioned here, is doing stratified k-fold still a valid way to go?

Loki_K · February 6, 2023, 4:16pm

Yeah, thank you.
The val_set I carved out has coordinates (lat, lng) that are also in train_set, but the actual test_set has no coordinates in common with train_set. That could be the reason for the discrepancy between my local and LB error.
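
One way to mimic that property locally is to split by location, so that no (lat, lng) pair ends up in both the training and validation folds. Below is a minimal plain-Python sketch of the idea. The function names `location_group_folds` and `train_val_indices` and the toy coordinates are hypothetical, and nobody in this thread has confirmed using this exact scheme.

```python
from collections import defaultdict


def location_group_folds(coords, n_folds=5):
    """Assign sample indices to folds so identical (lat, lng) pairs share one fold.

    `coords` is a list of (lat, lng) tuples, one per sample (assumed format).
    """
    # Collect the sample indices that belong to each distinct location.
    groups = defaultdict(list)
    for idx, (lat, lng) in enumerate(coords):
        groups[(lat, lng)].append(idx)

    # Greedy balancing: place the largest locations first, each into
    # whichever fold currently holds the fewest samples.
    folds = [[] for _ in range(n_folds)]
    for key in sorted(groups, key=lambda k: (-len(groups[k]), k)):
        smallest = min(range(n_folds), key=lambda f: len(folds[f]))
        folds[smallest].extend(groups[key])
    return folds


def train_val_indices(folds, val_fold):
    """Return (train, val) index lists, holding out fold number `val_fold`."""
    val = sorted(folds[val_fold])
    train = sorted(
        i for f, fold in enumerate(folds) if f != val_fold for i in fold
    )
    return train, val


# Toy usage: six samples, two locations sampled twice.
coords = [(1.0, 2.0), (1.0, 2.0), (3.0, 4.0), (5.0, 6.0), (3.0, 4.0), (7.0, 8.0)]
folds = location_group_folds(coords, n_folds=3)
train_idx, val_idx = train_val_indices(folds, val_fold=0)
```

This only groups exact coordinate matches. Nearby but non-identical points would still be split across folds, so rounding the coordinates before grouping might be worth trying.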

BrandenKMurray · February 6, 2023, 5:02pm

I’ve taken that to mean that for a given sample you cannot use imagery/climate data from dates that are after the date of the sample, not that you can’t include future samples as part of your training set.

Loki_K · February 7, 2023, 7:21am

Here @kwetstone mentioned that for a given sample, you can only use information that was already available at the time the sample was taken.

BrandenKMurray · February 7, 2023, 8:45am

@kwetstone can you please clarify:

The earliest test sample is on 2013-01-08. There are only 5 train samples taken before that date. Is it true that when making predictions for that test sample we can only use a model that was trained on only those 5 samples?

My understanding is that we are allowed to train a model on the entire dataset at once, but for the test sample on 2013-01-08 all of the features/images/data for that sample must be from on or before that date.

kwetstone · February 7, 2023, 3:44pm

@Loki_K @BrandenKMurray These are great questions!

BrandenKMurray wrote: "My understanding is that we are allowed to train a model on the entire dataset at once, but for the test sample on 2013-01-08 all of the features/images/data for that sample must be from on or before that date."

This is correct. You can train a model on the full dataset, but when running inference on a given sample you can only use data that was available at the time the sample was taken. This means that in training, you’ll also want the features for any given sample to be derived from data that was available when the sample was taken (so that the training setup is an accurate reflection of inference). In other words, during training the samples should be treated independently.
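
To make the rule concrete, here is a minimal sketch of a per-sample feature that only looks at data dated on or before the sample date. The observation format (a dict with `date` and `value` keys) and the function names are assumptions for illustration, not part of the competition data or any official tooling.

```python
from datetime import date


def observations_available(sample_date, observations):
    """Keep only observations dated on or before the sample date."""
    return [obs for obs in observations if obs["date"] <= sample_date]


def mean_value_feature(sample_date, observations):
    """Average of the usable observations, or None if nothing is usable."""
    usable = observations_available(sample_date, observations)
    if not usable:
        return None
    return sum(obs["value"] for obs in usable) / len(usable)


# Toy usage: the observation from 2013-01-09 is ignored for a sample
# taken on 2013-01-08, during training and inference alike.
observations = [
    {"date": date(2013, 1, 1), "value": 10.0},
    {"date": date(2013, 1, 8), "value": 20.0},
    {"date": date(2013, 1, 9), "value": 90.0},
]
feature = mean_value_feature(date(2013, 1, 8), observations)
```

Applying the same cutoff when building training features keeps the training setup consistent with what is allowed at inference.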

Loki_K · February 7, 2023, 4:31pm

Thanks for the clarification.
