Missing observations in the 2014-2017 test data

lz01 · June 8, 2017, 5:03pm

I was wondering if there were missing values in the 2014-2017 test data to which predictions are compared. If yes, then I guess these missing site-year pairs are not taken into account when evaluating predictions. If not, is it because there have been observations for all year-site combinations or because you have performed some extrapolation to deduce nests counts from the data that was available?

bicarrio · June 9, 2017, 1:51pm

Hello there,

My guess is that there is no data for every site, and every year in the target, since it is costly to get these data.
There must be some (maybe a lot) NaN values.
From the linear example, only actual measurements are considered when computing the AMAPE metric, so I’m guessing the same goes with the target.

Cheers,
Ben

lz01 · June 10, 2017, 1:43am

Hi Ben,
Thanks for your answer.
That was my guess too, given the number of missing values in the training set. But I was wondering if only the nest counts had been considered in the test set, or if the adult counts had also been used as a proxy for nest counts when there was no data. I guess not, if it hasn’t been mentioned.

Topic		Replies	Views
Nest_counts time series has more than raw observation? Random Walk of the Penguins	1	804	May 12, 2017
Site id in test set but not in train set Random Walk of the Penguins	4	1105	May 11, 2017
Prediction of nest counts, chicks, or adults? Random Walk of the Penguins	6	1181	May 28, 2017
Site_id and latitude/longitude precision Random Walk of the Penguins	2	1061	May 4, 2017
Penguin occurrence data Random Walk of the Penguins	2	1246	April 30, 2017

Missing observations in the 2014-2017 test data

Related topics