Hi @jash.shah and @charles.hornbaker,
Could I ask two questions about this please.
First, what do we mean by "test set" here? Is it the nest_counts.csv file? If so, that's not really labelled test data in the standard sense, is it? Isn't it just a consolidated time series view of the training_set_observations.csv data?
Second, I can see STOK in nest_count but not in training_set_observations.csv. In nest_count, as far as I can tell, it has no observations whatsoever against it. This is also true in the error file (training_set_e_n.csv). So what does Charles mean by saying STOK is "one of the cases where the first observations occur in the test set"?