I have two question about the test data used in the Pandemic Forecasting.
In the baseline example the test data is derived from the same data set used for training (essentially a subset of the the training data). Can we assume that this will remain the case in the contest or could the test data be derived from an entirely different set of data than what is used for training?
When we write the predictions to a .csv file I’m unclear as to whether we write predictions for just the test data or for every individual in the original data set. Could you please clarify that.