Hi,
Apologies if this has been asked previously, but could you clarify what the target label is for the pandemic track? On the data overview page it is described in two places as:
- Your task is to predict a risk score for the binary disease state (infected or not infected) of each individual in the final week of the simulation
- The target variable for the modeling task is a risk score (between 0.0 and 1.0) for each individual in the population. That risk score corresponds to a confidence that that individual enters into an symptomatic infected I disease state at any time during the final week of the simulation (between days 56 and 63).
Specifically, if someone has a status of 'I' on both days 56 and 57, what label would they have in the target? The first statement above would suggest 1 whilst the second would suggest 0.
I would assume it is the second case. However, if that is the case, I believe there is an error in the va_disease_outcome_target.csv data, since a few individuals have positive labels but already have either 'R' or 'I' status by day 56.
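A check along the following lines would surface such cases. It is only a minimal sketch on toy data: the record layout, the names `daily_states` and `targets`, and the helper `positives_already_infected` are assumptions for illustration, not the actual schema of the competition files.

```python
# Toy stand-ins for the real files; the field layout is assumed, not the actual schema.
# Daily disease states as (person id, day, state), with state in {"S", "I", "R"}.
daily_states = [
    (1, 55, "S"), (1, 56, "S"),
    (2, 55, "I"), (2, 56, "I"),
    (3, 55, "I"), (3, 56, "R"),
    (4, 55, "S"), (4, 56, "S"),
]
# Target labels: person id -> 1 if labelled positive for the final week, else 0.
targets = {1: 1, 2: 1, 3: 1, 4: 0}


def positives_already_infected(daily_states, targets, cutoff_day=56):
    """Return ids with a positive label whose state on cutoff_day is I or R."""
    state_on_cutoff = {
        pid: state for pid, day, state in daily_states if day == cutoff_day
    }
    return sorted(
        pid
        for pid, label in targets.items()
        if label == 1 and state_on_cutoff.get(pid) in {"I", "R"}
    )


suspicious = positives_already_infected(daily_states, targets)
print(suspicious)
```

Under the second definition (entering 'I' during the final week), any id this returns would be a positive label that looks inconsistent with the state history.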
Thank you