The announcement from Nov 17 says that we should only use feature data from the water year.
Does this also mean that, for example, the following feature would be prohibited:
Maximum yearly volume observed at a site (so far).
Let’s say I make a prediction for 2017. For the mentioned feature I take the volumes from all 2016 and before (except the test years) and calculate the maximum per site.
These types of features do not use data from test_monthly_naturalized_flow.csv but only from train.csv.
Are such features allowed or not?