Final Outcome: Model or Algorithm?

It may be a bit late to ask this question, but I only realised it now. What is the final expected outcome of this challenge — which of the following two?

  1. A model/set_of_models that can be used on future issue dates (i.e. going forward from now: 2024, 2025, 2026, etc.)? This model would be developed so that it has minimum LOOCV error across the last 20 years.
    OR
  2. A machine learning algorithm that outputs the lowest errors over the last 20 years, designed in such a way that each year is left out while training on the others. The example.py given seemingly follows this route, as it saves a model with a name combining site, year, and quantile (line 168: f"{site}-{year}-{quantile}.joblib"). Here, e.g., the model saved for year 2005 won't have any future use, but it is developed in such a way that it gives minimum error for year 2005. Hence, this example code looks like just an ALGORITHM aimed at minimizing LOOCV scores across all years, without any model/set_of_models for use in the future.
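For concreteness, here is a minimal, purely illustrative sketch of the leave-one-year-out loop and the per-fold naming pattern described in item 2. The toy data and the stand-in "model" (just an empirical quantile of the training targets) are my assumptions for illustration, not the actual example.py code:

```python
# Hypothetical sketch of leave-one-year-out CV, mirroring the
# f"{site}-{year}-{quantile}.joblib" naming in example.py.
# Data and the "model" are toy stand-ins, not the real pipeline.
import random

random.seed(0)
years = list(range(2004, 2024))  # a 20-year CV period
flows = {y: [random.gauss(100, 20) for _ in range(30)] for y in years}

def empirical_quantile(values, q):
    # Stand-in "model": the q-th empirical quantile of training targets.
    s = sorted(values)
    return s[min(int(q * len(s)), len(s) - 1)]

site = "example_site"
models = {}
for held_out in years:
    # Fit on every year EXCEPT the held-out one; that fold's model
    # is then scored only on the held-out year.
    train = [v for y in years if y != held_out for v in flows[y]]
    for q in (0.1, 0.5, 0.9):
        models[f"{site}-{held_out}-{q}"] = empirical_quantile(train, q)

# models now holds one fitted artifact per (site, held-out year, quantile)
# fold: 20 years x 3 quantiles = 60 entries, none of which is a single
# "final" model for future use.
```

The point of the sketch is only that each saved artifact is tied to one held-out year, which is what motivates the question above.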

In my opinion, if we are doing CV, then approach 1 should be the expected outcome.

Please clarify.

Thanks!

it is developed in such a way that it gives minimum error for year 2005

To my ears this sounds like using validation data for fitting, which is bad modeling practice and should not be accepted as a solution, in my opinion.


Hi @kmande,

The goal of the challenge is to find the most accurate modeling methodology, one that can lead to models usable operationally in the future. The cross-validation is an evaluation procedure to estimate the accuracy of a modeling approach. As such, your “approach 1” describes this most closely.

What you describe in “approach 2” (producing a set of models that each individually minimize error on their respective test years) is certainly possible to do, but would be considered overfitting. This will be assessed from your model report, and solutions that overfit will be considered as having weak statistical rigor.

The example code saves out the models from each cross-validation iteration mainly for reproducibility and diagnostic purposes.

The conceptual “final product” would be a model trained on all 20 years of the cross-validation period, and the cross-validation procedure is an estimate of the performance of that final model. We don’t do this in the example because we’re not expecting any submission based on it in the Final Stage. However, the model you submitted to the Forecast Stage is basically such a final model.
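To make that distinction concrete, here is a rough, hedged sketch contrasting the LOOCV error estimate with the single final model fit on all years. The toy data, the mean-based stand-in "model", and the error metric are illustrative assumptions, not the challenge's actual scoring:

```python
# Hypothetical sketch: LOOCV estimates the accuracy of a modeling
# APPROACH; the final PRODUCT is one model fit on all years.
# Toy data and stand-in model for illustration only.
import random
import statistics

random.seed(1)
years = list(range(2004, 2024))  # a 20-year CV period
flows = {y: [random.gauss(100, 20) for _ in range(30)] for y in years}

def fit(samples):
    # Stand-in "model": just the mean of the training targets.
    return statistics.mean(samples)

def error(model, samples):
    # Stand-in metric: mean absolute error of a constant prediction.
    return statistics.mean(abs(model - v) for v in samples)

# Cross-validation: one fold per held-out year, errors averaged to
# ESTIMATE how well this modeling approach generalizes.
fold_errors = []
for held_out in years:
    train = [v for y in years if y != held_out for v in flows[y]]
    fold_errors.append(error(fit(train), flows[held_out]))
cv_estimate = statistics.mean(fold_errors)

# Final product: ONE model fit on all 20 years, for operational use on
# future issue dates. cv_estimate is our estimate of its accuracy.
final_model = fit([v for y in years for v in flows[y]])
```

The per-fold models exist only to compute `cv_estimate`; `final_model` is the artifact you would actually deploy.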
