Model not generalizing well to test set

Hi everyone and @bull,

I have been working on this challenge for a couple of weeks now. I've implemented a number of models (RF, XGBoost, KNN, etc.) on both the original features (with very minimal feature engineering) and on engineered features. Using the same SJ/IQ split as the benchmark (https://www.drivendata.co/blog/dengue-benchmark/), I achieve a much lower validation MAE on SJ than the benchmark (13 vs. 22) and do slightly better on IQ (6.2 vs. 6.5). However, when I fit the same model on the entire training set and predict on the test set, my test MAE is higher than the benchmark's (27 vs. 25.8).

I don't believe this is a case of overfitting, since I am accounting for that with the validation set, and I don't think data leakage should be an issue either, as this model was fit on the minimally feature-engineered set. I've also tried time series cross validation to reduce bias (see the sketch below); it appeared to yield better results on the validation folds, but performed worse on the test set.

Does anyone have advice, or has anyone run into a similar problem? Thanks!
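For reference, here's roughly the time series cross validation setup I mentioned above. This is just a minimal sketch, assuming scikit-learn's TimeSeriesSplit and an XGBoost regressor; the feature columns and target name below are placeholders, not my actual pipeline.

```python
import pandas as pd
from sklearn.model_selection import TimeSeriesSplit
from sklearn.metrics import mean_absolute_error
from xgboost import XGBRegressor

def cv_mae(df: pd.DataFrame, feature_cols: list[str], n_splits: int = 5) -> float:
    """Average MAE over expanding-window time series folds for one city (SJ or IQ)."""
    X = df[feature_cols].values
    y = df["total_cases"].values  # placeholder target column name

    # TimeSeriesSplit keeps folds in chronological order: each fold trains on
    # earlier weeks and validates on the weeks that immediately follow.
    tscv = TimeSeriesSplit(n_splits=n_splits)
    maes = []
    for train_idx, val_idx in tscv.split(X):
        model = XGBRegressor(n_estimators=300, learning_rate=0.05)  # illustrative params
        model.fit(X[train_idx], y[train_idx])
        preds = model.predict(X[val_idx])
        maes.append(mean_absolute_error(y[val_idx], preds))
    return sum(maes) / len(maes)
```

I run this separately per city and then refit on the full training set before predicting on the test set, which is where the gap shows up.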

Elliot