Improve LSTM model

adalseno · April 17, 2020, 5:05pm

Hi, this nice competition gave me the opportunity to study and improve my skills on time series data. I tried both Prophet and ARIMA (actually SARIMAX) and now I’m trying with deep learning models, specifically LSTM. The model seems to perform pretty well on test data but it performs poorly on submission set. Any idea on how to improve it?
Schermata 2020-04-17 alle 19.02.42

I’m using 11 time steps (n_input), with just a subset of features (7 + total cases: n_features = 8), with a batch of 26 and 130 units (one bidirectional layer with relu activation and 1 dense):
model = Sequential()
model.add(Bidirectional(LSTM(130, activation=‘relu’), input_shape=(n_input, n_features)))`
model.add(Dropout(0.1))
model.add(Dense(1))
model.compile(optimizer=‘adam’, loss=‘mae’)
model.fit(x=generator, epochs=65, validation_data=generator_test, shuffle=False)

amithitlab · April 26, 2020, 11:09pm

I tried normal XGBoostRegressor and it works fine. I found that feature selection and engineering are impacting the results immensely. Careful selection can provide you with good accuracy on test set.

pranayrao · May 5, 2020, 3:30am

did u drop many features?

pranayrao · May 5, 2020, 3:31am

i tried implementing xgboost on the test data in competition & got 27 mae

adalseno · May 5, 2020, 9:24am

Hi, thank you, I tried XGBoost too with feature selection (without, till now, any feature engineering but I had better results with Negative Binomial; well better than XGBoost, actually average results 25.3438). I was curious about LSTM since it seems to perform very well on test set and poorly on submission and I wanted to understand why.

adalseno · June 1, 2020, 10:20am

Hi tried also with a boosting model with some feature engineering. The results on the test set seem very good (MAE around 4 for San Juan), but when I submit it performs poorly. I really don’t understand what’s wrong.
Schermata 2020-06-01 alle 12.16.54

eeorenstein · August 26, 2020, 6:34pm

Hi everyone and @adalseno. I have a similar issue where my RF leads to much better scores than the benchmark model (Negative Binomial model with feature selection, https://www.drivendata.co/blog/dengue-benchmark/) on my validation/holdout set but performs worse (27) than the benchmark (25.8) on the test set. I split my train/validation set in the same way that the benchmark does so I could better compare my model to theirs. I performed very little feature engineering so that, again, it matches the benchmark’s methods and leads to a better comparison. Does anyone know why this is happening?

MathiasTiberghien · July 22, 2021, 9:15am

Hi, You probably don’t need the answer anymore but this seems to me to be a classic case of overfitting.
RNN are meant to deal with a lot of data and you have a really small training dataset.

Topic		Replies	Views
Model not generalizing well to test set DengAI Competition	0	606	September 1, 2020
LSTM t-1 feature DengAI Competition	0	579	October 20, 2020
22nd place Non ML submission looking for teammate Cold Start Energy Forecasting	2	805	September 17, 2018
Congratulations to the winners! Predict Wind Speeds of Tropical Storms	7	749	February 5, 2021
Congrats and 1st place brief summary Predict Wind Speeds of Tropical Storms	5	619	February 7, 2021

Improve LSTM model

Related topics