Scaling data with one or multiple scalers?

kyz682 · October 26, 2018, 5:10pm

The LSTM benchmark model fits a new scaler for each series it trains on. Can anyone explain the advantages and disadvantages of fitting a new scaler per series, as opposed to scaling the entire dataset with a single scaler?

My gut instinct is that a per-series scaler will not preserve the differences in magnitude between series. For example, the peak of a series that tops out at 4 MWh would be scaled to the same value as the peak of a series that tops out at 20 kWh.
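
To make the concern concrete, here is a minimal sketch in plain Python. It assumes min-max scaling (the thread does not say which scaler the benchmark uses), and the two series, their names, and their values are made up for illustration.

```python
def min_max_scale(values, lo, hi):
    """Map values linearly so that lo -> 0.0 and hi -> 1.0."""
    span = hi - lo
    return [(v - lo) / span for v in values]


# Hypothetical consumption values in kWh (not from the competition data).
series = {
    "large": [1000.0, 2500.0, 4000.0],  # peaks at 4 MWh
    "small": [5.0, 12.5, 20.0],         # peaks at 20 kWh
}

# Option 1: one scaler per series, fitted on that series' own min and max.
per_series = {
    name: min_max_scale(vals, min(vals), max(vals))
    for name, vals in series.items()
}

# Option 2: one scaler for the whole dataset, fitted on the global min and max.
all_values = [v for vals in series.values() for v in vals]
global_lo, global_hi = min(all_values), max(all_values)
global_scaled = {
    name: min_max_scale(vals, global_lo, global_hi)
    for name, vals in series.items()
}

for name in series:
    print(name, "per-series:", per_series[name], "global:", global_scaled[name])
```

With per-series scalers both series end up with the same scaled values, so the difference in magnitude is lost. With a single global scaler the magnitude is kept, but the small series is squeezed into a tiny range near zero.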

LastRocky · October 26, 2018, 6:03pm

The reason for using multiple scalers is to keep the high-consumption series ids from dominating training, since the performance metric is NMAE.
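
A rough sketch of why that matters, in plain Python. The exact NMAE formula used by the competition is not given in this thread, so the normalizer below (the mean of each series' true values) is an assumption, and the numbers are made up.

```python
def mae(y_true, y_pred):
    """Plain mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def normalized_mae(y_true, y_pred):
    """MAE divided by a per-series normalizer.

    Assumption: the normalizer is the mean of the series' true values.
    The competition's actual definition may differ.
    """
    return mae(y_true, y_pred) / (sum(y_true) / len(y_true))


# Hypothetical true values; every prediction is 10% too high.
large_true = [1000.0, 2000.0, 3000.0]
large_pred = [1100.0, 2200.0, 3300.0]
small_true = [10.0, 20.0, 30.0]
small_pred = [11.0, 22.0, 33.0]

print("raw MAE:", mae(large_true, large_pred), mae(small_true, small_pred))
print(
    "normalized MAE:",
    normalized_mae(large_true, large_pred),
    normalized_mae(small_true, small_pred),
)
```

Under the raw error the large series contributes 100 times as much as the small one for the same relative mistake, while under the normalized error both count equally. Training on per-series scaled data puts the loss on a similar footing to a metric of this kind.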
