Country B: New Category values observed for Few columns

suman.shishir · January 8, 2018, 4:44pm

When using the encoding obtained from train hhold dataset of country B and applying it to test dataset, new categories are seen and hence, it throws an exception. For example, second column contains new labels: [‘YRxyY’ ‘pqiPu’]

Should we expect this to happen and handle this case as part of the solution.

Topic		Replies	Views
One hot encoding / test data unique values Pushback to the Future Challenge	1	248	April 10, 2023
New Data in Test Set: date_recorded: year 2001 Pump it Up: Data Mining the Water Table	5	1734	January 27, 2020
Household train data missing "poor" column Pover-T Tests: Predicting Poverty	3	1389	February 1, 2018
Data leak in country C Pover-T Tests: Predicting Poverty	7	1226	February 20, 2018
Share the knowledge Pover-T Tests: Predicting Poverty	29	2896	March 4, 2018

Country B: New Category values observed for Few columns

Related topics