Back to DrivenData | Blog

Country B: New Category values observed for Few columns


#1

When using the encoding obtained from train hhold dataset of country B and applying it to test dataset, new categories are seen and hence, it throws an exception. For example, second column contains new labels: [‘YRxyY’ ‘pqiPu’]

Should we expect this to happen and handle this case as part of the solution.