
Household train data missing "poor" column


#1

The problem description says that the last column name of the household-level training data should be “poor”. The last column for me is “country”. The individual-level training data is fine. Any idea why this is?


#2

Have you tried selecting the column by column name (instead of just looking for the last column)?

(In pandas this can be done with df.poor or df['poor'] where df is your DataFrame.)
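
For example, here is a minimal sketch of checking for the column by name rather than by position. The inline CSV and its column names (`id`, `feat_a`, `feat_b`) are made up for illustration and only stand in for the real household training file, which has many more columns:

```python
import io

import pandas as pd

# Hypothetical stand-in for the household training data; in this toy file
# "poor" is present but is not the last column.
csv_text = """id,feat_a,feat_b,poor,country
1,0.5,x,True,A
2,1.2,y,False,A
3,0.7,x,True,A
"""

df = pd.read_csv(io.StringIO(csv_text), index_col="id")

# Check for the label by name instead of assuming it is the last column.
has_poor = "poor" in df.columns
poor_position = df.columns.get_loc("poor")  # zero-based position among the columns
last_column = df.columns[-1]

# Split labels and features by name, so column order does not matter.
labels = df["poor"]
features = df.drop(columns=["poor"])

print(has_poor, poor_position, last_column)
```

If `"poor" in df.columns` is true, the column is there regardless of where it sits in the file.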


#3

It is there. It is column FE if you open the file in Excel.


#4

It is there in the source training data, but it does get dropped at some point in the benchmark Jupyter Notebook.