
Household train data missing "poor" column


The problem description says that the last column name of the household-level training data should be “poor”. The last column for me is “country”. The individual-level training data is fine. Any idea why this is?


Have you tried selecting the column by column name (instead of just looking for the last column)?

(In pandas this can be done with df.poor or df['poor'] where df is your DataFrame.)
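As a rough sketch of that check: the snippet below looks for the column by name and reports where it sits, rather than assuming it is last. The tiny inline CSV is a made-up stand-in for the household training file; `id` and `feat_a` are hypothetical column names, and only `poor` and `country` come from this thread.

```python
import io

import pandas as pd

# Hypothetical stand-in for the household training CSV.
# Only "poor" and "country" are names mentioned in this thread.
csv_text = "id,feat_a,poor,country\n1,0.5,True,A\n2,1.2,False,A\n"
df = pd.read_csv(io.StringIO(csv_text))

# Check for the column by name instead of by position.
has_poor = "poor" in df.columns

# See which column is actually last, and where "poor" sits.
last_column = df.columns[-1]
poor_position = df.columns.get_loc("poor")

print(has_poor, last_column, poor_position)
```

With the real file you would replace the `io.StringIO(...)` argument with the path to your downloaded CSV.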


It is there. It's column FE if you open the file in Excel.


It is there in the source training data, but it does get dropped at some point in the benchmark Jupyter Notebook.
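The notebook's actual code isn't quoted in this thread, so the following is only a generic illustration of how a label column can disappear from a working DataFrame while still being present in the source data. It assumes the common pattern of splitting labels from features with `drop`; the frame contents and the `feat_a` column are made up.

```python
import pandas as pd

# Hypothetical stand-in for the raw household training data.
raw = pd.DataFrame(
    {"feat_a": [0.5, 1.2], "poor": [True, False], "country": ["A", "A"]}
)

# Splitting labels from features: drop() returns a new frame,
# so "poor" is gone from `features` but still present in `raw`.
features = raw.drop(columns=["poor"])
labels = raw["poor"]

# Compare column sets to see what was removed along the way.
dropped = sorted(set(raw.columns) - set(features.columns))
print(dropped)
```

Comparing the columns of the freshly loaded data against the DataFrame you are inspecting is a quick way to tell whether the column was missing from the file or removed by an earlier cell.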