My best solution is an ensemble of 11 models; it scored 0.2482 on the public leaderboard and 0.2539 on the private leaderboard.
- 3 Logistic Regression
- 1 Logistic Regression with Bagging
- 1 Random Forest
- 6 Gradient Boost using xgboost
But using only 4 of those models in the ensemble, I can score 0.2483 public and 0.2541 private:
- 1 Logistic Regression
- 1 Random Forest
- 2 Gradient Boost using xgboost
- I did little feature engineering.
- For all features I treated 0 as an NA/null level.
- For some models I did feature selection.
- All models were trained using 4-fold cross-validation, so I have a good estimate of their performance.
- The ensemble was done by training a second level of xgboost on all the cross-validated predictions of the first level.
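The two-level approach described above (out-of-fold predictions from 4-fold cross-validation, then a second-level model trained on them) can be sketched roughly as follows. This is a minimal illustration, not the actual competition code: it uses toy data, default parameters, and sklearn's `GradientBoostingClassifier` as a stand-in for xgboost at the second level.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold
from sklearn.metrics import log_loss

# Toy data standing in for the competition features.
X, y = make_classification(n_samples=400, n_features=20, random_state=0)

# First-level models (illustrative choices mirroring the post's model types).
base_models = [
    LogisticRegression(max_iter=1000),
    RandomForestClassifier(n_estimators=100, random_state=0),
    GradientBoostingClassifier(random_state=0),  # stand-in for xgboost
]

# Collect out-of-fold predictions with 4-fold cross-validation,
# so every row gets a prediction from a model that never saw it in training.
kf = KFold(n_splits=4, shuffle=True, random_state=0)
oof = np.zeros((len(X), len(base_models)))

for m_idx, model in enumerate(base_models):
    for train_idx, val_idx in kf.split(X):
        model.fit(X[train_idx], y[train_idx])
        oof[val_idx, m_idx] = model.predict_proba(X[val_idx])[:, 1]

# Second level: a gradient-boosting model trained on the stacked
# cross-validated predictions (xgboost in the original solution).
stacker = GradientBoostingClassifier(random_state=0)
stacker.fit(oof, y)
print(log_loss(y, stacker.predict_proba(oof)))
```

The key point is that the second-level model is fit only on cross-validated (out-of-fold) first-level predictions; training it on in-sample predictions would leak label information and make the ensemble look better than it is.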
I will provide more details in the model documentation and then post it here.
Gilberto