Share your approach!

payback · December 28, 2017, 2:21pm

My score is 0.7921, and my code is here.

samirchar · April 27, 2018, 1:14pm

Hi @zlatankr! You mentioned that you had a bug in your code that caused more overfitting. Could you please elaborate a bit on this? I'm also overfitting and not sure why. Thanks!

dcart · November 20, 2018, 11:49am

Hello,

I am a novice in machine learning, currently ranked 104 on the blood donation challenge.

I am planning to add new features to my data. My question: when we add a feature to the training data, do we also need to add the same feature column to the holdout data?

Apologies if my question is not clear or needs clarification.

Thanks

washier · November 21, 2018, 8:35am

Hi dcart,

Yes, your hold-out data set must always have the same structure as the training data set, so any feature column you add to the training data has to be added to the hold-out data as well.
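
One way to keep the two data sets in sync is to put the feature engineering in a single function and call it on both. The snippet below is only a minimal sketch with pandas: the column names (`n_donations`, `months_since_first`, `donated`) and the `donations_per_month` feature are made up for illustration and are not the competition's actual columns.

```python
import pandas as pd


def add_features(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with one extra, purely illustrative feature."""
    out = df.copy()
    # Hypothetical feature: average number of donations per month.
    out["donations_per_month"] = out["n_donations"] / out["months_since_first"]
    return out


# Toy stand-ins for the training and hold-out files (assumed column names).
train = pd.DataFrame({
    "n_donations": [2, 10, 4],
    "months_since_first": [4, 20, 16],
    "donated": [1, 0, 0],  # label column, only present in the training data
})
holdout = pd.DataFrame({
    "n_donations": [6, 1],
    "months_since_first": [12, 2],
})

# The same function runs on both sets, so both get the same new column.
train_fe = add_features(train)
holdout_fe = add_features(holdout)

# Sanity check: apart from the label, the columns match exactly.
feature_cols = [c for c in train_fe.columns if c != "donated"]
assert list(holdout_fe.columns) == feature_cols
```

Because the transformation lives in one place, a feature cannot end up in the training data without also appearing in the hold-out data.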

huytofu · January 24, 2019, 5:03am

Score = 0.8125. Current rank = 794.
Cleaned the data a little: replaced some 0 and NaN values.
Created 1 new feature.
Transformed all categorical features with fewer than 100 categories.
Dropped some highly correlated features.
Used Random Forest & Gradient Boosting Tree Army. The Army performed much better.
Watched out for overfitting.
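
Two of the steps above (transforming low-cardinality categorical features and dropping highly correlated ones) could look roughly like the sketch below. The post does not say which encoding or correlation cutoff was used, so the one-hot encoding, the 0.95 threshold, the helper names, and the toy columns are all assumptions for illustration.

```python
import numpy as np
import pandas as pd


def encode_low_cardinality(df: pd.DataFrame, max_categories: int = 100) -> pd.DataFrame:
    """One-hot encode non-numeric columns with fewer than max_categories values."""
    cat_cols = [
        c for c in df.columns
        if not pd.api.types.is_numeric_dtype(df[c])
        and df[c].nunique() < max_categories
    ]
    return pd.get_dummies(df, columns=cat_cols, dtype=int)


def drop_correlated(df: pd.DataFrame, threshold: float = 0.95) -> pd.DataFrame:
    """Drop each numeric column that is highly correlated with an earlier column."""
    corr = df.corr(numeric_only=True).abs()
    # Keep only the upper triangle so every pair is examined once.
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    to_drop = [c for c in upper.columns if (upper[c] > threshold).any()]
    return df.drop(columns=to_drop)


# Toy data with made-up columns; height_cm duplicates height on another scale.
df = pd.DataFrame({
    "height": [1.0, 2.0, 3.0, 4.0],
    "height_cm": [100.0, 200.0, 300.0, 400.0],
    "population": [50, 10, 20, 40],
    "basin": ["a", "b", "a", "c"],
})

encoded = encode_low_cardinality(df)
reduced = drop_correlated(encoded)
print(list(reduced.columns))
```

Whatever transformation is chosen, it should be applied to the unlabeled set in the same way so that both sets end up with the same columns.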

Remember, when choosing your model: split your labeled data into a train set (which can be split further for cross-validation) and a test set. In the end, though, retrain your best model on the entire labeled set before making the final predictions on the unlabeled data.
Also, please double-check that the labeled set used for training and the unlabeled set have an identical set of independent features!
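
The workflow described above can be sketched with scikit-learn as follows. This is only an illustration on synthetic data from `make_classification`; the two candidate models, their settings, and the variable names are assumptions, not the exact setup behind the score reported in this post.

```python
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score, train_test_split

# Synthetic stand-in: 300 labeled rows plus 20 rows treated as unlabeled.
X_all, y_all = make_classification(n_samples=320, n_features=8, random_state=0)
X_labeled, y_labeled = X_all[:300], y_all[:300]
X_unlabeled = X_all[300:]

# 1. Hold out a test set from the labeled data.
X_train, X_test, y_train, y_test = train_test_split(
    X_labeled, y_labeled, test_size=0.2, random_state=0, stratify=y_labeled
)

# 2. Compare candidate models with cross-validation on the train part only.
candidates = {
    "random_forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "gradient_boosting": GradientBoostingClassifier(random_state=0),
}
cv_scores = {
    name: cross_val_score(model, X_train, y_train, cv=5).mean()
    for name, model in candidates.items()
}
best_name = max(cv_scores, key=cv_scores.get)

# 3. Check the chosen model once on the untouched test set.
best_model = clone(candidates[best_name]).fit(X_train, y_train)
test_accuracy = best_model.score(X_test, y_test)

# 4. Verify the feature sets match, then retrain on ALL labeled data.
assert X_labeled.shape[1] == X_unlabeled.shape[1]
final_model = clone(candidates[best_name]).fit(X_labeled, y_labeled)
predictions = final_model.predict(X_unlabeled)
```

The test set is used only once, after model selection, so its accuracy stays an honest estimate; the final refit simply gives the chosen model more data to learn from.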

LOVBNGEA_epsi · June 27, 2019, 6:41am

Hello,
We are a team of 3 students and we would like to share our approach.
Our current score is 0.7991.
Here is our code:

Brenda_DS · December 27, 2021, 5:42pm

I currently hold rank 504 with a score of 0.8235. I used an ensemble of four tuned models to get to my final score.

I have written 4 Medium articles on my approach to EDA, data cleaning, feature engineering and modelling, which you can find here: Brenda Loznik – Medium

All code is available on my Github: BrendaLoznik/waterpumps (github.com)
