Spitballing for fun?

Natalie-w · April 28, 2020, 1:31am

Hi all, since this competition is just for practice, I was wondering if anyone wanted to spitball ideas together or potentially group up? I’ve gotten to around 0.74 accuracy with just a lgbm, basically zero feature engineering, and super minimal finetuning, so I’m sure there’s a lot more to be done!

Kritiker · April 28, 2020, 11:15pm

Hi there, what about your auc value ?

Natalie-w · April 30, 2020, 6:14am

Hi, it’s 0.8065. What’s yours like?

Kritiker · May 1, 2020, 9:24pm

Hi, my results:
Multi-class area under the curve: 0.8706 but the f1-score values are strange:
precision recall f1
low 0.4165427 0.4153985 0.4159698
medium 0.6583504 0.6610639 0.6597044
high 0.5399637 0.5366671 0.5383103
low is representing the level 1
medium is representing the level 2
high is representing the level 3

Onotseike · May 14, 2020, 11:39am

Hi Natalie, I am trying to use the LightGBM ranker but I am having a hard time with the group data field and eval_group data. Do you have any idea on how to generate it.

booyah1211 · May 19, 2020, 7:02pm

Hi Natalie-w, using random forest and some minor feature engineering like target encoding, I reached an F1 score of 0.7462. Are you still looking to team up?

fasihA · June 21, 2020, 8:47am

Hi Natalie,
Saw your post i have used xGBoost and find a score of 0.7468 without any feature engg. and model tuning. I was wondering if you are still interested in sharing some ideas.
Thanks

naveen_9697 · September 13, 2020, 11:24am

Hi Ms. Natalie,
I have used LightGBM with no feature engineering and no hyper-parameter tuning(used the values that usually give me good results) whatsoever. And didn’t even treat this as a imbalanced classification problem and still got score of 0.7449. I am interested in more ideas.

capac · September 16, 2020, 11:17am

Hello,
That’s interesting. I’m using XGBoost as well but with fine tuning I’ve achieved 0.7427. What have you done differently? I’m treating all variables except five as categorical.

florescorreia · September 30, 2020, 7:15pm

Hi all,
With the cleanest lgbm implementation I only get to f1_score=0.70
By cleanest I mean no hyperparam fine tuning, just transformation of categorical variables into ‘categorical’, and scaling of numerical variables: nothing else.
You guys seem to be around f1_score=0.74. What is this difference based on? If anyone out there wants to clarify it to me, that would be appreciated – I’m not trying to win anything, just trying to get to the bottom of this kind of stuff.
Thanks.

Topic		Replies	Views
Share the knowledge Pover-T Tests: Predicting Poverty	29	2714	March 4, 2018
Any xgbooster here? Senior Data Science: Safe Aging with SPHERE	9	1649	July 27, 2016
Classification Rate - XGB Model Pump it Up: Data Mining the Water Table	4	2577	October 19, 2016
Richter's Predictor: Modeling Earthquake Damage	2	662	February 16, 2020
About the Richter's Predictor category Richter's Predictor	8	2271	January 23, 2024

Spitballing for fun?

Related topics