Feature Engineering (Genetic Eng. Attribution)

dataintel · September 29, 2020, 4:17am

Hi all,

I’ve just joined the competition late in the game. This is my first Data Driven competition; I’ve done a few Kaggles.

On Kaggle I know there are a lot of openly shared notebooks an insights, but I’m not sure they do that here (?). Just thought I’d double check if I’ve missed anything that’s been revealed thusfar.

I’ve tried a few interactions of key variables, but nothing really jumps out. I noticed that - at least in the training data - the original row order seems important, but I’m not sure if what information that might be a proxy for or if it’s generalizable to the scoring data set (I haven’t made a LB submission yet to test it out that way, but it seems to hold up someone in cross-validation and hold out samples).

Cheers!

KieranLitschel · September 30, 2020, 8:36pm

It says in the rules for this competition that you’re disqualified if you share any code outside your team, so that’s why you won’t find any notebooks. Good luck

dataintel · October 1, 2020, 9:40pm

That makes sense, thank you.

I guess I knew we weren’t supposed to share code, but I didn’t realize that’s why the discussions were so quiet.

Topic		Replies	Views
About the Genetic Engineering Attribution category Genetic Engineering Attribution	2	842	August 20, 2020
GEAC: Update on Results & Data Usage Genetic Engineering Attribution	0	423	January 26, 2021
Top-ranked solutions Genetic Engineering Attribution	4	751	May 30, 2021
External data: a question for the orgenizers Genetic Engineering Attribution	0	415	October 9, 2020
Looking to Join a Team Genetic Engineering Attribution	10	923	September 10, 2020

Feature Engineering (Genetic Eng. Attribution)

Related topics