I'm currently preparing the solution write-up for the organizers. I also plan to publish a short solution write-up on arXiv after that (I'll post the link in this thread if you're interested).
The code will be posted on the DrivenData GitHub a bit later, as was done in previous competitions.
My solution is based on the two libraries I prepared and already posted. The main points:
- The best model for me was a 3D DenseNet121 with input shape (96, 128, 128, 3) and a batch size of 6. I used a large dropout of 0.5 at the classification layers to prevent overfitting, and I started from ImageNet weights that I converted for the 3D variants of the nets (see the model sketch after this list).
- I trained only on the ROI part of the videos, which I extracted using the code from the forum: Python code to find the roi - #4 by Shanka )) (an illustrative crop sketch is below)
- Batches were generated in a 25/75 proportion (stalled == 1 / stalled == 0); a generator sketch is below.
- I validated using MCC at the beginning and then switched to ROC AUC (metric helper below). My validation score was around 0.96-0.98 ROC AUC.
- I started with the micro dataset and then increased the number of used videos up to ~50K (using all available stalled == 1 videos).
- The last trick, which allowed me to increase the score from 0.82 to 0.86 on the public LB, was to fine-tune only on tier1 data (it looks like the test set contains only tier1 ???)
- I applied augmentations with the volumentations library, which I reworked a bit to increase speed and add some more useful augs (see the augmentation sketch below).
- I used 5-fold cross-validation (a training-loop sketch is below). My validation MCC score wasn't the same as on the LB, but the direction was similar: improving locally gave me better results on the LB.
- Loss function: binary crossentropy. Optimizer: AdamAccumulate
- I chose the THR for converting probabilities to binary predictions using the leaderboard (so there was some chance of overfitting to the LB). I found that the optimal number of stalled videos in the test set was around 600-700 (threshold sketch below).
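
Below are a few rough sketches of the pieces mentioned in the list. Anything not named in the post (helper names, exact parameters) is a placeholder, not my real pipeline code. First, the model head: a minimal tf.keras sketch where `backbone_fn` stands in for the 3D DenseNet121 constructor with converted ImageNet weights from my 3D classification models package (the exact import depends on that package):

```python
from tensorflow.keras import layers, models

def build_model(backbone_fn, input_shape=(96, 128, 128, 3)):
    # backbone_fn is a placeholder for the 3D DenseNet121 constructor
    # (with converted ImageNet weights), not a real import path.
    backbone = backbone_fn(input_shape=input_shape, include_top=False, weights='imagenet')
    x = layers.GlobalAveragePooling3D()(backbone.output)
    x = layers.Dropout(0.5)(x)  # large dropout before the classifier to reduce overfitting
    out = layers.Dense(1, activation='sigmoid')(x)
    return models.Model(backbone.input, out)
```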
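
An illustrative ROI crop in the spirit of the forum code linked above: the target vessel is outlined in orange, so one can threshold that color in HSV and crop to its bounding box. The HSV bounds here are only a rough guess and may need tuning:

```python
import cv2
import numpy as np

def find_roi_bbox(frame):
    """Return (x, y, w, h) of the orange ROI outline in a single BGR frame."""
    hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
    mask = cv2.inRange(hsv, (5, 100, 100), (25, 255, 255))  # rough "orange" range
    pts = cv2.findNonZero(mask)
    if pts is None:
        return None
    return cv2.boundingRect(pts)

def crop_video_to_roi(frames):
    """Crop every frame of a (T, H, W, 3) video to the ROI found on the first frame."""
    bbox = find_roi_bbox(frames[0])
    if bbox is None:
        return frames
    x, y, w, h = bbox
    return frames[:, y:y + h, x:x + w, :]
```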
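
A sketch of the 25/75 batch sampling (the function name and exact sampling scheme are mine; the idea is simply that each batch slot is drawn from the stalled pool with probability 0.25):

```python
import numpy as np

def balanced_batch_indices(labels, batch_size=6, stalled_frac=0.25, seed=None):
    """Yield index batches that average ~25% stalled / ~75% non-stalled videos."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    pos = np.flatnonzero(labels == 1)
    neg = np.flatnonzero(labels == 0)
    while True:
        take_pos = rng.random(batch_size) < stalled_frac
        yield np.where(take_pos,
                       rng.choice(pos, batch_size),
                       rng.choice(neg, batch_size))
```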
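
The validation metrics are straightforward with scikit-learn; a small helper along these lines:

```python
import numpy as np
from sklearn.metrics import matthews_corrcoef, roc_auc_score

def validate(y_true, y_prob, thr=0.5):
    """MCC needs binarized predictions; ROC AUC works directly on probabilities."""
    y_pred = (np.asarray(y_prob) >= thr).astype(int)
    return {'mcc': matthews_corrcoef(y_true, y_pred),
            'roc_auc': roc_auc_score(y_true, y_prob)}
```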
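
An augmentation pipeline sketch with volumentations, loosely following the library's README; the transform set and parameters here are illustrative, not my exact final settings:

```python
from volumentations import (Compose, Rotate, Flip, RandomRotate90,
                            GaussianNoise, RandomGamma)

def get_augmentation():
    return Compose([
        Rotate((-15, 15), (-15, 15), (-15, 15), p=0.3),
        RandomRotate90((1, 2), p=0.5),
        Flip(0, p=0.5),
        Flip(1, p=0.5),
        Flip(2, p=0.5),
        GaussianNoise(var_limit=(0, 5), p=0.2),
        RandomGamma(gamma_limit=(80, 120), p=0.2),
    ], p=1.0)

aug = get_augmentation()
# video: (D, H, W, C) numpy array
# augmented = aug(image=video)['image']
```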
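
A rough outline of the cross-validation and compile step. `AdamAccumulate` in the real pipeline is a custom Adam variant with gradient accumulation (to emulate a larger effective batch), so plain Adam stands in here; `video_paths`, `labels`, and `backbone_fn` are placeholders tied to the sketches above:

```python
import tensorflow as tf
from sklearn.model_selection import KFold

kf = KFold(n_splits=5, shuffle=True, random_state=42)
for fold, (train_idx, valid_idx) in enumerate(kf.split(video_paths)):
    print(f'Fold {fold}')
    model = build_model(backbone_fn)  # see the model sketch above
    # Stand-in for AdamAccumulate: plain Adam without gradient accumulation.
    model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),
                  loss='binary_crossentropy',
                  metrics=[tf.keras.metrics.AUC(name='roc_auc')])
    # ... fit with the balanced generator on train_idx, validate on valid_idx ...
```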
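
Finally, a sketch of picking the THR from the expected number of stalled test videos. The 650 target is just the middle of the 600-700 estimate; in practice the final value was tuned against the public LB:

```python
import numpy as np

def threshold_for_count(probs, target_positives=650):
    """Return the threshold that labels roughly `target_positives` videos as stalled."""
    probs_sorted = np.sort(np.asarray(probs))[::-1]
    k = min(target_positives, len(probs_sorted)) - 1
    return probs_sorted[k]

# stalled = (test_probs >= threshold_for_count(test_probs)).astype(int)
```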