My name is Lauren and I’m interested in creating or joining a team for this competition. I obtained my PhD in Molecular Biology with a focus in Bioinformatics from UCI in 2023 and have been working as a Data Scientist since then. I am very familiar with ML classification algorithms, transformers, and GitHub, and I have some experience with Docker (though not as much as I’d like). I am happy to share the work I’ve done so far, along with some ideas I have for improving model performance. I will be traveling for the holidays, but will be back on the 26th.
Here is my current progress for the competition:
- I was able to install all the necessary Python packages and run the Community code Jupyter notebook successfully. I also wrote code to automate the data preprocessing, train-test splitting, grid search parameter optimization, and comparison of model performance across several algorithms that are well suited to this type of dataset (i.e., a large number of features). I have a couple of ideas to improve model performance, but I took a break to do some background reading on audio classification with deep learning models.
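
A workflow like the one described above can be sketched roughly as follows. This is a minimal illustration, not the actual competition code: it assumes scikit-learn is installed, uses synthetic data from `make_classification` as a stand-in for the competition features, and the function name `compare_models`, the three classifiers, and the parameter grids are all hypothetical choices.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def compare_models(X, y, random_state=0):
    """Grid-search several classifiers and report CV and held-out accuracy."""
    # Stratified split so both sets keep the same class balance.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=random_state
    )

    # Hypothetical candidates and grids; swap in whatever suits the real data.
    candidates = {
        "logreg": (
            LogisticRegression(max_iter=2000),
            {"clf__C": [0.01, 0.1, 1.0]},
        ),
        "svm": (
            SVC(),
            {"clf__C": [0.1, 1.0, 10.0], "clf__kernel": ["linear", "rbf"]},
        ),
        "random_forest": (
            RandomForestClassifier(random_state=random_state),
            {"clf__n_estimators": [50, 100], "clf__max_features": ["sqrt", "log2"]},
        ),
    }

    results = {}
    for name, (clf, grid) in candidates.items():
        # Scaling lives inside the pipeline so it is fit on training folds only.
        pipe = Pipeline([("scale", StandardScaler()), ("clf", clf)])
        search = GridSearchCV(pipe, grid, cv=5, scoring="accuracy")
        search.fit(X_train, y_train)
        results[name] = {
            "best_params": search.best_params_,
            "cv_score": search.best_score_,
            "test_score": search.score(X_test, y_test),
        }
    return results


# Synthetic stand-in: few samples, many features (assumption, not real data).
X, y = make_classification(
    n_samples=200, n_features=500, n_informative=20, random_state=0
)
results = compare_models(X, y)
for name, res in results.items():
    print(name, res["best_params"], round(res["cv_score"], 3), round(res["test_score"], 3))
```

Keeping the scaler inside the `Pipeline` means each cross-validation fold is preprocessed independently, which avoids leaking held-out information into the grid search.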