How good can you do if you ignore the Yelp data?

jhalverson · July 1, 2015, 7:28pm

I acknowledge that the Yelp data is valuable. However, I’m curious if anyone has had success in ignoring the Yelp data and using only training_labels.txt? Has anyone beat the linear regression benchmark score of 1.1386 using this approach?

qwang · July 2, 2015, 9:58am

For the phase 1 test set, my first submission was exactly that if I remember correctly, and I just barely beat the benchmark. However, that only works for the phase 1 test set and would not make any sense otherwise.

Topic		Replies	Views
Yelp Review Dataset Keeping it Fresh	1	745	March 22, 2021
Questions about data set Keeping it Fresh	3	2437	May 28, 2015
Phase 2 Submissions Cheating? Hateful Memes	27	1953	December 9, 2020
Test Labels are missing Flu Shot Learning	2	840	July 29, 2021
Power Laws Forecasting: Where is test data? Power Laws	1	823	March 18, 2018

How good can you do if you ignore the Yelp data?

Related topics