Is using corpus allowed?

Realdeo · April 29, 2015, 2:54pm

HI! I’m trying to develop a model, using corpus data to develop sentiment analysis on the review.

Is it ilegal? On one side of the coin flip, it’s external data which is forbidden, on other side of the coin flip, when we talk about external data, we mean about training data that is gained using anything except the given data. But corpus is not really training data, it’s more like a dictionary.

My corpus data is like dictionary, it’s contains the correct spelling and some sentiment analysis. It’s not really ‘phrases’ but ‘words’

Thank!

isms · April 29, 2015, 3:07pm

Hey @Realdeo, as long as the corpus is part of an open source package and freely available to all competitors, that is totally fine.

Thanks for asking.

Realdeo · April 29, 2015, 3:15pm

May I know which part of the rule says so. You see, I’m a paranoid guy, in case you’re wondering =p

Topic		Replies	Views
External data annotation Hateful Memes	5	1333	August 11, 2020
Are pretrained models allowed? Kelp Wanted: Segmenting Kelp Forests	2	333	December 20, 2023
two questions about the use of the Hateful Memes dataset Hateful Memes	3	808	July 13, 2020
Using datasets for academic homework	1	1702	September 8, 2015
Rules clarifications Where's Whale-do?	4	313	July 3, 2022

Is using corpus allowed?

Related topics