Selection of train set

Hi,

I have two questions.

  1. Do we have to use the whole training set (766) for every label?

  2. Instead of treating every sample labeled 0 as a negative in the training set, can we use only the labels with low correlation to the target as the 0 class?

For example,

Can we build the training data for “basalt” like this?

train ones => basalt
train zeros => oxalate + sulfide (min correlation with basalt)
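
To make the proposal concrete, here is a minimal sketch of that selection. The function name `select_binary_subset`, the row layout (one dict of 0/1 labels per sample), and the toy rows are all assumptions for illustration, not part of the competition data or tooling.

```python
def select_binary_subset(label_rows, target, negatives):
    """Return (indices, y) for a target-vs-chosen-negatives training set.

    label_rows: list of dicts mapping label name -> 0/1 (hypothetical layout).
    target:     label treated as the positive class.
    negatives:  labels whose positive samples are used as the 0 class.
    """
    indices, y = [], []
    for i, row in enumerate(label_rows):
        if row.get(target, 0) == 1:
            indices.append(i)
            y.append(1)
        elif any(row.get(name, 0) == 1 for name in negatives):
            indices.append(i)
            y.append(0)
        # Rows carrying neither the target nor a chosen negative are dropped.
    return indices, y


# Made-up rows, only to show the behaviour.
rows = [
    {"basalt": 1, "oxalate": 0, "sulfide": 0},
    {"basalt": 0, "oxalate": 1, "sulfide": 0},
    {"basalt": 0, "oxalate": 0, "sulfide": 0},
    {"basalt": 0, "oxalate": 0, "sulfide": 1},
    {"basalt": 1, "oxalate": 1, "sulfide": 0},
]
idx, y = select_binary_subset(rows, "basalt", ["oxalate", "sulfide"])
print(idx, y)  # [0, 1, 3, 4] [1, 0, 0, 1]
```

One thing to keep in mind with this kind of subset: the resulting model never sees the dropped rows during training, yet it will still have to score samples like them at prediction time.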

Hi @onurk83,

You may train your model and prepare your training data in any way you choose when using the train and validation set data. There are no restrictions on how you use the provided train and validation sets.

Your model needs to be able to perform inference on new data one sample at a time and make predictions on the target label classes in accordance with the submission format.
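
As a rough illustration of that requirement, the sketch below scores a single sample against every label, however each per-label model was trained. The `predict_one` helper, the stand-in lambda "models", the `peak` feature, and the dict output are all hypothetical; the actual output layout is whatever the submission format specifies.

```python
def predict_one(sample, models, label_order):
    """Score one sample against every target label.

    models: hypothetical mapping label -> callable returning a probability.
    """
    return {label: float(models[label](sample)) for label in label_order}


# Stand-in "models"; real ones would be trained classifiers.
models = {
    "basalt": lambda s: 0.9 if s["peak"] > 0.5 else 0.1,
    "oxalate": lambda s: 0.2,
    "sulfide": lambda s: 0.05,
}
label_order = ["basalt", "oxalate", "sulfide"]

# Inference runs one sample at a time, with no access to other test samples.
for sample in [{"peak": 0.8}, {"peak": 0.1}]:
    print(predict_one(sample, models, label_order))
```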

Understood. Thanks for the explanation.