I’m trying to set up my cross validation. I’m using stratified K-Fold and I wrote a mean log loss func.
I’m getting really different values in my CV and the leaderboard.
I’d appreciate any suggestions!
I’m not sure if my mean log loss is correct, but this is what I have written:
def a_mean_log_loss(y_true_a, y_pred_a, y_true_b, y_pred_b, y_true_c, y_pred_c): # log_loss is from sklearn.metrics a_logloss = log_loss(y_true_a, y_pred_a) b_logloss = log_loss(y_true_b, y_pred_b) c_logloss = log_loss(y_true_c, y_pred_c) # average of each countries log loss return np.sum([a_logloss, b_logloss, c_logloss])/3