Hi @arch! For (1), it’s not exactly clear what this would entail, but as a reminder derivative works aren’t allowed from the dataset. You can refer to the data license terms for more information. On (2) pruning the samples used for training is totally fine. Hope this helps!