Track B output format

hhcho · January 24, 2023, 12:26am

We have some last-minute clarification questions regarding the output format for Track B (pandemic):

Does the evaluation pipeline depend on the fact that the predicted scores are between 0 and 1? Or will we receive the same score as long as the ranking is equivalent (which is expected for AUPRC)?
Is each federation unit’s output evaluated separately or are all output predictions concatenated across units for a global AUPRC score?

Thank you,
Hoon

jayqi · January 24, 2023, 1:44am

I believe confidence scores outside of [0.0, 1.0] should technically work and give you the same AUPRC score. However, it is not officially documented or supported, so you would be doing so at your own risk.
For the federated solution evaluation, the outputs from all federation units are concatenated together and used to calculate a single global AUPRC score. This is done separately for each of the three partitioning scenarios, so your federated solution will produce three AUPRC scores.
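
To make both points concrete, here is a minimal sketch in plain Python. It is not the challenge's evaluation code: the `average_precision` helper is a hypothetical stand-in that assumes AUPRC is computed as step-wise average precision, and the two "federation units" with their labels and scores are invented purely for illustration. It concatenates the per-unit outputs before scoring, then checks that a strictly increasing transform (here the logit, which maps scores outside [0, 1]) leaves the result unchanged.

```python
import math


def average_precision(y_true, scores):
    """Step-wise area under the precision-recall curve.

    Hypothetical helper for illustration only. Tied scores are treated
    as a single threshold.
    """
    total_pos = sum(y_true)
    if total_pos == 0:
        raise ValueError("need at least one positive label")
    # Rank predictions from highest to lowest score.
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ap, tp, prev_recall = 0.0, 0, 0.0
    i = 0
    while i < len(order):
        j = i
        # Consume every prediction that shares the current score.
        while j < len(order) and scores[order[j]] == scores[order[i]]:
            tp += y_true[order[j]]
            j += 1
        recall = tp / total_pos
        precision = tp / j
        ap += (recall - prev_recall) * precision
        prev_recall = recall
        i = j
    return ap


# Invented outputs from two federation units: (labels, scores).
unit_a = ([1, 0, 0], [0.9, 0.4, 0.2])
unit_b = ([0, 1, 0, 0], [0.8, 0.7, 0.1, 0.3])

# Global evaluation: concatenate all units, then score once.
labels = unit_a[0] + unit_b[0]
scores = unit_a[1] + unit_b[1]
ap_global = average_precision(labels, scores)

# A strictly increasing transform preserves the ranking, so the score
# is unchanged even though the values now fall outside [0, 1].
logits = [math.log(s / (1.0 - s)) for s in scores]
ap_logit = average_precision(labels, logits)

# Scoring each unit separately and averaging gives a different number.
ap_mean_of_units = (average_precision(*unit_a) + average_precision(*unit_b)) / 2

print(ap_global, ap_logit, ap_mean_of_units)
```

Because the global score depends on how predictions from different units rank against each other, scores need to be comparable across units, not just well ordered within each unit.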
