Specifying different hyperparameters for different federated scenarios

kzliu · January 25, 2023, 7:58pm

Would it be possible (or encouraged) to specify different hyperparameters (e.g., number of training rounds) for the different federated scenarios (scenarioXX)? The main motivation is that the best-performing settings may depend on the number of clients, client dataset sizes, etc. The log files of federated runs suggest that some of this information, such as the number of clients, is known; is there any way to access it at run time?

Thanks!

jayqi · January 25, 2023, 9:59pm

Hi @kzliu,

Setting different hyperparameters for different federation scenarios is permitted.

The number of partitions should be apparent, but here they are for reference.

Track A: Financial Crime Prevention

SWIFT + 2 bank partitions
SWIFT + 4 bank partitions
SWIFT + 9 bank partitions

Track B: Pandemic Forecasting

2 partitions
5 partitions
10 partitions

As documented, partitioning is done by taking the full dataset (that is available to the centralized evaluation), and dividing it up among the partitions in a fairly even way, subject to partitioning boundary constraints specific to each track. These partitioning scenarios are the same between the evaluation dataset and the smoke test dataset (i.e., the data from the centralized case is divided up into the same numbers of partitions for each scenario).
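
Since the scenarios differ in partition count, one way to organize per-scenario settings is a lookup table keyed by track and number of partitions. The following is only a minimal sketch under stated assumptions: `Hyperparams`, `HYPERPARAMS_BY_SCENARIO`, and `select_hyperparams` are hypothetical names, the values are untuned placeholders, and how the partition count is obtained at run time is not covered in this thread, so it is left as a plain function argument.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hyperparams:
    """Hypothetical container for per-scenario settings."""
    num_rounds: int
    learning_rate: float


# Placeholder values for illustration only; not tuned or recommended.
DEFAULT = Hyperparams(num_rounds=10, learning_rate=0.01)

# Keys are (track, number of partitions as listed in this thread).
# For Track A the count refers to bank partitions, excluding SWIFT.
HYPERPARAMS_BY_SCENARIO = {
    ("A", 2): Hyperparams(num_rounds=5, learning_rate=0.05),
    ("A", 4): Hyperparams(num_rounds=10, learning_rate=0.02),
    ("A", 9): Hyperparams(num_rounds=20, learning_rate=0.01),
    ("B", 2): Hyperparams(num_rounds=5, learning_rate=0.05),
    ("B", 5): Hyperparams(num_rounds=10, learning_rate=0.02),
    ("B", 10): Hyperparams(num_rounds=20, learning_rate=0.01),
}


def select_hyperparams(track: str, num_partitions: int) -> Hyperparams:
    """Return the settings for a scenario, falling back to DEFAULT.

    How num_partitions is discovered at run time is an assumption left
    to the caller; this function only performs the lookup.
    """
    return HYPERPARAMS_BY_SCENARIO.get((track, num_partitions), DEFAULT)
```

For example, `select_hyperparams("A", 4)` returns the entry for the SWIFT + 4 bank partitions scenario, and an unlisted combination falls back to `DEFAULT`, so an unexpected partition count does not crash the run.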
