Track A - Federated Learning Setting

Is it true that the SWIFT knows everything that each bank knows except the flag of the accounts? If it is the case, why does the SWIFT need the banks? SWIFT can train a centralized model and use it without sharing the model with the banks. If so, there would be no privacy risk at all.

Hi @khangtran97. There is information that the banks have that are relevant signals for the anomaly detection. A model that only has access to SWIFT’s data (which, based on how this use case is structured, is not a centralized model), does not have complete access to all relevant signals, and accordingly is expected to not be as accurate.

Thank you for you response. However, the question is still there. Is it true that the SWIFT knows everything that each bank knows except the flag of the accounts?

@khangtran97

Our role as organizers is to provide conceptual guidance for you to have the necessary information to solve the task. Conceptually, you are have a correct understand that, with the exception of account flags as you have noted, the account information held in both the SWIFT and bank datasets are the same type of account information.

For empirical conclusions regarding the dataset, participants are responsible for analyzing the data and for testing hypotheses that you may have about how to best design a model for solving the problem.

So there will be no Federated Learning at all ? Why don’t the banks send the flags, which are not PII, to SWIFT. Then, SWIFT can train a model with complete information. Therefore, we do not need federated learning in this competition.

1 Like

Hi @khangtran97

flags, which are not PII

This is not correct. All data is considered part of the scope of sensitive data for this challenge.

In general, there is information that the bank holds that is relevant to the anomaly detection task that SWIFT does not have access to alone. It is up to participants to figure out how to model this, and how to incorporate that into a privacy-preserving solution using federated learning. A solution that partially solves the modeling problem by not incorporating federated learning does not meet the objectives of the challenge.