Open Access Proceedings Article

Robust Federated Learning: The Case of Affine Distribution Shifts

TL;DR
This paper considers a structured affine distribution shift in users' data that captures the device-dependent data heterogeneity in federated settings, and proposes a Federated Learning framework Robust to Affine distribution shifts (FLRA) that is provably robust against affine Wasserstein shifts to the distribution of observed samples.
Abstract
Federated learning is a distributed paradigm that trains models on samples spread across multiple users in a network while keeping those samples on users' devices, with the aims of efficiency and protecting users' privacy. In such settings, the training data is often statistically heterogeneous and manifests various distribution shifts across users, which degrades the performance of the learnt model. The primary goal of this paper is to develop a robust federated learning algorithm that achieves satisfactory performance in the presence of distribution shifts in users' samples. To achieve this goal, we first consider a structured affine distribution shift in users' data that captures the device-dependent data heterogeneity of federated settings. This perturbation model is applicable to various federated learning problems such as image classification, where images undergo device-dependent imperfections, e.g. different intensity, contrast, and brightness. To address affine distribution shifts across users, we propose a Federated Learning framework Robust to Affine distribution shifts (FLRA) that is provably robust against affine Wasserstein shifts to the distribution of observed samples. To solve FLRA's distributed minimax problem, we propose a fast and efficient optimization method with convergence guarantees via a Gradient Descent Ascent (GDA) method. We further prove generalization error bounds for the learnt classifier, showing proper generalization from the empirical distribution of samples to the true underlying distribution. We perform several numerical experiments to empirically support FLRA: we show that an affine distribution shift indeed suffices to significantly degrade the performance of the learnt classifier for a new test user, and that our proposed algorithm achieves significant gains over standard federated learning and adversarial training methods.
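To make the minimax structure concrete, below is a minimal single-user sketch of gradient descent ascent under the abstract's perturbation model x ↦ Λx + δ. The logistic loss, the quadratic penalty keeping (Λ, δ) near the identity shift, the coefficients, and the function name are illustrative assumptions, not the authors' exact FLRA formulation (which also involves federated aggregation across users).

```python
import numpy as np

def gda_affine_robust(X, y, steps=200, eta_w=0.1, eta_adv=0.05, lam=1.0):
    """Gradient descent ascent on
        min_w max_{Lambda, delta}  loss(w; Lambda x + delta)
                                   - lam * (||Lambda - I||^2 + ||delta||^2).
    Logistic loss is a stand-in; labels y must be in {-1, +1}."""
    n, d = X.shape
    w = np.zeros(d)
    Lam = np.eye(d)           # affine shift applied to samples: x -> Lam @ x + delta
    delta = np.zeros(d)

    def grads(w, Lam, delta):
        Z = X @ Lam.T + delta                 # perturbed samples
        margins = y * (Z @ w)
        s = -y / (1.0 + np.exp(margins))      # d loss / d (w . z), per sample
        g_w = (s[:, None] * Z).mean(axis=0)   # gradient w.r.t. the model
        g_z = s[:, None] * w[None, :]         # d loss / d z, per sample
        g_Lam = g_z.T @ X / n - 2 * lam * (Lam - np.eye(d))
        g_delta = g_z.mean(axis=0) - 2 * lam * delta
        return g_w, g_Lam, g_delta

    for _ in range(steps):
        g_w, g_Lam, g_delta = grads(w, Lam, delta)
        w -= eta_w * g_w          # descent on the model parameters
        Lam += eta_adv * g_Lam    # ascent on the affine shift
        delta += eta_adv * g_delta
    return w, Lam, delta

# toy usage (labels in {-1, +1}):
# rng = np.random.default_rng(0)
# X = rng.normal(size=(64, 5)); y = np.sign(rng.normal(size=64))
# w, Lam, delta = gda_affine_robust(X, y)
```

The ascent step pushes (Λ, δ) toward the worst-case affine shift that the penalty budget allows, while the descent step fits the model against that shift.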



Citations
Posted Content

A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection

TL;DR: A comprehensive review of federated learning systems can be found in this paper, where the authors provide a thorough categorization of the existing systems according to six different aspects, including data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation and motivation of federation.
Journal Article

Federated Learning in Edge Computing: A Systematic Survey

TL;DR: This paper provides a systematic survey of the literature on implementing FL in edge computing (EC) environments, with a taxonomy that identifies advanced solutions and open problems, helping researchers better understand the connection between FL and EC enabling technologies and concepts.
Proceedings Article

FEDNEST: Federated Bilevel, Minimax, and Compositional Optimization

TL;DR: This work proposes FedNest, a federated alternating stochastic gradient method to address general nested problems, and introduces multiple innovations, including federated hypergradient computation and variance reduction, to address inner-level heterogeneity.
Posted Content

Federated Learning on Non-IID Data Silos: An Experimental Study

TL;DR: Wang et al. propose comprehensive data partitioning strategies to cover the typical non-IID data cases and conduct extensive experiments evaluating state-of-the-art federated learning algorithms.
References
Proceedings Article

Deep Residual Learning for Image Recognition

TL;DR: The authors propose a residual learning framework to ease the training of networks substantially deeper than those used previously, which won 1st place in the ILSVRC 2015 classification task.
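The core idea is that each block learns a correction F(x) added to its input rather than a full mapping. A minimal PyTorch sketch of such a block, with illustrative layer sizes rather than the paper's exact configuration:

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    """y = F(x) + x: the block learns a residual F rather than a full mapping."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)  # identity shortcut carries x around F
```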
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
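The update keeps exponential moving averages of the gradient (first moment) and its elementwise square (second moment), applies bias correction, and scales the step per parameter. A minimal NumPy sketch of one step, using the paper's default hyperparameters:

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """One Adam update. m, v are running first/second moment estimates; t >= 1."""
    m = b1 * m + (1 - b1) * grad           # first-moment (mean) estimate
    v = b2 * v + (1 - b2) * grad**2        # second-moment (uncentered variance) estimate
    m_hat = m / (1 - b1**t)                # bias correction for zero initialization
    v_hat = v / (1 - b2**t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```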
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: A deep convolutional neural network, consisting of five convolutional layers (some followed by max-pooling layers) and three fully-connected layers with a final 1000-way softmax, achieved state-of-the-art performance on ImageNet classification.
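A simplified PyTorch rendering of that layout follows; the channel counts track the original paper, but the sketch omits details such as the two-GPU split and local response normalization, so treat it as illustrative:

```python
import torch.nn as nn

# Five conv layers (pooling after the 1st, 2nd, and 5th) and three
# fully-connected layers ending in 1000-way logits; expects 3x227x227 input.
alexnet_like = nn.Sequential(
    nn.Conv2d(3, 96, kernel_size=11, stride=4), nn.ReLU(),
    nn.MaxPool2d(3, stride=2),
    nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(),
    nn.MaxPool2d(3, stride=2),
    nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(3, stride=2),
    nn.Flatten(),
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(), nn.Dropout(),
    nn.Linear(4096, 4096), nn.ReLU(), nn.Dropout(),
    nn.Linear(4096, 1000),  # logits for a 1000-way softmax
)
```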
Proceedings Article

Going deeper with convolutions

TL;DR: Inception is a deep convolutional neural network architecture that achieved a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
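The mechanism is to normalize each feature over the mini-batch and then restore expressiveness with learned scale and shift parameters. A minimal NumPy sketch of the training-time transform:

```python
import numpy as np

def batch_norm_train(x, gamma, beta, eps=1e-5):
    """Training-time batch normalization.
    x: (batch, features); gamma, beta: learned per-feature scale and shift."""
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mu) / np.sqrt(var + eps)   # normalize each feature over the batch
    return gamma * x_hat + beta             # learned scale and shift
```

At inference time, running averages of mu and var collected during training replace the batch statistics.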