Journal ArticleDOI

Learning to Generate Image Embeddings with User-level Differential Privacy

TLDR
DP-FedEmb is a variant of federated learning algorithms with per-user sensitivity control and noise addition, designed to train from user-partitioned data centralized in the datacenter.
Abstract
Small on-device models have been successfully trained with user-level differential privacy (DP) for next-word prediction and image classification tasks in the past. However, existing methods can fail when directly applied to learn embedding models using supervised training data with a large class space. To achieve user-level DP for large image-to-embedding feature extractors, we propose DP-FedEmb, a variant of federated learning algorithms with per-user sensitivity control and noise addition, to train from user-partitioned data centralized in the datacenter. DP-FedEmb combines virtual clients, partial aggregation, private local fine-tuning, and public pretraining to achieve strong privacy-utility trade-offs. We apply DP-FedEmb to train image embedding models for faces, landmarks, and natural species, and demonstrate its superior utility under the same privacy budget on the benchmark datasets DigiFace, EMNIST, GLD, and iNaturalist. We further illustrate that it is possible to achieve strong user-level DP guarantees of $\epsilon<4$ while keeping the utility drop within 5% when millions of users can participate in training.
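To make the mechanism concrete, here is a minimal NumPy sketch of the per-user sensitivity control and noise addition described in the abstract; the function name, the flat-vector update representation, and the aggregation details are illustrative assumptions rather than the paper's exact algorithm.

```python
import numpy as np

def dp_federated_round(user_updates, clip_norm, noise_multiplier, rng):
    """One round of user-level DP aggregation (illustrative sketch).

    user_updates: list of 1-D numpy arrays, one model delta per (virtual) user.
    clip_norm: per-user L2 sensitivity bound.
    noise_multiplier: Gaussian noise scale relative to clip_norm.
    """
    clipped = []
    for delta in user_updates:
        norm = np.linalg.norm(delta)
        # Scale each user's update so its L2 norm is at most clip_norm.
        clipped.append(delta * min(1.0, clip_norm / (norm + 1e-12)))
    total = np.sum(clipped, axis=0)
    # Add Gaussian noise calibrated to the per-user sensitivity bound.
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=total.shape)
    return (total + noise) / len(user_updates)
```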



Citations
Journal ArticleDOI

How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy

TL;DR: Differential privacy has become a gold standard for making formal statements about data anonymization; while some adoption of DP has happened in industry, attempts to apply DP to real-world complex ML models are still few and far between.
Journal ArticleDOI

Differentially Private Diffusion Models Generate Useful Synthetic Images

TL;DR: In this article, Wang et al. used differential privacy to fine-tune ImageNet-pretrained diffusion models with more than 80M parameters, obtaining SOTA results on CIFAR-10 and Camelyon17 in terms of both FID and the accuracy of downstream classifiers trained on synthetic data.
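Such DP fine-tuning builds on DP-SGD-style per-example gradient clipping and noising. Below is a hedged PyTorch sketch of a single private step using a per-example microbatch loop; the function name and hyperparameters are illustrative, and practical training would rely on a vectorized library such as Opacus rather than this slow loop.

```python
import torch

def dp_sgd_step(model, loss_fn, xs, ys, clip_norm=1.0, noise_mult=1.0, lr=1e-4):
    """One DP-SGD step with per-example gradient clipping (illustrative)."""
    params = [p for p in model.parameters() if p.requires_grad]
    summed = [torch.zeros_like(p) for p in params]
    for x, y in zip(xs, ys):  # microbatch loop: one example at a time
        loss = loss_fn(model(x.unsqueeze(0)), y.unsqueeze(0))
        grads = torch.autograd.grad(loss, params)
        norm = torch.sqrt(sum(g.pow(2).sum() for g in grads))
        scale = torch.clamp(clip_norm / (norm + 1e-12), max=1.0)
        for s, g in zip(summed, grads):
            s.add_(g * scale)  # accumulate the clipped per-example gradient
    with torch.no_grad():
        for p, s in zip(params, summed):
            noisy = (s + torch.randn_like(s) * noise_mult * clip_norm) / len(xs)
            p.add_(noisy, alpha=-lr)  # noisy gradient descent update
```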
Proceedings ArticleDOI

Federated Learning of Gboard Language Models with Differential Privacy

TL;DR: In this article, the authors train and deploy language models (LMs) with federated learning (FL) and differential privacy (DP) in Google Keyboard (Gboard), using the recent DP-Follow-the-Regularized-Leader (DP-FTRL) algorithm.
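DP-FTRL privatizes gradient prefix sums with the binary-tree mechanism rather than per-round subsampled noise: each tree node covers a dyadic interval of steps and carries its own Gaussian noise, so any prefix sum touches only O(log t) noise terms. A minimal NumPy sketch of that tree aggregation, with illustrative class and method names:

```python
import numpy as np

class TreeAggregator:
    """Noisy prefix sums via the binary-tree mechanism (illustrative sketch)."""

    def __init__(self, dim, noise_std, rng):
        self.dim, self.noise_std, self.rng = dim, noise_std, rng
        self.stack = []  # list of (level, true_sum, noisy_sum) dyadic nodes

    def _noise(self):
        return self.rng.normal(0.0, self.noise_std, size=self.dim)

    def add(self, grad):
        """Insert the gradient for the next step; return the noisy prefix sum."""
        self.stack.append((0, grad, grad + self._noise()))
        # Merge completed sibling intervals into their parent node,
        # which gets its own fresh noise.
        while len(self.stack) >= 2 and self.stack[-1][0] == self.stack[-2][0]:
            lvl, s1, _ = self.stack.pop()
            _, s2, _ = self.stack.pop()
            total = s1 + s2
            self.stack.append((lvl + 1, total, total + self._noise()))
        # The stack's intervals exactly tile steps 1..t.
        return sum(noisy for _, _, noisy in self.stack)
```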
Journal ArticleDOI

An Empirical Evaluation of Federated Contextual Bandit Algorithms

TL;DR: In this article, the authors propose federated contextual bandits for learning from sensitive data local to user devices, where learning can be done using implicit signals generated as users interact with the applications of interest, rather than requiring access to explicit labels.
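As one concrete instance of this setting (not necessarily one of the algorithms evaluated in the paper), here is a NumPy sketch of a federated LinUCB variant where clients accumulate sufficient statistics locally and a server merges the increments; all names and the environment interface are illustrative assumptions.

```python
import numpy as np

def linucb_action(A, b, contexts, alpha=1.0):
    """Pick the arm with the highest upper confidence bound (sketch)."""
    A_inv = np.linalg.inv(A)
    theta = A_inv @ b
    scores = [x @ theta + alpha * np.sqrt(x @ A_inv @ x) for x in contexts]
    return int(np.argmax(scores))

def client_round(A, b, env_steps, alpha=1.0):
    """Run local bandit steps; return only the *increments* to the stats."""
    dA, db = np.zeros_like(A), np.zeros_like(b)
    for contexts, reward_fn in env_steps:  # per-step arm contexts and rewards
        a = linucb_action(A + dA, b + db, contexts, alpha)
        x, r = contexts[a], reward_fn(a)
        dA += np.outer(x, x)
        db += r * x
    return dA, db

def server_merge(A, b, client_increments):
    """Federated aggregation: sum client increments into the global model."""
    for dA, db in client_increments:
        A, b = A + dA, b + db
    return A, b
```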
Journal ArticleDOI

Can Public Large Language Models Help Private Cross-device Federated Learning?

TL;DR: The authors propose a distribution matching algorithm with theoretical grounding to sample public data close to the private data distribution, which significantly improves the sample efficiency of (pre-)training on public data and further improves the privacy-utility trade-off via distillation techniques.
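To convey the idea only, here is a deliberately simplified NumPy sketch of selecting public examples near the private distribution in a shared embedding space; the selection rule and names are illustrative assumptions, not the authors' algorithm, which is more principled and carries privacy accounting.

```python
import numpy as np

def select_public_samples(public_emb, private_emb, k):
    """Pick the k public examples closest to the private data distribution.

    public_emb: (N_pub, dim) embeddings of candidate public examples.
    private_emb: (N_priv, dim) embeddings of private examples.
    """
    target = private_emb.mean(axis=0)               # summary of private data
    dists = np.linalg.norm(public_emb - target, axis=1)
    return np.argsort(dists)[:k]                    # indices of best matches
```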
References
Posted Content

Deep Residual Learning for Image Recognition

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
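As a reminder of the core idea, a minimal PyTorch sketch of a basic residual block follows; the class name and channel handling (same input and output width) are illustrative simplifications of the paper's architecture.

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Basic ResNet block: the network learns a residual F(x) added to x."""

    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)  # identity shortcut eases optimization
```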
Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
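Below is a minimal NumPy sketch of the training-mode batch normalization transform; the running-statistics bookkeeping used at inference time is omitted for brevity.

```python
import numpy as np

def batch_norm_forward(x, gamma, beta, eps=1e-5):
    """Normalize activations over the batch dimension (training-mode sketch).

    x: (batch, features). gamma/beta: learnable per-feature scale and shift.
    """
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)  # zero mean, unit variance
    return gamma * x_hat + beta
```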

Gradient-based learning applied to document recognition

TL;DR: This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task; convolutional neural networks are shown to outperform all other techniques.
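For concreteness, here is a LeNet-style network in PyTorch for 28x28 digit images, in the spirit of the architecture this paper made famous; the exact layer sizes and activations are an illustrative approximation, not a faithful reproduction.

```python
import torch.nn as nn

# LeNet-style CNN: alternating convolution and pooling layers
# followed by fully connected layers, for 1x28x28 inputs.
lenet = nn.Sequential(
    nn.Conv2d(1, 6, kernel_size=5, padding=2), nn.Tanh(),
    nn.AvgPool2d(2),                       # 6x28x28 -> 6x14x14
    nn.Conv2d(6, 16, kernel_size=5), nn.Tanh(),
    nn.AvgPool2d(2),                       # 16x10x10 -> 16x5x5
    nn.Flatten(),
    nn.Linear(16 * 5 * 5, 120), nn.Tanh(),
    nn.Linear(120, 84), nn.Tanh(),
    nn.Linear(84, 10),                     # 10 digit classes
)
```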
Posted Content

MobileNetV2: Inverted Residuals and Linear Bottlenecks

TL;DR: A new mobile architecture, MobileNetV2, is described that improves the state-of-the-art performance of mobile models on multiple tasks and benchmarks, as well as across a spectrum of different model sizes, and allows decoupling of the input/output domains from the expressiveness of the transformation.
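The building block behind this is the inverted residual with a linear bottleneck: a 1x1 expansion, a depthwise 3x3 convolution, and a 1x1 projection with no activation. A PyTorch sketch, with the class name and default expansion factor as assumptions:

```python
import torch.nn as nn

class InvertedResidual(nn.Module):
    """MobileNetV2 block: 1x1 expand, depthwise 3x3, linear 1x1 project."""

    def __init__(self, in_ch, out_ch, stride=1, expand=6):
        super().__init__()
        hidden = in_ch * expand
        self.use_skip = stride == 1 and in_ch == out_ch
        self.block = nn.Sequential(
            nn.Conv2d(in_ch, hidden, 1, bias=False),          # expand
            nn.BatchNorm2d(hidden), nn.ReLU6(inplace=True),
            nn.Conv2d(hidden, hidden, 3, stride=stride,
                      padding=1, groups=hidden, bias=False),  # depthwise
            nn.BatchNorm2d(hidden), nn.ReLU6(inplace=True),
            nn.Conv2d(hidden, out_ch, 1, bias=False),         # linear bottleneck
            nn.BatchNorm2d(out_ch),                           # no activation here
        )

    def forward(self, x):
        out = self.block(x)
        return x + out if self.use_skip else out  # skip only when shapes match
```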
Posted Content

A Simple Framework for Contrastive Learning of Visual Representations

TL;DR: It is shown that the composition of data augmentations plays a critical role in defining effective predictive tasks, that introducing a learnable nonlinear transformation between the representation and the contrastive loss substantially improves the quality of the learned representations, and that contrastive learning benefits from larger batch sizes and more training steps compared to supervised learning.
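At the heart of this framework (SimCLR) is the normalized temperature-scaled cross-entropy (NT-Xent) loss over augmented pairs. A minimal PyTorch sketch, where the batch layout (two aligned views of the same images) is an assumption:

```python
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.5):
    """NT-Xent contrastive loss over a batch of augmented pairs (sketch).

    z1, z2: (batch, dim) projections of two augmentations of the same images.
    """
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)   # 2N x dim, unit norm
    sim = z @ z.t() / temperature                        # pairwise similarities
    n = z1.size(0)
    # Mask self-similarity so an example is never its own negative.
    sim.fill_diagonal_(float('-inf'))
    # The positive for index i is its augmented counterpart at i +/- n.
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)])
    return F.cross_entropy(sim, targets)
```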