Author

Gilad Baruch

Other affiliations: Intel
Bio: Gilad Baruch is an academic researcher from Bar-Ilan University. The author has contributed to research on topics including random access and data structures. The author has an h-index of 6 and has co-authored 15 publications receiving 229 citations. Previous affiliations of Gilad Baruch include Intel.

Papers
Proceedings Article
01 Jan 2019
TL;DR: It is shown that 20% of corrupt workers are sufficient to degrade a CIFAR10 model's accuracy by 50%, as well as to introduce backdoors into MNIST and CIFAR10 models without hurting their accuracy.
Abstract: Distributed learning is central for large-scale training of deep-learning models. However, it is exposed to a security threat in which Byzantine participants can interrupt or control the learning process. Previous attack models assume that the rogue participants (a) are omniscient (know the data of all other participants), and (b) introduce large changes to the parameters. Accordingly, most defense mechanisms make a similar assumption and attempt to use statistically robust methods to identify and discard values whose reported gradients are far from the population mean. We observe that if the empirical variance between the gradients of workers is high enough, an attacker could take advantage of this and launch a non-omniscient attack that operates within the population variance. We show that the variance is indeed high enough even for simple datasets such as MNIST, allowing an attack that is not only undetected by existing defenses, but also uses their power against them, causing those defense mechanisms to consistently select the Byzantine workers while discarding legitimate ones. We demonstrate that our attack method works not only for preventing convergence but also for repurposing the model's behavior ("backdooring"). We show that less than 25% of colluding workers are sufficient to degrade the accuracy of models trained on MNIST, CIFAR10 and CIFAR100 by 50%, as well as to introduce backdoors without hurting the accuracy for MNIST and CIFAR10 datasets, but with a degradation for CIFAR100.

190 citations
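The attack described in the abstract above admits a very compact illustration: instead of submitting an arbitrarily large perturbation, each Byzantine worker reports a value that stays within the empirical spread of the benign gradients. Below is a minimal NumPy sketch of that idea under assumed names (`benign_grads`, `z`); it is an illustration, not the authors' reference implementation.

```python
import numpy as np

def within_variance_attack(benign_grads: np.ndarray, z: float = 1.0) -> np.ndarray:
    """Craft a malicious gradient that stays inside the empirical spread of
    the benign workers' gradients (sketch of the idea, not the paper's code).

    benign_grads: shape (n_workers, n_params), the gradients the attacker
                  controls or can estimate.
    z:            how many standard deviations to shift each coordinate;
                  small enough to remain statistically plausible.
    """
    mu = benign_grads.mean(axis=0)    # coordinate-wise mean
    sigma = benign_grads.std(axis=0)  # coordinate-wise standard deviation
    # Shift each coordinate by z standard deviations: enough to bias the
    # aggregate over many rounds, yet close enough to the population that
    # mean/distance-based defenses do not flag it as an outlier.
    return mu - z * sigma

# Toy usage: 10 honest workers, 100 parameters.
rng = np.random.default_rng(0)
malicious = within_variance_attack(rng.normal(size=(10, 100)), z=1.0)
```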

Posted Content
TL;DR: In this paper, the authors show that small but well-crafted changes are sufficient, leading to a novel non-omniscient attack on distributed learning that goes undetected by all existing defenses.
Abstract: Distributed learning is central for large-scale training of deep-learning models. However, it is exposed to a security threat in which Byzantine participants can interrupt or control the learning process. Previous attack models and their corresponding defenses assume that the rogue participants are (a) omniscient (know the data of all other participants), and (b) introduce large changes to the parameters. We show that small but well-crafted changes are sufficient, leading to a novel non-omniscient attack on distributed learning that goes undetected by all existing defenses. We demonstrate that our attack method works not only for preventing convergence but also for repurposing the model's behavior (backdooring). We show that 20% of corrupt workers are sufficient to degrade a CIFAR10 model's accuracy by 50%, as well as to introduce backdoors into MNIST and CIFAR10 models without hurting their accuracy.

70 citations
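For context, the defenses these two papers evade typically aggregate with a statistically robust rule that discards reported gradients far from the population mean. The following is a generic coordinate-wise trimmed-mean sketch (not any specific published defense); an update crafted to sit within the empirical variance, as above, is by construction unlikely to be trimmed.

```python
import numpy as np

def trimmed_mean(grads: np.ndarray, trim: int = 1) -> np.ndarray:
    """Coordinate-wise trimmed mean: for every parameter, drop the `trim`
    largest and `trim` smallest reported values and average the rest.
    Generic robust-aggregation sketch for illustration only."""
    sorted_grads = np.sort(grads, axis=0)             # sort each coordinate
    kept = sorted_grads[trim : grads.shape[0] - trim]
    return kept.mean(axis=0)

# Toy usage: aggregate 10 workers' gradients, trimming one from each tail.
rng = np.random.default_rng(1)
aggregate = trimmed_mean(rng.normal(size=(10, 100)), trim=1)
```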

Patent
Gilad Baruch, Avigdor Eldar
28 Oct 2015
TL;DR: In this article, a method for background-foreground segmentation for image processing includes obtaining pixel data comprising both non-depth data and depth data for at least one image, where the non-depth data includes color data, luminance data, or both, associated with the pixels.
Abstract: Techniques for a system, article, and method of background-foreground segmentation for image processing may include: obtaining pixel data including both non-depth data and depth data for at least one image, where the non-depth data includes color data, luminance data, or both, associated with the pixels; determining whether a portion of the image is part of the background or foreground of the image based on the depth data and without using the non-depth data; and determining whether a border area between the background and foreground, formed by using the depth data, is part of the background or foreground depending on the non-depth data without using the depth data.

35 citations
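The two-stage idea in the patent abstract, labeling the interior of the image from depth alone and then deciding a thin border band from color alone, can be sketched roughly as follows. This is a simplified illustration with assumed inputs (`depth`, `color`, a depth threshold), not the patented method itself.

```python
import numpy as np
from scipy.ndimage import binary_dilation, binary_erosion

def segment_depth_then_color(depth, color, depth_thresh, border_width=2):
    """Rough two-stage sketch: (1) label foreground/background from depth
    only, (2) relabel a thin border band using color similarity only.
    Illustrative; not the patented algorithm."""
    fg = depth < depth_thresh  # closer pixels treated as foreground

    # Border band: a thin strip around the depth-based boundary.
    band = binary_dilation(fg, iterations=border_width) & ~binary_erosion(
        fg, iterations=border_width)

    # Mean colors of the confident (non-border) regions.
    fg_mean = color[fg & ~band].mean(axis=0)
    bg_mean = color[~fg & ~band].mean(axis=0)

    # Relabel border pixels by whichever mean color is closer; depth is ignored here.
    d_fg = np.linalg.norm(color[band] - fg_mean, axis=1)
    d_bg = np.linalg.norm(color[band] - bg_mean, axis=1)
    out = fg.copy()
    out[band] = d_fg < d_bg
    return out
```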

Journal ArticleDOI
TL;DR: The pruning procedure is improved, and empirical evidence is given that when memory storage is the main concern, the suggested data structure outperforms other direct access techniques such as those due to Külekci, DACs, and sampling, with a slowdown compared to DACs and fixed-length encoding.

15 citations
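Of the baselines named in the TL;DR, the "sampling" technique is the easiest to picture: store the bit offset of every k-th codeword of a variable-length encoding, then answer access(i) by jumping to the nearest sample and decoding forward. Below is a minimal sketch using Elias-gamma codes; the encoding is my own choice for illustration, and the paper's pruned structure and the compared schemes differ.

```python
def gamma_encode(values):
    """Elias-gamma encode positive integers into one bit string."""
    out = []
    for n in values:
        b = bin(n)[2:]                       # binary, MSB first
        out.append('0' * (len(b) - 1) + b)   # len-1 zeros, then the bits
    return ''.join(out)

def gamma_decode_one(bits, pos):
    """Decode one gamma code starting at bit offset pos -> (value, next_pos)."""
    zeros = 0
    while bits[pos + zeros] == '0':
        zeros += 1
    return int(bits[pos + zeros: pos + 2 * zeros + 1], 2), pos + 2 * zeros + 1

def build(values, k=4):
    """Encode the sequence and sample the bit offset of every k-th codeword."""
    bits, samples, pos = gamma_encode(values), [], 0
    for i, n in enumerate(values):
        if i % k == 0:
            samples.append(pos)
        pos += 2 * n.bit_length() - 1        # length of gamma(n)
    return bits, samples

def access(bits, samples, k, i):
    """Direct access: jump to the nearest sample, decode at most k-1 codes."""
    pos = samples[i // k]
    for _ in range(i % k):
        _, pos = gamma_decode_one(bits, pos)
    return gamma_decode_one(bits, pos)[0]

# Toy usage: sparser sampling costs less space but more sequential decoding.
vals = [3, 1, 7, 2, 9, 4, 1, 6]
bits, samples = build(vals, k=4)
assert all(access(bits, samples, 4, i) == v for i, v in enumerate(vals))
```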

Patent
Gilad Baruch
23 May 2016
TL;DR: In this article, a system, article, and method of real-time image segmentation for image processing is described, and a set of techniques related to such a system is presented.
Abstract: Techniques related to a system, article, and method of real-time image segmentation for image processing are described.

14 citations


Cited by
Posted Content
TL;DR: Motivated by the explosive growth in FL research, this paper discusses recent advances and presents an extensive collection of open problems and challenges.
Abstract: Federated learning (FL) is a machine learning setting where many clients (e.g. mobile devices or whole organizations) collaboratively train a model under the orchestration of a central server (e.g. service provider), while keeping the training data decentralized. FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs resulting from traditional, centralized machine learning and data science approaches. Motivated by the explosive growth in FL research, this paper discusses recent advances and presents an extensive collection of open problems and challenges.

1,107 citations
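The orchestration this survey describes follows the federated-averaging pattern: the server ships the current model to a sample of clients, each client trains locally on data that never leaves the device, and the server averages the returned weights. Below is a minimal sketch with assumed callables (`local_train`, `clients`); it is illustrative, not the API of any particular FL framework.

```python
import numpy as np

def federated_round(global_weights, clients, local_train, sample_size=10, seed=None):
    """One round of federated averaging (illustrative sketch).

    clients:     a list of opaque client handles; their raw data stays local.
    local_train: callable(global_weights, client) -> (new_weights, n_examples),
                 assumed to run on the client device.
    """
    rng = np.random.default_rng(seed)
    chosen = rng.choice(len(clients), size=min(sample_size, len(clients)),
                        replace=False)
    updates, counts = [], []
    for idx in chosen:
        new_w, n = local_train(global_weights, clients[idx])
        updates.append(new_w)
        counts.append(n)
    weights = np.asarray(counts, dtype=float)
    weights /= weights.sum()
    # The weighted average of the locally trained models becomes the new global model.
    return sum(w * u for w, u in zip(weights, updates))
```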

Proceedings Article
02 Jul 2018
TL;DR: In this article, a new model-poisoning methodology based on model replacement is proposed to poison a global model in federated learning, which can reach 100% accuracy on the backdoor task.
Abstract: Federated learning enables thousands of participants to construct a deep learning model without sharing their private training data with each other. For example, multiple smartphones can jointly train a next-word predictor for keyboards without revealing what individual users type. We demonstrate that any participant in federated learning can introduce hidden backdoor functionality into the joint global model, e.g., to ensure that an image classifier assigns an attacker-chosen label to images with certain features, or that a word predictor completes certain sentences with an attacker-chosen word. We design and evaluate a new model-poisoning methodology based on model replacement. An attacker selected in a single round of federated learning can cause the global model to immediately reach 100% accuracy on the backdoor task. We evaluate the attack under different assumptions for the standard federated-learning tasks and show that it greatly outperforms data poisoning. Our generic constrain-and-scale technique also evades anomaly detection-based defenses by incorporating the evasion into the attacker's loss function during training.

849 citations
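The core of model replacement can be written down directly: under federated averaging, the attacker solves for the single update that makes the aggregated model equal the backdoored model. The following is a hedged sketch with assumed symbols (`n` for the number of averaged clients, `eta` for the server learning rate), treating benign deltas as roughly cancelling near convergence; it is an illustration, not the paper's exact code.

```python
import numpy as np

def model_replacement_update(global_model, backdoored_model, n, eta):
    """Sketch of the model-replacement idea. Under federated averaging,
        G_next = G + (eta / n) * sum_i (L_i - G).
    Near convergence the benign deltas roughly cancel, so a single attacker
    can submit an update scaled by n/eta that drives G_next onto the
    backdoored model X. Illustration only."""
    gamma = n / eta                              # scaling factor
    return gamma * (backdoored_model - global_model) + global_model

# Toy check with scalar "models": the aggregate lands on the backdoor model X.
G, X, n, eta = 0.0, 1.0, 10, 1.0
L_attacker = model_replacement_update(G, X, n, eta)
G_next = G + (eta / n) * (L_attacker - G)        # benign deltas assumed ~0
assert abs(G_next - X) < 1e-9
```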

Proceedings Article
30 Apr 2020
TL;DR: The distributed backdoor attack (DBA) is proposed --- a novel threat assessment framework developed by fully exploiting the distributed nature of FL that can evade two state-of-the-art robust FL algorithms against centralized backdoors.
Abstract: Backdoor attacks aim to manipulate a subset of training data by injecting adversarial triggers such that machine learning models trained on the tampered dataset will make arbitrary (targeted) incorrect predictions on the test set with the same trigger embedded. While federated learning (FL) is capable of aggregating information provided by different parties for training a better model, its distributed learning methodology and inherently heterogeneous data distribution across parties may bring new vulnerabilities. In addition to recent centralized backdoor attacks on FL where each party embeds the same global trigger during training, we propose the distributed backdoor attack (DBA) --- a novel threat assessment framework developed by fully exploiting the distributed nature of FL. DBA decomposes a global trigger pattern into separate local patterns and embeds them into the training sets of different adversarial parties respectively. Compared to standard centralized backdoors, we show that DBA is substantially more persistent and stealthy against FL on diverse datasets such as finance and image data. We conduct extensive experiments to show that the attack success rate of DBA is significantly higher than that of centralized backdoors under different settings. Moreover, we find that distributed attacks are indeed more insidious, as DBA can evade two state-of-the-art robust FL algorithms that defend against centralized backdoors. We also provide explanations for the effectiveness of DBA via feature visual interpretation and feature importance ranking. To further explore the properties of DBA, we test the attack performance by varying different trigger factors, including local trigger variations (size, gap, and location), scaling factor in FL, data distribution, and poison ratio and interval. Our proposed DBA and thorough evaluation results shed light on characterizing the robustness of FL.

310 citations
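The trigger decomposition at the core of DBA can be pictured in a few lines: a global trigger mask is split into disjoint local patterns, and each adversarial party stamps only its own piece onto its training images. Below is a simplified NumPy sketch using a quadrant split; the paper additionally varies trigger size, gap, location, scaling, data distribution, and poison ratio, none of which is modeled here.

```python
import numpy as np

def split_trigger(trigger_mask):
    """Split a global trigger mask into four disjoint local masks
    (the quadrants of its bounding box). Sketch only."""
    ys, xs = np.nonzero(trigger_mask)
    cy, cx = (ys.min() + ys.max()) / 2, (xs.min() + xs.max()) / 2
    rows = np.arange(trigger_mask.shape[0])[:, None]
    cols = np.arange(trigger_mask.shape[1])[None, :]
    return [trigger_mask & r & c
            for r in (rows <= cy, rows > cy)
            for c in (cols <= cx, cols > cx)]

def stamp(images, local_mask, value=1.0):
    """Each adversarial party embeds only its own local pattern."""
    poisoned = images.copy()
    poisoned[:, local_mask] = value
    return poisoned

# Toy usage: a 4x4 global trigger in the corner of 28x28 images,
# shared among four colluding parties.
trigger = np.zeros((28, 28), dtype=bool)
trigger[:4, :4] = True
local_masks = split_trigger(trigger)
party0_images = stamp(np.zeros((8, 28, 28)), local_masks[0])
```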

Posted Content
TL;DR: This work designs and evaluates a new model-poisoning methodology based on model replacement and demonstrates that any participant in federated learning can introduce hidden backdoor functionality into the joint global model, e.g., to ensure that an image classifier assigns an attacker-chosen label to images with certain features.
Abstract: Federated learning enables thousands of participants to construct a deep learning model without sharing their private training data with each other. For example, multiple smartphones can jointly train a next-word predictor for keyboards without revealing what individual users type. We demonstrate that any participant in federated learning can introduce hidden backdoor functionality into the joint global model, e.g., to ensure that an image classifier assigns an attacker-chosen label to images with certain features, or that a word predictor completes certain sentences with an attacker-chosen word. We design and evaluate a new model-poisoning methodology based on model replacement. An attacker selected in a single round of federated learning can cause the global model to immediately reach 100% accuracy on the backdoor task. We evaluate the attack under different assumptions for the standard federated-learning tasks and show that it greatly outperforms data poisoning. Our generic constrain-and-scale technique also evades anomaly detection-based defenses by incorporating the evasion into the attacker's loss function during training.

272 citations

Patent
21 Jan 2016
TL;DR: In this article, a virtual body model of a person is created from a small number of measurements and a single photograph, combined with one or more images of garments, and is used for photo-realistic fit visualizations of garments.
Abstract: Methods for generating and sharing a virtual body model of a person, created from a small number of measurements and a single photograph, combined with one or more images of garments. The virtual body model is a realistic representation of the user's body and is used for visualizing photo-realistic fit of garments, hairstyles, make-up, and/or other accessories. The virtual garments are created from layers based on photographs of real garments taken from multiple angles. Furthermore, the virtual body model is used in multiple embodiments of manual and automatic garment, make-up, and hairstyle recommendations, such as from channels, friends, and fashion entities. The virtual body model is sharable, for example, for visualization of and comments on looks. It is also used to enable users to buy garments that fit other users, suitable for gifts or the like. The implementation can also be used in peer-to-peer online sales where garments can be bought with the knowledge that the seller has a similar body shape and size as the user.

269 citations