Home
/
Authors
/
Marco Barreno

Author

Marco Barreno

Other affiliations: University of California

Bio: Marco Barreno is an academic researcher from University of California, Berkeley. The author has contributed to research in topics: Inductive transfer & Instance-based learning. The author has an hindex of 7, co-authored 8 publications receiving 2039 citations. Previous affiliations of Marco Barreno include University of California.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Can machine learning be secure

[...]

Marco Barreno¹, Blaine Nelson¹, Russell Sears¹, Anthony D. Joseph¹, J. D. Tygar¹ - Show less +1 more•Institutions (1)

University of California, Berkeley¹

21 Mar 2006

TL;DR: A taxonomy of different types of attacks on machine learning techniques and systems, a variety of defenses against those attacks, and an analytical model giving a lower bound on attacker's work function are provided.

...read moreread less

Abstract: Machine learning systems offer unparalled flexibility in dealing with evolving input in a variety of applications, such as intrusion detection systems and spam e-mail filtering. However, machine learning algorithms themselves can be a target of attack by a malicious adversary. This paper provides a framework for answering the question, "Can machine learning be secure?" Novel contributions of this paper include a taxonomy of different types of attacks on machine learning techniques and systems, a variety of defenses against those attacks, a discussion of ideas that are important to security for machine learning, an analytical model giving a lower bound on attacker's work function, and a list of open problems.

...read moreread less

853 citations

Journal Article•DOI•

The security of machine learning

[...]

Marco Barreno¹, Blaine Nelson¹, Anthony D. Joseph¹, J. D. Tygar¹•Institutions (1)

University of California, Berkeley¹

01 Nov 2010-Machine Learning

TL;DR: A taxonomy identifying and analyzing attacks against machine learning systems is presented, showing how these classes influence the costs for the attacker and defender, and a formal structure defining their interaction is given.

...read moreread less

Abstract: Machine learning's ability to rapidly evolve to changing and complex situations has helped it become a fundamental tool for computer security. That adaptability is also a vulnerability: attackers can exploit machine learning systems. We present a taxonomy identifying and analyzing attacks against machine learning systems. We show how these classes influence the costs for the attacker and defender, and we give a formal structure defining their interaction. We use our framework to survey and analyze the literature of attacks against machine learning systems. We also illustrate our taxonomy by showing how it can guide attacks against SpamBayes, a popular statistical spam filter. Finally, we discuss how our taxonomy suggests new lines of defenses.

...read moreread less

811 citations

Proceedings Article•

Exploiting machine learning to subvert your spam filter

[...]

Blaine Nelson¹, Marco Barreno¹, Fuching Jack Chi¹, Anthony D. Joseph¹, Benjamin I. P. Rubinstein¹, Udam Saini¹, Charles Sutton¹, J. D. Tygar¹, Kai Xia¹ - Show less +5 more•Institutions (1)

University of California, Berkeley¹

15 Apr 2008

TL;DR: This paper shows how an adversary can exploit statistical machine learning, as used in the SpamBayes spam filter, to render it useless--even if the adversary's access is limited to only 1% of the training messages.

...read moreread less

Abstract: Using statistical machine learning for making security decisions introduces new vulnerabilities in large scale systems. This paper shows how an adversary can exploit statistical machine learning, as used in the SpamBayes spam filter, to render it useless--even if the adversary's access is limited to only 1% of the training messages. We further demonstrate a new class of focused attacks that successfully prevent victims from receiving specific email messages. Finally, we introduce two new types of defenses against these attacks.

...read moreread less

347 citations

Proceedings Article•DOI•

Selfish caching in distributed systems: a game-theoretic analysis

[...]

Byung-Gon Chun¹, Kamalika Chaudhuri¹, Hoeteck Wee¹, Marco Barreno¹, Christos H. Papadimitriou¹, John Kubiatowicz¹ - Show less +2 more•Institutions (1)

University of California, Berkeley¹

25 Jul 2004

TL;DR: The existence of pure strategy Nash equilibria is shown, the price of anarchy is investigated, and the game can always implement the social optimum in the best case by giving servers incentive to replicate.

...read moreread less

Abstract: We analyze replication of resources by server nodes that act selfishly, using a game-theoretic approach. We refer to this as the selfish caching problem. In our model, nodes incur either cost for replicating resources or cost for access to a remote replica. We show the existence of pure strategy Nash equilibria and investigate the price of anarchy, which is the relative cost of the lack of coordination. The price of anarchy can be high due to undersupply problems, but with certain network topologies it has better bounds. With a payment scheme the game can always implement the social optimum in the best case by giving servers incentive to replicate.

...read moreread less

171 citations

Book Chapter•DOI•

Misleading Learners: Co-opting Your Spam Filter

[...]

Blaine Nelson¹, Marco Barreno¹, Fuching Jack Chi¹, Anthony D. Joseph¹, Benjamin I. P. Rubinstein¹, Udam Saini¹, Charles Sutton¹, J. D. Tygar¹, Kai Xia¹ - Show less +5 more•Institutions (1)

University of California¹

01 Jan 2009-Springer US

TL;DR: It is shown how an adversary can exploit statistical machine learning, as used in the SpamBayes spam filter, to make it useless—even if the adversary's access is limited to only 1% of the spam training messages.

...read moreread less

Abstract: Using statistical machine learning for making security decisions intro- duces new vulnerabilities in large scale systems. We show how an adversary can exploit statistical machine learning, as used in the SpamBayes spam filter, to ren- der it useless—even if the adversary's access is limited to only 1% of the spam training messages. We demonstrate three new attacks that successfully make the filter unusable, prevent victims from receiving specific email messages, and cause spam emails to arrive in the victim's inbox.

...read moreread less

106 citations

Cited by

PDF

Open Access

More filters

Book Chapter•DOI•

Adversarial examples in the physical world

[...]

Alexey Kurakin¹, Ian Goodfellow², Samy Bengio¹•Institutions (2)

Google¹, OpenAI²

08 Jul 2016

TL;DR: It is found that a large fraction of adversarial examples are classified incorrectly even when perceived through the camera, which shows that even in physical world scenarios, machine learning systems are vulnerable to adversarialExamples.

...read moreread less

Abstract: Most existing machine learning classifiers are highly vulnerable to adversarial examples. An adversarial example is a sample of input data which has been modified very slightly in a way that is intended to cause a machine learning classifier to misclassify it. In many cases, these modifications can be so subtle that a human observer does not even notice the modification at all, yet the classifier still makes a mistake. Adversarial examples pose security concerns because they could be used to perform an attack on machine learning systems, even if the adversary has no access to the underlying model. Up to now, all previous work have assumed a threat model in which the adversary can feed data directly into the machine learning classifier. This is not always the case for systems operating in the physical world, for example those which are using signals from cameras and other sensors as an input. This paper shows that even in such physical world scenarios, machine learning systems are vulnerable to adversarial examples. We demonstrate this by feeding adversarial images obtained from cell-phone camera to an ImageNet Inception classifier and measuring the classification accuracy of the system. We find that a large fraction of adversarial examples are classified incorrectly even when perceived through the camera.

...read moreread less

3,776 citations

Proceedings Article•DOI•

The Limitations of Deep Learning in Adversarial Settings

[...]

Nicolas Papernot¹, Patrick McDaniel¹, Somesh Jha², Matt Fredrikson², Z. Berkay Celik¹, Ananthram Swami³ - Show less +2 more•Institutions (3)

Pennsylvania State University¹, University of Wisconsin-Madison², United States Army Research Laboratory³

21 Mar 2016

TL;DR: This work formalizes the space of adversaries against deep neural networks (DNNs) and introduces a novel class of algorithms to craft adversarial samples based on a precise understanding of the mapping between inputs and outputs of DNNs.

...read moreread less

Abstract: Deep learning takes advantage of large datasets and computationally efficient training algorithms to outperform other approaches at various machine learning tasks. However, imperfections in the training phase of deep neural networks make them vulnerable to adversarial samples: inputs crafted by adversaries with the intent of causing deep neural networks to misclassify. In this work, we formalize the space of adversaries against deep neural networks (DNNs) and introduce a novel class of algorithms to craft adversarial samples based on a precise understanding of the mapping between inputs and outputs of DNNs. In an application to computer vision, we show that our algorithms can reliably produce samples correctly classified by human subjects but misclassified in specific targets by a DNN with a 97% adversarial success rate while only modifying on average 4.02% of the input features per sample. We then evaluate the vulnerability of different sample classes to adversarial perturbations by defining a hardness measure. Finally, we describe preliminary work outlining defenses against adversarial samples by defining a predictive measure of distance between a benign input and a target classification.

...read moreread less

3,114 citations

Proceedings Article•DOI•

Practical Black-Box Attacks against Machine Learning

[...]

Nicolas Papernot¹, Patrick McDaniel¹, Ian Goodfellow², Somesh Jha³, Z. Berkay Celik¹, Ananthram Swami⁴ - Show less +2 more•Institutions (4)

Pennsylvania State University¹, OpenAI², University of Wisconsin-Madison³, United States Army Research Laboratory⁴

02 Apr 2017

TL;DR: This work introduces the first practical demonstration of an attacker controlling a remotely hosted DNN with no such knowledge, and finds that this black-box attack strategy is capable of evading defense strategies previously found to make adversarial example crafting harder.

...read moreread less

Abstract: Machine learning (ML) models, e.g., deep neural networks (DNNs), are vulnerable to adversarial examples: malicious inputs modified to yield erroneous model outputs, while appearing unmodified to human observers. Potential attacks include having malicious content like malware identified as legitimate or controlling vehicle behavior. Yet, all existing adversarial example attacks require knowledge of either the model internals or its training data. We introduce the first practical demonstration of an attacker controlling a remotely hosted DNN with no such knowledge. Indeed, the only capability of our black-box adversary is to observe labels given by the DNN to chosen inputs. Our attack strategy consists in training a local model to substitute for the target DNN, using inputs synthetically generated by an adversary and labeled by the target DNN. We use the local substitute to craft adversarial examples, and find that they are misclassified by the targeted DNN. To perform a real-world and properly-blinded evaluation, we attack a DNN hosted by MetaMind, an online deep learning API. We find that their DNN misclassifies 84.24% of the adversarial examples crafted with our substitute. We demonstrate the general applicability of our strategy to many ML techniques by conducting the same attack against models hosted by Amazon and Google, using logistic regression substitutes. They yield adversarial examples misclassified by Amazon and Google at rates of 96.19% and 88.94%. We also find that this black-box attack strategy is capable of evading defense strategies previously found to make adversarial example crafting harder.

...read moreread less

2,712 citations

Proceedings Article•DOI•

Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures

[...]

Matt Fredrikson¹, Somesh Jha², Thomas Ristenpart³•Institutions (3)

Carnegie Mellon University¹, University of Wisconsin-Madison², Cornell University³

12 Oct 2015

TL;DR: A new class of model inversion attack is developed that exploits confidence values revealed along with predictions and is able to estimate whether a respondent in a lifestyle survey admitted to cheating on their significant other and recover recognizable images of people's faces given only their name.

...read moreread less

Abstract: Machine-learning (ML) algorithms are increasingly utilized in privacy-sensitive applications such as predicting lifestyle choices, making medical diagnoses, and facial recognition. In a model inversion attack, recently introduced in a case study of linear classifiers in personalized medicine by Fredrikson et al., adversarial access to an ML model is abused to learn sensitive genomic information about individuals. Whether model inversion attacks apply to settings outside theirs, however, is unknown. We develop a new class of model inversion attack that exploits confidence values revealed along with predictions. Our new attacks are applicable in a variety of settings, and we explore two in depth: decision trees for lifestyle surveys as used on machine-learning-as-a-service systems and neural networks for facial recognition. In both cases confidence values are revealed to those with the ability to make prediction queries to models. We experimentally show attacks that are able to estimate whether a respondent in a lifestyle survey admitted to cheating on their significant other and, in the other context, show how to recover recognizable images of people's faces given only their name and access to the ML model. We also initiate experimental exploration of natural countermeasures, investigating a privacy-aware decision tree training algorithm that is a simple variant of CART learning, as well as revealing only rounded confidence values. The lesson that emerges is that one can avoid these kinds of MI attacks with negligible degradation to utility.

...read moreread less

2,156 citations

Proceedings Article•DOI•

Distillation as a Defense to Adversarial Perturbations Against Deep Neural Networks

[...]

Nicolas Papernot¹, Patrick McDaniel¹, Xi Wu², Somesh Jha², Ananthram Swami³ - Show less +1 more•Institutions (3)

Pennsylvania State University¹, University of Wisconsin-Madison², United States Army Research Laboratory³

22 May 2016

TL;DR: In this article, the authors introduce a defensive mechanism called defensive distillation to reduce the effectiveness of adversarial samples on DNNs, which increases the average minimum number of features that need to be modified to create adversarial examples by about 800%.

...read moreread less

Abstract: Deep learning algorithms have been shown to perform extremely well on manyclassical machine learning problems. However, recent studies have shown thatdeep learning, like other machine learning techniques, is vulnerable to adversarial samples: inputs crafted to force adeep neural network (DNN) to provide adversary-selected outputs. Such attackscan seriously undermine the security of the system supported by the DNN, sometimes with devastating consequences. For example, autonomous vehicles canbe crashed, illicit or illegal content can bypass content filters, or biometricauthentication systems can be manipulated to allow improper access. In thiswork, we introduce a defensive mechanism called defensive distillationto reduce the effectiveness of adversarial samples on DNNs. We analyticallyinvestigate the generalizability and robustness properties granted by the useof defensive distillation when training DNNs. We also empirically study theeffectiveness of our defense mechanisms on two DNNs placed in adversarialsettings. The study shows that defensive distillation can reduce effectivenessof sample creation from 95% to less than 0.5% on a studied DNN. Such dramaticgains can be explained by the fact that distillation leads gradients used inadversarial sample creation to be reduced by a factor of 1030. We alsofind that distillation increases the average minimum number of features thatneed to be modified to create adversarial samples by about 800% on one of theDNNs we tested.

...read moreread less

2,130 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse