Home
/
Authors
/
Andy Chu

Author

Andy Chu

Bio: Andy Chu is an academic researcher from Google. The author has contributed to research in topics: Deep learning & Information sensitivity. The author has an hindex of 3, co-authored 3 publications receiving 2508 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Deep Learning with Differential Privacy

[...]

Martín Abadi¹, Andy Chu¹, Ian Goodfellow, H. Brendan McMahan¹, Ilya Mironov¹, Kunal Talwar¹, Li Zhang¹ - Show less +3 more•Institutions (1)

Google¹

24 Oct 2016

TL;DR: In this paper, the authors develop new algorithmic techniques for learning and a refined analysis of privacy costs within the framework of differential privacy, and demonstrate that they can train deep neural networks with nonconvex objectives, under a modest privacy budget, and at a manageable cost in software complexity, training efficiency, and model quality.

...read moreread less

Abstract: Machine learning techniques based on neural networks are achieving remarkable results in a wide variety of domains. Often, the training of models requires large, representative datasets, which may be crowdsourced and contain sensitive information. The models should not expose private information in these datasets. Addressing this goal, we develop new algorithmic techniques for learning and a refined analysis of privacy costs within the framework of differential privacy. Our implementation and experiments demonstrate that we can train deep neural networks with non-convex objectives, under a modest privacy budget, and at a manageable cost in software complexity, training efficiency, and model quality.

...read moreread less

2,944 citations

Proceedings Article•DOI•

Deep Learning with Differential Privacy

[...]

Martín Abadi¹, Andy Chu¹, Ian Goodfellow, H. Brendan McMahan¹, Ilya Mironov¹, Kunal Talwar¹, Li Zhang¹ - Show less +3 more•Institutions (1)

Google¹

01 Jul 2016-arXiv: Machine Learning

TL;DR: This work develops new algorithmic techniques for learning and a refined analysis of privacy costs within the framework of differential privacy, and demonstrates that deep neural networks can be trained with non-convex objectives, under a modest privacy budget, and at a manageable cost in software complexity, training efficiency, and model quality.

...read moreread less

1,777 citations

Patent•

Determining the relationship between source code bases

[...]

Andy Chu¹•Institutions (1)

Google¹

28 Sep 2005

TL;DR: In this article, an automated technique compares two sets of documents (such as two source codebases) to automatically determine documents within each set that are similar to one another, and a similarity score is calculated for each of the pairs of documents based on the lines from the matrix.

...read moreread less

Abstract: An automated technique compares two sets of documents (such as two source codebases) to automatically determine documents within each set that are similar to one another. The technique constructs a matrix relating pairs of documents from the first and second sets of documents to lines that occur in both documents in each of the pairs of documents. A similarity score is calculated for each of the pairs of documents based on the lines from the matrix.

...read moreread less

9 citations

Cited by

PDF

Open Access

More filters

Posted Content•

Communication-Efficient Learning of Deep Networks from Decentralized Data

[...]

H. Brendan McMahan¹, Eider Moore¹, Daniel Ramage¹, Seth Hampson, Blaise Aguera y Arcas¹ - Show less +1 more•Institutions (1)

Google¹

17 Feb 2016-arXiv: Learning

TL;DR: This work presents a practical method for the federated learning of deep networks based on iterative model averaging, and conducts an extensive empirical evaluation, considering five different model architectures and four datasets.

...read moreread less

Abstract: Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image models can automatically select good photos. However, this rich data is often privacy sensitive, large in quantity, or both, which may preclude logging to the data center and training there using conventional approaches. We advocate an alternative that leaves the training data distributed on the mobile devices, and learns a shared model by aggregating locally-computed updates. We term this decentralized approach Federated Learning. We present a practical method for the federated learning of deep networks based on iterative model averaging, and conduct an extensive empirical evaluation, considering five different model architectures and four datasets. These experiments demonstrate the approach is robust to the unbalanced and non-IID data distributions that are a defining characteristic of this setting. Communication costs are the principal constraint, and we show a reduction in required communication rounds by 10-100x as compared to synchronized stochastic gradient descent.

...read moreread less

5,936 citations

Journal Article•DOI•

Federated Machine Learning: Concept and Applications

[...]

Qiang Yang¹, Yang Liu, Tianjian Chen, Yongxin Tong²•Institutions (2)

Hong Kong University of Science and Technology¹, Beihang University²

28 Jan 2019-ACM Transactions on Intelligent Systems and Technology

TL;DR: This work introduces a comprehensive secure federated-learning framework, which includes horizontal federated learning, vertical federatedLearning, and federated transfer learning, and provides a comprehensive survey of existing works on this subject.

...read moreread less

Abstract: Today’s artificial intelligence still faces two major challenges. One is that, in most industries, data exists in the form of isolated islands. The other is the strengthening of data privacy and security. We propose a possible solution to these challenges: secure federated learning. Beyond the federated-learning framework first proposed by Google in 2016, we introduce a comprehensive secure federated-learning framework, which includes horizontal federated learning, vertical federated learning, and federated transfer learning. We provide definitions, architectures, and applications for the federated-learning framework, and provide a comprehensive survey of existing works on this subject. In addition, we propose building data networks among organizations based on federated mechanisms as an effective solution to allowing knowledge to be shared without compromising user privacy.

...read moreread less

2,593 citations

Proceedings Article•DOI•

Membership Inference Attacks Against Machine Learning Models

[...]

Reza Shokri¹, Marco Stronati², Congzheng Song¹, Vitaly Shmatikov¹•Institutions (2)

Cornell University¹, French Institute for Research in Computer Science and Automation²

22 May 2017

TL;DR: This work quantitatively investigates how machine learning models leak information about the individual data records on which they were trained and empirically evaluates the inference techniques on classification models trained by commercial "machine learning as a service" providers such as Google and Amazon.

...read moreread less

Abstract: We quantitatively investigate how machine learning models leak information about the individual data records on which they were trained. We focus on the basic membership inference attack: given a data record and black-box access to a model, determine if the record was in the model's training dataset. To perform membership inference against a target model, we make adversarial use of machine learning and train our own inference model to recognize differences in the target model's predictions on the inputs that it trained on versus the inputs that it did not train on. We empirically evaluate our inference techniques on classification models trained by commercial "machine learning as a service" providers such as Google and Amazon. Using realistic datasets and classification tasks, including a hospital discharge dataset whose membership is sensitive from the privacy perspective, we show that these models can be vulnerable to membership inference attacks. We then investigate the factors that influence this leakage and evaluate mitigation strategies.

...read moreread less

2,059 citations

Proceedings Article•DOI•

Practical Secure Aggregation for Privacy-Preserving Machine Learning

[...]

Keith Bonawitz¹, Vladimir Ivanov¹, Ben Kreuter¹, Antonio Marcedone², H. Brendan McMahan¹, Sarvar Patel¹, Daniel Ramage¹, Aaron Segal¹, Karn Seth¹ - Show less +5 more•Institutions (2)

Google¹, Cornell University²

30 Oct 2017

TL;DR: In this paper, the authors proposed a secure aggregation of high-dimensional data for federated deep neural networks, which allows a server to compute the sum of large, user-held data vectors from mobile devices in a secure manner without learning each user's individual contribution.

...read moreread less

Abstract: We design a novel, communication-efficient, failure-robust protocol for secure aggregation of high-dimensional data. Our protocol allows a server to compute the sum of large, user-held data vectors from mobile devices in a secure manner (i.e. without learning each user's individual contribution), and can be used, for example, in a federated learning setting, to aggregate user-provided model updates for a deep neural network. We prove the security of our protocol in the honest-but-curious and active adversary settings, and show that security is maintained even if an arbitrarily chosen subset of users drop out at any time. We evaluate the efficiency of our protocol and show, by complexity analysis and a concrete implementation, that its runtime and communication overhead remain low even on large data sets and client pools. For 16-bit input values, our protocol offers $1.73 x communication expansion for 210 users and 220-dimensional vectors, and 1.98 x expansion for 214 users and 224-dimensional vectors over sending data in the clear.

...read moreread less

1,890 citations

Proceedings Article•DOI•

Deep Learning with Differential Privacy

[...]

Martín Abadi¹, Andy Chu¹, Ian Goodfellow, H. Brendan McMahan¹, Ilya Mironov¹, Kunal Talwar¹, Li Zhang¹ - Show less +3 more•Institutions (1)

Google¹

01 Jul 2016-arXiv: Machine Learning

...read moreread less

1,777 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse