Author

Ben Adlam

Other affiliations: Harvard University
Bio: Ben Adlam is an academic researcher from Google. The author has contributed to research in topics including Population and Artificial neural network. The author has an h-index of 15 and has co-authored 36 publications receiving 782 citations. Previous affiliations of Ben Adlam include Harvard University.

Papers
Posted Content
TL;DR: This work identifies underspecification as a key reason for unexpectedly poor real-world behavior of ML models, shows that the problem appears in a wide variety of practical ML pipelines, and argues that underspecification must be explicitly accounted for in modeling pipelines intended for real-world deployment.
Abstract: ML models often exhibit unexpectedly poor behavior when they are deployed in real-world domains. We identify underspecification as a key reason for these failures. An ML pipeline is underspecified when it can return many predictors with equivalently strong held-out performance in the training domain. Underspecification is common in modern ML pipelines, such as those based on deep learning. Predictors returned by underspecified pipelines are often treated as equivalent based on their training domain performance, but we show here that such predictors can behave very differently in deployment domains. This ambiguity can lead to instability and poor model behavior in practice, and is a distinct failure mode from previously identified issues arising from structural mismatch between training and deployment domains. We show that this problem appears in a wide variety of practical ML pipelines, using examples from computer vision, medical imaging, natural language processing, clinical risk prediction based on electronic health records, and medical genomics. Our results show the need to explicitly account for underspecification in modeling pipelines that are intended for real-world deployment in any domain.
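
As a minimal illustration of underspecification, the sketch below (a hypothetical scikit-learn setup, not the paper's actual experiments) trains the same pipeline under several random seeds: validation accuracies come out nearly identical, yet the resulting predictors disagree noticeably once the inputs shift.

```python
# Hypothetical demo of underspecification: same pipeline, different seeds,
# equivalent held-out accuracy, divergent behavior under a synthetic shift.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)
# A crude stand-in for a deployment-domain shift: add noise to the inputs.
X_shift = X_val + np.random.RandomState(1).normal(0.0, 2.0, X_val.shape)

preds = []
for seed in range(5):
    clf = MLPClassifier(hidden_layer_sizes=(64,), max_iter=500,
                        random_state=seed).fit(X_train, y_train)
    print(f"seed {seed}: validation accuracy = {clf.score(X_val, y_val):.3f}")
    preds.append(clf.predict(X_shift))

for seed in range(1, 5):
    disagree = (preds[seed] != preds[0]).mean()
    print(f"seed {seed} vs seed 0: disagreement on shifted inputs = {disagree:.3f}")
```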

374 citations

Journal ArticleDOI
TL;DR: Analysis of the spatial heterogeneity of crowding in China and Italy, together with COVID-19 case data, shows that cities with higher crowding have longer epidemics and higher attack rates after the first epidemic wave; the fitted model predicts that crowded cities worldwide could experience more prolonged epidemics.
Abstract: The coronavirus disease 2019 (COVID-19) pandemic is straining public health systems worldwide, and major non-pharmaceutical interventions have been implemented to slow its spread1-4. During the initial phase of the outbreak, dissemination of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was primarily determined by human mobility from Wuhan, China5,6. Yet empirical evidence on the effect of key geographic factors on local epidemic transmission is lacking7. In this study, we analyzed highly resolved spatial variables in cities, together with case count data, to investigate the role of climate, urbanization and variation in interventions. We show that the degree to which cases of COVID-19 are compressed into a short period of time (peakedness of the epidemic) is strongly shaped by population aggregation and heterogeneity, such that epidemics in crowded cities are more spread over time, and crowded cities have larger total attack rates than less populated cities. Observed differences in the peakedness of epidemics are consistent with a meta-population model of COVID-19 that explicitly accounts for spatial hierarchies. We paired our estimates with globally comprehensive data on human mobility and predict that crowded cities worldwide could experience more prolonged epidemics.
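
As a concrete illustration of the "peakedness" notion above, here is one simple proxy: the share of all cases falling in a fixed window around the epidemic peak. The paper's precise definition may differ, and the epidemics below are synthetic curves, not the study's data.

```python
# Hedged sketch: a simple proxy for epidemic "peakedness" -- the fraction of
# total cases occurring within a fixed-width window centred on the peak day.
import numpy as np

def peakedness(daily_cases, window=14):
    """Fraction of all cases falling within `window` days around the peak."""
    cases = np.asarray(daily_cases, dtype=float)
    peak = int(np.argmax(cases))
    lo, hi = max(0, peak - window // 2), min(len(cases), peak + window // 2)
    return cases[lo:hi].sum() / cases.sum()

# Two synthetic epidemics with equal totals: one sharp, one prolonged.
t = np.arange(120)
sharp = np.exp(-((t - 40) ** 2) / (2 * 5 ** 2))
prolonged = np.exp(-((t - 60) ** 2) / (2 * 25 ** 2))
print(f"sharp: {peakedness(sharp):.2f}, prolonged: {peakedness(prolonged):.2f}")
```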

191 citations

Journal ArticleDOI
19 Jun 2018-eLife
TL;DR: Models show that self-propagating gene drive is best suited to applications, such as malaria prevention, that seek to affect all wild populations of the target species; even the least effective drive systems reported to date are likely to be highly invasive.
Abstract: Gene drive is a genetic engineering technology that can spread a particular suite of genes throughout a population. Among the types of gene drive systems, those based on the CRISPR genome editing technology are predicted to be able to spread genes particularly rapidly. This is because components of the CRISPR system can be tailored to replace alternative copies of a particular gene, ensuring that only the desired version is passed on to offspring. In this way, for example, a gene that prevents mosquitoes from carrying or transmitting the malaria parasite could be introduced to a very large wild population to reduce the incidence of the disease among humans. Gene drives can be “self-propagating” or “self-exhausting”: the former are designed so that they can always spread as long as there are wild organisms around, whereas the latter are expected to lose their ability to spread over time. Self-propagating CRISPR gene drives have been shown to work in controlled populations of fruit flies, mosquitoes and yeast. These experiments happen in a controlled environment in the laboratory, so the organisms edited to have the gene drive elements do not come in contact with susceptible wild organisms. However, if just a few were to escape, the gene drive could theoretically spread quickly outside the laboratory. Noble, Adlam et al. investigated, using mathematical models, whether or not – and how fast – a self-propagating CRISPR-based gene drive would spread if a number of organisms with the gene-drive elements were released into the wild. The models showed that the release of just a few of the edited organisms would result in the gene drive spreading to most populations that interbreed. This happened regardless of the structure of the wild populations or whether a degree of resistance to the drive emerged. As a result, even the smallest breach of a contained trial could lead to significant gene drive spread in the wild. The findings suggest that self-propagating gene drive technologies would be most useful where the invasion of most wild populations of the target species is the intended purpose, rather than a risk to be avoided. As a result, a self-propagating CRISPR-based gene drive could be well suited to spreading among mosquitoes to impede the malaria parasite, provided there were strong international agreements in place. The findings also underline the difficulty of carrying out safe field trials of self-propagating gene drives, and the need for very tight control of laboratories carrying out experiments in this field. Lastly, they highlight the importance of developing and testing the evolutionary stability of self-exhausting gene drives, which could be better contained to local populations.
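
The inheritance bias that powers this spread can be captured in a few lines. The sketch below is a textbook-style deterministic recursion, not the Noble, Adlam et al. model: with germline conversion efficiency c, a drive/wild-type heterozygote transmits the drive allele with probability (1 + c)/2 rather than the Mendelian 1/2, and fitness costs are ignored.

```python
# Minimal deterministic sketch of self-propagating drive spread (illustrative
# only): iterate the drive-allele frequency under random mating, where
# heterozygotes transmit the drive with probability (1 + c) / 2.
def drive_frequency(p0=0.001, c=0.9, generations=25):
    """Drive-allele frequency trajectory; no fitness cost, no resistance."""
    p = p0
    freqs = [p]
    for _ in range(generations):
        # DD parents contribute p**2; converted heterozygotes are biased.
        p = p * p + 2 * p * (1 - p) * (1 + c) / 2
        freqs.append(p)
    return freqs

for gen, p in enumerate(drive_frequency()):
    if gen % 5 == 0:
        print(f"generation {gen:2d}: drive allele frequency = {p:.4f}")
```

Even from a release frequency of 0.1%, the recursion approaches fixation within a few dozen generations, which is the qualitative point about escaped organisms made above.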

134 citations

Proceedings Article
31 Jul 2020
TL;DR: Improved best practices for using NNGP and NT kernels for prediction are developed, including a novel ensembling technique that achieves state-of-the-art results on CIFAR-10 classification for kernels corresponding to each architecture class the authors consider.
Abstract: We perform a careful, thorough, and large scale empirical study of the correspondence between wide neural networks and kernel methods. By doing so, we resolve a variety of open questions related to the study of infinitely wide neural networks. Our experimental results include: kernel methods outperform fully-connected finite-width networks, but underperform convolutional finite width networks; neural network Gaussian process (NNGP) kernels frequently outperform neural tangent (NT) kernels; centered and ensembled finite networks have reduced posterior variance and behave more similarly to infinite networks; weight decay and the use of a large learning rate break the correspondence between finite and infinite networks; the NTK parameterization outperforms the standard parameterization for finite width networks; diagonal regularization of kernels acts similarly to early stopping; floating point precision limits kernel performance beyond a critical dataset size; regularized ZCA whitening improves accuracy; finite network performance depends non-monotonically on width in ways not captured by double descent phenomena; equivariance of CNNs is only beneficial for narrow networks far from the kernel regime. Our experiments additionally motivate an improved layer-wise scaling for weight decay which improves generalization in finite-width networks. Finally, we develop improved best practices for using NNGP and NT kernels for prediction, including a novel ensembling technique. Using these best practices we achieve state-of-the-art results on CIFAR-10 classification for kernels corresponding to each architecture class we consider.
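
The NNGP and NT kernels discussed above are implemented in the open-source neural_tangents library. A plausible minimal usage, based on its public API, looks like the following; the architecture, hyperparameters, and data here are placeholders, not the paper's experimental setup.

```python
# Hedged sketch: closed-form NNGP / NTK predictions for an infinitely wide
# fully-connected network, via the neural_tangents library.
import jax.numpy as jnp
from jax import random
import neural_tangents as nt
from neural_tangents import stax

# Infinite-width network: stax.serial returns (init_fn, apply_fn, kernel_fn).
init_fn, apply_fn, kernel_fn = stax.serial(
    stax.Dense(512), stax.Relu(),
    stax.Dense(512), stax.Relu(),
    stax.Dense(1),
)

key = random.PRNGKey(0)
x_train = random.normal(key, (100, 32))
y_train = jnp.sin(x_train.sum(axis=1, keepdims=True))  # toy regression targets
x_test = random.normal(random.PRNGKey(1), (20, 32))

# diag_reg adds diagonal regularization, which (as noted above) acts
# similarly to early stopping.
predict_fn = nt.predict.gradient_descent_mse_ensemble(
    kernel_fn, x_train, y_train, diag_reg=1e-4)
y_nngp = predict_fn(x_test=x_test, get='nngp')  # Bayesian infinite-width GP
y_ntk = predict_fn(x_test=x_test, get='ntk')    # gradient-descent-trained limit
print(y_nngp.shape, y_ntk.shape)
```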

125 citations

Journal ArticleDOI
TL;DR: This work proves that only bounded amplifiers exist for adversarial placement (that is, for arbitrary initial conditions) and constructs population structures that amplify for all mutational events that arise through reproduction, uniformly at random, or through some combination of the two.
Abstract: When a new mutant arises in a population, there is a probability it outcompetes the residents and fixes. The structure of the population can affect this fixation probability. Suppressing population structures reduce the difference between two competing variants, while amplifying population structures enhance the difference. Suppressors are ubiquitous and easy to construct, but amplifiers for the large population limit are more elusive and only a few examples have been discovered. Whether or not a population structure is an amplifier of selection depends on the probability distribution for the placement of the invading mutant. First, we prove that there exist only bounded amplifiers for adversarial placement—that is, for arbitrary initial conditions. Next, we show that the Star population structure, which is known to amplify for mutants placed uniformly at random, does not amplify for mutants that arise through reproduction and are therefore placed proportional to the temperatures of the vertices. Finally, we construct population structures that amplify for all mutational events that arise through reproduction, uniformly at random, or through some combination of the two.
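
The star structure's amplification for uniformly placed mutants is easy to probe numerically. Below is a hedged Monte Carlo sketch of Moran birth-death dynamics on a star graph; the parameters are illustrative, the estimate is noisy at this scale, and this is a simulation aid, not the paper's proofs.

```python
# Hedged simulation sketch: estimate the fixation probability of a fitness-r
# mutant under Moran birth-death dynamics on a star graph, with the mutant
# placed uniformly at random.
import random

def fixation_star(n_leaves=20, r=2.0, trials=500):
    fixed = 0
    for _ in range(trials):
        # state[0] is the hub; True marks mutants.
        state = [False] * (n_leaves + 1)
        state[random.randrange(n_leaves + 1)] = True  # uniform placement
        while 0 < sum(state) < n_leaves + 1:
            weights = [r if m else 1.0 for m in state]
            parent = random.choices(range(n_leaves + 1), weights=weights)[0]
            # Offspring replaces a random neighbour: leaves only touch the hub.
            child = random.randrange(1, n_leaves + 1) if parent == 0 else 0
            state[child] = state[parent]
        fixed += all(state)
    return fixed / trials

# For large stars this exceeds the well-mixed value of roughly 1 - 1/r
# (about 0.5 for r = 2), trending toward the classic amplifier prediction
# of roughly 1 - 1/r**2.
print(fixation_star())
```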

62 citations


Cited by
Proceedings ArticleDOI
22 Jan 2006
TL;DR: Some of the major results in random graphs and some of the more challenging open problems are reviewed, covering algorithmic and structural questions and touching on newer models, including those related to the WWW.
Abstract: We will review some of the major results in random graphs and some of the more challenging open problems. We will cover algorithmic and structural questions. We will touch on newer models, including those related to the WWW.

7,116 citations

01 Jan 2020
TL;DR: Prolonged viral shedding provides the rationale for a strategy of isolation of infected patients and optimal antiviral interventions in the future.
Abstract: Summary Background Since December, 2019, Wuhan, China, has experienced an outbreak of coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Epidemiological and clinical characteristics of patients with COVID-19 have been reported but risk factors for mortality and a detailed clinical course of illness, including viral shedding, have not been well described. Methods In this retrospective, multicentre cohort study, we included all adult inpatients (≥18 years old) with laboratory-confirmed COVID-19 from Jinyintan Hospital and Wuhan Pulmonary Hospital (Wuhan, China) who had been discharged or had died by Jan 31, 2020. Demographic, clinical, treatment, and laboratory data, including serial samples for viral RNA detection, were extracted from electronic medical records and compared between survivors and non-survivors. We used univariable and multivariable logistic regression methods to explore the risk factors associated with in-hospital death. Findings 191 patients (135 from Jinyintan Hospital and 56 from Wuhan Pulmonary Hospital) were included in this study, of whom 137 were discharged and 54 died in hospital. 91 (48%) patients had a comorbidity, with hypertension being the most common (58 [30%] patients), followed by diabetes (36 [19%] patients) and coronary heart disease (15 [8%] patients). Multivariable regression showed increasing odds of in-hospital death associated with older age (odds ratio 1·10, 95% CI 1·03–1·17, per year increase; p=0·0043), higher Sequential Organ Failure Assessment (SOFA) score (5·65, 2·61–12·23), and d-dimer greater than 1 μg/mL on admission. Interpretation The potential risk factors of older age, high SOFA score, and d-dimer greater than 1 μg/mL could help clinicians to identify patients with poor prognosis at an early stage. Prolonged viral shedding provides the rationale for a strategy of isolation of infected patients and optimal antiviral interventions in the future. Funding Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences; National Science Grant for Distinguished Young Scholars; National Key Research and Development Program of China; The Beijing Science and Technology Project; and Major Projects of National Science and Technology on New Drug Creation and Development.
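
For illustration, a multivariable logistic regression of the kind described in the Methods can be sketched as follows. The data, variable names, and coefficients below are entirely synthetic stand-ins, not the study's patient records.

```python
# Hedged sketch: multivariable logistic regression for in-hospital death on
# synthetic data; exponentiated coefficients are odds ratios per unit change.
import numpy as np
import pandas as pd
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 191  # matches the cohort size above; everything else is made up
df = pd.DataFrame({
    "age": rng.normal(60, 12, n),
    "sofa": rng.poisson(3, n).astype(float),
    "d_dimer_gt_1": rng.integers(0, 2, n).astype(float),
})
# Simulate outcomes from an assumed logistic model.
logit = -8 + 0.1 * df.age + 0.6 * df.sofa + 1.5 * df.d_dimer_gt_1
df["death"] = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)

exog = sm.add_constant(df[["age", "sofa", "d_dimer_gt_1"]])
fit = sm.Logit(df["death"], exog).fit(disp=0)
print(np.exp(fit.params))  # odds ratios, analogous in form to those reported
```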

4,408 citations

Journal ArticleDOI
08 Jun 2020-Nature
TL;DR: Anti-contagion policies significantly and substantially slowed the growth of infections; across the six countries studied, interventions are estimated to have prevented or delayed on the order of 62 million confirmed cases, corresponding to averting roughly 530 million total infections.
Abstract: Governments around the world are responding to the novel coronavirus (COVID-19) pandemic1 with unprecedented policies designed to slow the growth rate of infections. Many actions, such as closing schools and restricting populations to their homes, impose large and visible costs on society, but their benefits cannot be directly observed and are currently understood only through process-based simulations2–4. Here, we compile new data on 1,717 local, regional, and national non-pharmaceutical interventions deployed in the ongoing pandemic across localities in China, South Korea, Italy, Iran, France, and the United States (US). We then apply reduced-form econometric methods, commonly used to measure the effect of policies on economic growth5,6, to empirically evaluate the effect that these anti-contagion policies have had on the growth rate of infections. In the absence of policy actions, we estimate that early infections of COVID-19 exhibit exponential growth rates of roughly 38% per day. We find that anti-contagion policies have significantly and substantially slowed this growth. Some policies have different impacts on different populations, but we obtain consistent evidence that the policy packages now deployed are achieving large, beneficial, and measurable health outcomes. We estimate that across these six countries, interventions prevented or delayed on the order of 62 million confirmed cases, corresponding to averting roughly 530 million total infections. These findings may help inform whether or when these policies should be deployed, intensified, or lifted, and they can support decision-making in the other 180+ countries where COVID-19 has been reported7.
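
The reduced-form idea, regressing infection growth rates on policy indicators, can be sketched on synthetic data. The specification below is a toy OLS with one policy dummy, far simpler than the study's econometric design, and the numbers are invented to echo the abstract's magnitudes.

```python
# Hedged sketch: estimate how much a policy slows daily infection growth by
# regressing growth rates of log cases on a policy indicator (synthetic data).
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
days = 60
policy = (np.arange(days) >= 30).astype(float)   # policy switches on at day 30
true_growth = 0.38 - 0.25 * policy               # per-day growth, slowed by policy
log_cases = np.cumsum(true_growth + rng.normal(0, 0.03, days))
growth = np.diff(log_cases)                      # observed daily growth rates

X = sm.add_constant(policy[1:])
fit = sm.OLS(growth, X).fit()
print(fit.params)  # intercept ~0.38 (no-policy growth), slope ~-0.25 (policy effect)
```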

1,095 citations

Journal ArticleDOI
TL;DR: This paper provides a comprehensive survey of the most important aspects of DL, including enhancements recently added to the field, and presents the challenges and suggested solutions to help researchers understand the existing research gaps.
Abstract: In the last few years, the deep learning (DL) computing paradigm has been deemed the Gold Standard in the machine learning (ML) community. Moreover, it has gradually become the most widely used computational approach in the field of ML, achieving outstanding results on several complex cognitive tasks, matching or even beating human performance. One of the benefits of DL is the ability to learn from massive amounts of data. The DL field has grown fast in the last few years and has been extensively used to successfully address a wide range of traditional applications. More importantly, DL has outperformed well-known ML techniques in many domains, e.g., cybersecurity, natural language processing, bioinformatics, robotics and control, and medical information processing, among many others. Although several works have reviewed the state of the art in DL, each of them tackled only one aspect, leading to an overall lack of knowledge about the field. Therefore, in this contribution, we propose a more holistic approach in order to provide a more suitable starting point from which to develop a full understanding of DL. Specifically, this review attempts to provide a more comprehensive survey of the most important aspects of DL, including those enhancements recently added to the field. In particular, this paper outlines the importance of DL and presents the types of DL techniques and networks. It then presents convolutional neural networks (CNNs), the most utilized DL network type, and describes the development of CNN architectures together with their main features, starting with the AlexNet network and closing with the High-Resolution network (HR.Net). Finally, we present the challenges and suggested solutions to help researchers understand the existing research gaps, followed by a list of the major DL applications. Computational tools including FPGAs, GPUs, and CPUs are summarized along with a description of their influence on DL. The paper ends with an evolution matrix, benchmark datasets, and a summary and conclusion.

1,084 citations