Home
/
Authors
/
David Altshuler

Author

David Altshuler

Other affiliations: Vertex Pharmaceuticals, Massachusetts Institute of Technology, Broad Institute ...read more

Bio: David Altshuler is an academic researcher from University of Michigan. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 162, co-authored 345 publications receiving 201782 citations. Previous affiliations of David Altshuler include Vertex Pharmaceuticals & Massachusetts Institute of Technology.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1993
1992

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Functional Investigations of HNF1A Identify Rare Variants as Risk Factors for Type 2 Diabetes in the General Population

[...]

Laeya A. Najmi¹, Ingvild Aukrust², Ingvild Aukrust¹, Jason Flannick³, Janne Molnes¹, Janne Molnes², Noël P. Burtt³, Anders Molven², Anders Molven¹, Leif Groop⁴, David Altshuler⁵, David Altshuler³, Stefan Johansson², Stefan Johansson¹, Lise Bjørkhaug⁶, Lise Bjørkhaug¹, Pål R. Njølstad¹, Pål R. Njølstad² - Show less +14 more•Institutions (6)

University of Bergen¹, Haukeland University Hospital², Massachusetts Institute of Technology³, Lund University⁴, Harvard University⁵, Bergen University College⁶

01 Feb 2017-Diabetes

TL;DR: The results suggest that functional characterization of variants within MODY genes may overcome the limitations of bioinformatics tools for the purposes of presymptomatic diabetes risk prediction in the general population.

...read moreread less

Abstract: Variants in HNF1A encoding hepatocyte nuclear factor 1α (HNF-1A) are associated with maturity-onset diabetes of the young form 3 (MODY 3) and type 2 diabetes. We investigated whether functional classification of HNF1A rare coding variants can inform models of diabetes risk prediction in the general population by analyzing the effect of 27 HNF1A variants identified in well-phenotyped populations (n = 4,115). Bioinformatics tools classified 11 variants as likely pathogenic and showed no association with diabetes risk (combined minor allele frequency [MAF] 0.22%; odds ratio [OR] 2.02; 95% CI 0.73-5.60; P = 0.18). However, a different set of 11 variants that reduced HNF-1A transcriptional activity to <60% of normal (wild-type) activity was strongly associated with diabetes in the general population (combined MAF 0.22%; OR 5.04; 95% CI 1.99-12.80; P = 0.0007). Our functional investigations indicate that 0.44% of the population carry HNF1A variants that result in a substantially increased risk for developing diabetes. These results suggest that functional characterization of variants within MODY genes may overcome the limitations of bioinformatics tools for the purposes of presymptomatic diabetes risk prediction in the general population.

...read moreread less

51 citations

Journal Article•DOI•

Genetic modifiers of EGFR dependence in non-small cell lung cancer

[...]

Tanaz Sharifnia¹, Victor Rusu¹, Federica Piccioni¹, Mukta Bagul¹, Marcin Imielinski¹, Andrew D. Cherniack¹, Chandra Sekhar Pedamallu¹, Bang Wong¹, Frederick H. Wilson¹, Levi A. Garraway¹, David Altshuler¹, Todd R. Golub¹, David E. Root¹, Aravind Subramanian¹, Matthew Meyerson¹ - Show less +11 more•Institutions (1)

Harvard University¹

30 Dec 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: A spectrum of kinase genes whose overexpression can overcome NSCLC cells’ reliance on EGFR are identified and their convergence on the PI3K-AKT and MEK-ERK signaling axes in sustaining EGFR-independent survival is underscored.

...read moreread less

Abstract: Lung adenocarcinomas harboring activating mutations in the epidermal growth factor receptor (EGFR) represent a common molecular subset of non-small cell lung cancer (NSCLC) cases. EGFR mutations predict sensitivity to EGFR tyrosine kinase inhibitors (TKIs) and thus represent a dependency in NSCLCs harboring these alterations, but the genetic basis of EGFR dependence is not fully understood. Here, we applied an unbiased, ORF-based screen to identify genetic modifiers of EGFR dependence in EGFR-mutant NSCLC cells. This approach identified 18 kinase and kinase-related genes whose overexpression can substitute for EGFR in EGFR-dependent PC9 cells, and these genes include seven of nine Src family kinase genes, FGFR1, FGFR2, ITK, NTRK1, NTRK2, MOS, MST1R, and RAF1. A subset of these genes can complement loss of EGFR activity across multiple EGFR-dependent models. Unbiased gene-expression profiling of cells overexpressing EGFR bypass genes, together with targeted validation studies, reveals EGFR-independent activation of the MEK-ERK and phosphoinositide 3-kinase (PI3K)-AKT pathways. Combined inhibition of PI3K-mTOR and MEK restores EGFR dependence in cells expressing each of the 18 EGFR bypass genes. Together, these data uncover a broad spectrum of kinases capable of overcoming dependence on EGFR and underscore their convergence on the PI3K-AKT and MEK-ERK signaling axes in sustaining EGFR-independent survival.

...read moreread less

50 citations

Journal Article•DOI•

Analysis of case–control association studies with known risk variants

[...]

Noah Zaitlen¹, Bogdan Pasaniuc, Nick Patterson, Samuela Pollack, Benjamin F. Voight, Leif Groop, David Altshuler, Brian E. Henderson, Laurence N. Kolonel, Loic Le Marchand, Kevin M. Waters, Christopher A. Haiman, Barbara E. Stranger, Emmanouil T. Dermitzakis, Peter Kraft, Alkes L. Price - Show less +12 more•Institutions (1)

Harvard University¹

01 Jul 2012-Bioinformatics

TL;DR: This work proposes a new conditioning approach, which is based in part on the classical technique of liability threshold modeling, and shows that it outperforms both the no conditioning strategy and the standard conditioning strategy, with a properly controlled false-positive rate.

...read moreread less

Abstract: Motivation: The question of how to best use information from known associated variants when conducting disease association studies has yet to be answered. Some studies compute a marginal P-value for each Several Nucleotide Polymorphisms independently, ignoring previously discovered variants. Other studies include known variants as covariates in logistic regression, but a weakness of this standard conditioning strategy is that it does not account for disease prevalence and non-random ascertainment, which can induce a correlation structure between candidate variants and known associated variants even if the variants lie on different chromosomes. Here, we propose a new conditioning approach, which is based in part on the classical technique of liability threshold modeling. Roughly, this method estimates model parameters for each known variant while accounting for the published disease prevalence from the epidemiological literature. Results: We show via simulation and application to empirical datasets that our approach outperforms both the no conditioning strategy and the standard conditioning strategy, with a properly controlled false-positive rate. Furthermore, in multiple data sets involving diseases of low prevalence, standard conditioning produces a severe drop in test statistics whereas our approach generally performs as well or better than no conditioning. Our approach may substantially improve disease gene discovery for diseases with many known risk variants. Availability: LTSOFT software is available online http://www.hsph.harvard.edu/faculty/alkes-price/software/ Contact:nzaitlen@hsph.harvard.edu; aprice@hsph.harvard.edu Supplementary information: Supplementary data are available at Bioinformatics online.

...read moreread less

46 citations

Patent•

Methods Of Regulating Metabolism And Mitochondrial Function

[...]

Vamsi K. Mootha¹, David Altshuler²•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

14 Jun 2004

TL;DR: In this article, a set of coordinately-regulated genes which regulate oxidative phosphorylation are described. But the authors do not specify how such genes are used to diagnose and diagnose mitochondrial diseases.

...read moreread less

Abstract: The invention relates to novel methods of regulating metabolism and mitochondrial biogenesis. Some aspects of the invention relate to methods of treating or preventing diseases in a patient associated with reduced mitochondrial function, to methods of identifying agents to treat such diseases, and to methods of diagnosing such diseases. Other aspects of the invention relate to a set of coordinately-regulated genes which regulate oxidative phosphorylation.

...read moreread less

44 citations

Journal Article•DOI•

Biases and Reconciliation in Estimates of Linkage Disequilibrium in the Human Genome

[...]

Itsik Pe'er¹, Itsik Pe'er², Yves Chretien², Yves Chretien¹, Paul I.W. de Bakker, Jeffrey C. Barrett³, Mark J. Daly, David Altshuler - Show less +4 more•Institutions (3)

Harvard University¹, Massachusetts Institute of Technology², Wellcome Trust³

01 Apr 2006-American Journal of Human Genetics

TL;DR: Bias in estimations of linkage disequilibrium along the human genome and in the population under study are dissected to guide the understanding of empirical LD surveys and has implications for whole-genome association studies.

...read moreread less

Abstract: Genetic association studies of common disease often rely on linkage disequilibrium (LD) along the human genome and in the population under study. Although understanding the characteristics of this correlation has been the focus of many large-scale surveys (culminating in genomewide haplotype maps), the results of different studies have yielded wide-ranging estimates. Since understanding these differences (and whether they can be reconciled) has important implications for whole-genome association studies, in this article we dissect biases in these estimations that are due to known aspects of study design and analytic methodology. In particular, we document in the empirical data that the long-known complicating effects of allele frequency, marker density, and sample size largely reconcile all large-scale surveys. Two exceptions are an underappraisal of redundancy among single-nucleotide polymorphisms (SNPs) when evaluation is limited to short regions (as in candidate-gene resequencing studies) and an inflation in the extent of LD in HapMap phase I, which is likely due to oversampling of specific haplotypes in the creation of the public SNP map. Understanding these factors can guide the understanding of empirical LD surveys and has implications for genetic association studies.

...read moreread less

43 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
…
47
48
49
50
51
52
53
…
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fast gapped-read alignment with Bowtie 2

[...]

Ben Langmead¹, Steven L. Salzberg¹, Steven L. Salzberg², Steven L. Salzberg³•Institutions (3)

University of Maryland, College Park¹, Johns Hopkins University School of Medicine², Johns Hopkins University³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

37,898 citations

Journal Article•DOI•

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

[...]

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹ - Show less +7 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The Gene Set Enrichment Analysis (GSEA) method as discussed by the authors focuses on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

...read moreread less

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

...read moreread less

34,830 citations

Journal Article•DOI•

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

[...]

Shaun Purcell¹, Shaun Purcell², Benjamin M. Neale¹, Benjamin M. Neale³, Kathe Todd-Brown², Lori Thomas², Manuel A. R. Ferreira², David Bender², David Bender¹, Julian Maller¹, Julian Maller², Pamela Sklar², Pamela Sklar¹, Paul I.W. de Bakker², Paul I.W. de Bakker¹, Mark J. Daly², Mark J. Daly¹, Pak C. Sham⁴ - Show less +14 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², University of London³, University of Hong Kong⁴

01 Sep 2007-American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.

...read moreread less

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.

...read moreread less

26,280 citations

Journal Article•DOI•

Initial sequencing and analysis of the human genome.

[...]

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹ +245 more•Institutions (29)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

...read moreread less

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

...read moreread less

22,269 citations

Journal Article•DOI•

limma powers differential expression analyses for RNA-sequencing and microarray studies

[...]

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth⁵, Gordon K. Smyth¹ - Show less +4 more•Institutions (5)

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015-Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

22,147 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse