David Altshuler

Other affiliations: Vertex Pharmaceuticals, Massachusetts Institute of Technology, Broad Institute ...

Bio: David Altshuler is an academic researcher from the University of Michigan. The author has contributed to research on the topics Genome-wide association study and Population. The author has an h-index of 162 and has co-authored 345 publications that have received 201,782 citations. Previous affiliations of David Altshuler include Vertex Pharmaceuticals and Massachusetts Institute of Technology.

Papers published on a yearly basis (chart not reproduced; years shown: 1992, 1993, and 1998 to 2023)

Papers

Plasma HDL cholesterol and risk of myocardial infarction: a mendelian randomisation study

Benjamin F. Voight, Gina M. Peloso, Marju Orho-Melander, Ruth Frikke-Schmidt, Maja Barbalić, Majken K. Jensen, George Hindy, Hilma Holm, Eric L. Ding, Toby Johnson, Heribert Schunkert, Nilesh J. Samani, Robert Clarke, Jemma C. Hopewell, John F. Thompson, Mingyao Li, Gudmar Thorleifsson, Christopher Newton-Cheh, Kiran Musunuru, James P. Pirruccello, Danish Saleheen, Li Chen, Alexandre F.R. Stewart, Arne Schillert, Unnur Thorsteinsdottir, Gudmundur Thorgeirsson, Sonia S. Anand, James C. Engert, Thomas M. Morgan, John A. Spertus, Monika Stoll, Klaus Berger, Nicola Martinelli, Domenico Girelli, Pascal P. McKeown, Christopher Patterson, Stephen E. Epstein, Joseph M. Devaney, Mary-Susan Burnett, Vincent Mooser, Samuli Ripatti, Ida Surakka, Markku S. Nieminen, Juha Sinisalo, Marja-Liisa Lokki, Markus Perola, Aki S. Havulinna, Ulf de Faire, Bruna Gigante, Erik Ingelsson, Tanja Zeller, Philipp S. Wild, Paul I.W. de Bakker, Olaf H. Klungel, Anke-Hilse Maitland-van der Zee, Bas J M Peters, Anthonius de Boer, Diederick E. Grobbee, Pieter Willem Kamphuisen, Vera H.M. Deneer, Clara C. Elbers, N. Charlotte Onland-Moret, Marten H. Hofker, Cisca Wijmenga, W. M. Monique Verschuren, Jolanda M. A. Boer, Yvonne T. van der Schouw, Asif Rasheed, Philippe M. Frossard, Serkalem Demissie, Cristen J. Willer, Ron Do, Jose M. Ordovas, Gonçalo R. Abecasis, Michael Boehnke, Karen L. Mohlke, Mark J. Daly, Candace Guiducci, Noël P. Burtt, Aarti Surti, Elena Gonzalez, Shaun Purcell, Stacey Gabriel, Jaume Marrugat, John F. Peden, Jeanette Erdmann, Patrick Diemert, Christina Willenborg, Inke R. Koenig, Marcus Fischer, Christian Hengstenberg, Andreas Ziegler, Ian Buysschaert, Diether Lambrechts, Frans Van de Werf, Keith A.A. Fox, Nour Eddine El Mokhtari, Diana Rubin, Juergen Schrezenmeir, Stefan Schreiber, Arne S. Schaefer, John Danesh, Stefan Blankenberg, Robert Roberts, Ruth McPherson, Hugh Watkins, Alistair S. Hall, Kim Overvad, Eric B. Rimm, Eric Boerwinkle, Anne Tybjærg-Hansen, L. Adrienne Cupples, Muredach P. Reilly, Olle Melander, Pier Mannuccio Mannucci, Diego Ardissino, David S. Siscovick, Roberto Elosua, Kari Stefansson, Christopher J. O'Donnell, Veikko Salomaa, Daniel J. Rader, Leena Peltonen, Stephen M. Schwartz, David Altshuler, Sekar Kathiresan +122 more

01 Jan 2012

TL;DR: Mendelian randomisation analyses challenge the concept that raising of plasma HDL cholesterol will uniformly translate into reductions in risk of myocardial infarction.

Abstract: Background: High plasma HDL cholesterol is associated with reduced risk of myocardial infarction, but whether this association is causal is unclear. Exploiting the fact that genotypes are randomly assigned at meiosis, are independent of non-genetic confounding, and are unmodified by disease processes, mendelian randomisation can be used to test the hypothesis that the association of a plasma biomarker with disease is causal.

Methods: We performed two mendelian randomisation analyses. First, we used as an instrument a single nucleotide polymorphism (SNP) in the endothelial lipase gene (LIPG Asn396Ser) and tested this SNP in 20 studies (20 913 myocardial infarction cases, 95 407 controls). Second, we used as an instrument a genetic score consisting of 14 common SNPs that exclusively associate with HDL cholesterol and tested this score in up to 12 482 cases of myocardial infarction and 41 331 controls. As a positive control, we also tested a genetic score of 13 common SNPs exclusively associated with LDL cholesterol.

Findings: Carriers of the LIPG 396Ser allele (2·6% frequency) had higher HDL cholesterol (0·14 mmol/L higher, p=8×10⁻¹³) but similar levels of other lipid and non-lipid risk factors for myocardial infarction compared with non-carriers. This difference in HDL cholesterol is expected to decrease risk of myocardial infarction by 13% (odds ratio [OR] 0·87, 95% CI 0·84–0·91). However, we noted that the 396Ser allele was not associated with risk of myocardial infarction (OR 0·99, 95% CI 0·88–1·11, p=0·85). From observational epidemiology, an increase of 1 SD in HDL cholesterol was associated with reduced risk of myocardial infarction (OR 0·62, 95% CI 0·58–0·66). However, a 1 SD increase in HDL cholesterol due to genetic score was not associated with risk of myocardial infarction (OR 0·93, 95% CI 0·68–1·26, p=0·63). For LDL cholesterol, the estimate from observational epidemiology (a 1 SD increase in LDL cholesterol associated with OR 1·54, 95% CI 1·45–1·63) was concordant with that from genetic score (OR 2·13, 95% CI 1·69–2·69, p=2×10⁻¹⁰).

Interpretation: Some genetic mechanisms that raise plasma HDL cholesterol do not seem to lower risk of myocardial infarction. These data challenge the concept that raising of plasma HDL cholesterol will uniformly translate into reductions in risk of myocardial infarction.

Funding: US National Institutes of Health, The Wellcome Trust, European Union, British Heart Foundation, and the German Federal Ministry of Education and Research.
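The comparison in this abstract, between the risk reduction expected from a change in HDL cholesterol and the risk actually observed in carriers of HDL-raising variants, can be illustrated with a short sketch. The abstract does not state how the expected odds ratio was derived, so the log-linear rescaling below is an assumption of this sketch, and the function names `expected_odds_ratio` and `within_interval` are hypothetical. Only the numbers passed in at the bottom come from the abstract.

```python
import math


def expected_odds_ratio(or_per_sd: float, shift_in_sd: float) -> float:
    """Rescale a per-SD odds ratio to a biomarker shift of `shift_in_sd` SDs.

    Assumes log-odds change linearly with the biomarker level
    (an assumption of this sketch, not a method stated in the abstract).
    """
    if or_per_sd <= 0:
        raise ValueError("odds ratio must be positive")
    return math.exp(math.log(or_per_sd) * shift_in_sd)


def within_interval(value: float, low: float, high: float) -> bool:
    """Return True when `value` lies inside the closed interval [low, high]."""
    return low <= value <= high


# Observational estimate quoted in the abstract: OR 0.62 per 1 SD of HDL cholesterol.
observational_or = 0.62
# 95% CI of the genetic-score estimate for a 1 SD increase, quoted in the abstract.
genetic_ci = (0.68, 1.26)

expected = expected_odds_ratio(observational_or, 1.0)
print(round(expected, 2))
print(within_interval(expected, *genetic_ci))  # False: 0.62 lies below the interval
```

The observational odds ratio falling outside the genetic confidence interval is the kind of discordance the abstract reports for HDL cholesterol.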

1,550 citations

Journal Article

Large-scale association analysis identifies new risk loci for coronary artery disease

Panos Deloukas¹, Stavroula Kanoni¹, Christina Willenborg², Martin Farrall³ +201 more•Institutions (64)

01 Jan 2013-Nature Genetics

TL;DR: An association analysis in CAD cases and controls identifies 15 loci reaching genome-wide significance, taking the number of susceptibility loci for CAD to 46, and a further 104 independent variants strongly associated with CAD at a 5% false discovery rate (FDR).

Abstract: Coronary artery disease (CAD) is the commonest cause of death. Here, we report an association analysis in 63,746 CAD cases and 130,681 controls identifying 15 loci reaching genome-wide significance, taking the number of susceptibility loci for CAD to 46, and a further 104 independent variants (r² < 0.2) strongly associated with CAD at a 5% false discovery rate (FDR). Together, these variants explain approximately 10.6% of CAD heritability. Of the 46 genome-wide significant lead SNPs, 12 show a significant association with a lipid trait, and 5 show a significant association with blood pressure, but none is significantly associated with diabetes. Network analysis with 233 candidate genes (loci at 10% FDR) generated 5 interaction networks comprising 85% of these putative genes involved in CAD. The four most significant pathways mapping to these networks are linked to lipid metabolism and inflammation, underscoring the causal role of these activities in the genetic etiology of CAD. Our study provides insights into the genetic basis of CAD and identifies key biological pathways.

1,518 citations

Journal Article

Association between Microdeletion and Microduplication at 16p11.2 and Autism

Lauren A. Weiss, Yiping Shen¹, Joshua M. Korn¹,², Dan E. Arking, David T. Miller¹, Ragnheidur Fossdal³, Evald Saemundsen, Hreinn Stefansson³, Todd Green¹,², Orah S. Platt¹, Douglas M. Ruderfer¹,², Christopher A. Walsh¹, David Altshuler¹,², Aravinda Chakravarti¹,², Rudolph E. Tanzi¹, Kari Stefansson³, Susan L. Santangelo¹, James F. Gusella¹,², Pamela Sklar¹,², Bai-Lin Wu¹, Mark J. Daly¹,² +25 more•Institutions (3)

Harvard University¹, Massachusetts Institute of Technology², deCODE genetics³

14 Feb 2008-The New England Journal of Medicine

TL;DR: A novel, recurrent microdeletion and a reciprocal microduplication that carry substantial susceptibility to autism and appear to account for approximately 1% of cases are identified.

Abstract: Background: Autism spectrum disorder is a heritable developmental disorder in which chromosomal abnormalities are thought to play a role.

Methods: As a first component of a genomewide association study of families from the Autism Genetic Resource Exchange (AGRE), we used two novel algorithms to search for recurrent copy-number variations in genotype data from 751 multiplex families with autism. Specific recurrent de novo events were further evaluated in clinical-testing data from Children's Hospital Boston and in a large population study in Iceland.

Results: Among the AGRE families, we observed five instances of a de novo deletion of 593 kb on chromosome 16p11.2. Using comparative genomic hybridization, we observed the identical deletion in 5 of 512 children referred to Children's Hospital Boston for developmental delay, mental retardation, or suspected autism spectrum disorder, as well as in 3 of 299 persons with autism in an Icelandic population; the deletion was also carried by 2 of 18,834 unscreened Icelandic control subjects. The reciprocal duplication of this region occurred in 7 affected persons in AGRE families and 4 of the 512 children from Children's Hospital Boston. The duplication also appeared to be a high-penetrance risk factor.

Conclusions: We have identified a novel, recurrent microdeletion and a reciprocal microduplication that carry substantial susceptibility to autism and appear to account for approximately 1% of cases. We did not identify other regions with similar aggregations of large de novo mutations.

1,480 citations

Journal Article

Genetic Mapping in Human Disease

David Altshuler, Mark J. Daly¹,², Eric S. Lander•Institutions (2)

Harvard University¹, Broad Institute²

07 Nov 2008-Science

TL;DR: The intellectual foundations of genetic mapping of Mendelian and complex traits in humans are discussed, lessons emerging from linkage analysis of Mendelian diseases and genome-wide association studies of common diseases are examined, and the questions and challenges that lie ahead are considered.

Abstract: Genetic mapping provides a powerful approach to identify genes and biological processes underlying any trait influenced by inheritance, including human diseases. We discuss the intellectual foundations of genetic mapping of Mendelian and complex traits in humans, examine lessons emerging from linkage analysis of Mendelian diseases and genome-wide association studies of common diseases, and discuss questions and challenges that lie ahead.

1,421 citations

Journal Article

Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans.

Sekar Kathiresan¹,², Olle Melander³, Candace Guiducci¹, Aarti Surti¹, Noël P. Burtt¹, Mark J. Rieder⁴, Gregory M. Cooper⁴, Charlotta Roos³, Benjamin F. Voight¹,², Aki S. Havulinna, Björn Wahlstrand⁵, Thomas Hedner⁵, Dolores Corella⁶, E. Shyong Tai⁷, Jose M. Ordovas⁸, Göran Berglund³, Erkki Vartiainen, Pekka Jousilahti, Bo Hedblad³, Marja-Riitta Taskinen⁹, Christopher Newton-Cheh¹,², Veikko Salomaa, Leena Peltonen, Leif Groop³,⁹, David Altshuler, Marju Orho-Melander³ +26 more•Institutions (9)

Massachusetts Institute of Technology¹, Harvard University², Lund University³, University of Washington⁴, Sahlgrenska University Hospital⁵, University of Valencia⁶, Singapore General Hospital⁷, United States Department of Agriculture⁸, University of Helsinki⁹

13 Jan 2008-Nature Genetics

TL;DR: Using genome-wide association data from three studies and targeted replication association analyses in up to 18,554 independent participants, it is shown that common SNPs at 18 loci are reproducibly associated with concentrations of low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, and/or triglycerides.

Abstract: Blood concentrations of lipoproteins and lipids are heritable risk factors for cardiovascular disease. Using genome-wide association data from three studies (n = 8,816 that included 2,758 individuals from the Diabetes Genetics Initiative specific to the current paper as well as 1,874 individuals from the FUSION study of type 2 diabetes and 4,184 individuals from the SardiNIA study of aging-associated variables reported in a companion paper in this issue) and targeted replication association analyses in up to 18,554 independent participants, we show that common SNPs at 18 loci are reproducibly associated with concentrations of low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, and/or triglycerides. Six of these loci are new (P < 5 × 10⁻⁸ for each new locus). Of the six newly identified chromosomal regions, two were associated with LDL cholesterol (1p13 near CELSR2, PSRC1 and SORT1 and 19p13 near CILP2 and PBX4), one with HDL cholesterol (1q42 in GALNT2) and five with triglycerides (7q11 near TBL2 and MLXIPL, 8q24 near TRIB1, 1q42 in GALNT2, 19p13 near CILP2 and PBX4 and 1p31 near ANGPTL3). At 1p13, the LDL-associated SNP was also strongly correlated with CELSR2, PSRC1, and SORT1 transcript levels in human liver, and a proxy for this SNP was recently shown to affect risk for coronary artery disease. Understanding the molecular, cellular and clinical consequences of the newly identified loci may inform therapy and clinical care.

1,380 citations

Cited by

Journal Article

Fast gapped-read alignment with Bowtie 2

Ben Langmead¹, Steven L. Salzberg¹,²,³•Institutions (3)

University of Maryland, College Park¹, Johns Hopkins University School of Medicine², Johns Hopkins University³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal Article

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹ +7 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The Gene Set Enrichment Analysis (GSEA) method interprets gene expression data by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.
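The abstract describes scoring gene sets rather than single genes but does not spell out the statistic. As a rough illustration only, the sketch below computes an unweighted running-sum enrichment score over a ranked gene list, in the style of a Kolmogorov-Smirnov statistic. This is a simplification chosen for this sketch: it ignores the correlation weighting and permutation-based significance testing that the published method uses, and the function name `enrichment_score` and the toy gene labels are hypothetical.

```python
def enrichment_score(ranked_genes, gene_set):
    """Unweighted running-sum enrichment score for `gene_set` in `ranked_genes`.

    Walk down the ranked list, stepping up at each gene that belongs to the
    set and down at each gene that does not. The score is the running-sum
    value with the largest absolute deviation from zero.
    """
    members = set(gene_set)
    total = len(ranked_genes)
    n_hits = sum(1 for gene in ranked_genes if gene in members)
    if n_hits == 0 or n_hits == total:
        raise ValueError("gene set must cover some, but not all, ranked genes")

    hit_step = 1.0 / n_hits             # each hit moves the sum up
    miss_step = 1.0 / (total - n_hits)  # each miss moves the sum down

    running = 0.0
    best = 0.0
    for gene in ranked_genes:
        running += hit_step if gene in members else -miss_step
        if abs(running) > abs(best):
            best = running
    return best


# Toy ranked list (most to least differentially expressed); labels are made up.
ranking = ["A", "B", "C", "D"]
print(enrichment_score(ranking, {"A", "B"}))  # set concentrated at the top
print(enrichment_score(ranking, {"C", "D"}))  # set concentrated at the bottom
```

A set clustered at the top of the ranking yields a positive score, and a set clustered at the bottom yields a negative one.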

34,830 citations

Journal Article

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

Shaun Purcell¹,², Benjamin M. Neale¹,³, Kathe Todd-Brown², Lori Thomas², Manuel A. R. Ferreira², David Bender¹,², Julian Maller¹,², Pamela Sklar¹,², Paul I.W. de Bakker¹,², Mark J. Daly¹,², Pak C. Sham⁴ +14 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², University of London³, University of Hong Kong⁴

01 Sep 2007-American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes its five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. It focuses on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies.

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
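The identity-by-state information mentioned in this abstract can be illustrated with a small sketch. The code below is not PLINK's implementation. It assumes genotypes are coded as allele counts (0, 1, or 2 copies of one allele) with `None` for missing calls, and the function name `mean_ibs` is hypothetical. At each SNP two individuals share 0, 1, or 2 alleles identical by state, which for this coding equals `2 - abs(a - b)`.

```python
def mean_ibs(genotypes_a, genotypes_b):
    """Proportion of alleles shared identical by state between two individuals.

    Genotypes are allele counts (0, 1, 2), with None marking a missing call.
    SNPs where either individual is missing are skipped.
    """
    shared = 0
    counted = 0
    for a, b in zip(genotypes_a, genotypes_b):
        if a is None or b is None:
            continue
        shared += 2 - abs(a - b)  # 0, 1, or 2 alleles shared at this SNP
        counted += 1
    if counted == 0:
        raise ValueError("no SNPs with calls in both individuals")
    return shared / (2 * counted)


# Toy genotype vectors; values are made up for illustration.
print(mean_ibs([0, 1, 2], [0, 1, 2]))     # identical genotypes
print(mean_ibs([0, 0], [2, 2]))           # opposite homozygotes
print(mean_ibs([0, 1, None], [1, 1, 2]))  # third SNP skipped as missing
```

Averaging such pairwise values across many SNPs gives one simple summary of how similar two genomes are, which is the kind of quantity that can feed into checks for population stratification.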

26,280 citations

Journal Article

Initial sequencing and analysis of the human genome.

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹ +245 more•Institutions (29)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal Article

limma powers differential expression analyses for RNA-sequencing and microarray studies

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth¹,⁵ +4 more•Institutions (5)

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015-Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

22,147 citations
