Home
/
Authors
/
David Altshuler

Author

David Altshuler

Other affiliations: Vertex Pharmaceuticals, Massachusetts Institute of Technology, Broad Institute ...read more

Bio: David Altshuler is an academic researcher from University of Michigan. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 162, co-authored 345 publications receiving 201782 citations. Previous affiliations of David Altshuler include Vertex Pharmaceuticals & Massachusetts Institute of Technology.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1993
1992

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants

[...]

Wenqing Fu¹, Timothy D. O’Connor¹, Goo Jun², Hyun Min Kang², Gonçalo R. Abecasis², Suzanne M. Leal³, Stacey Gabriel⁴, Mark J. Rieder¹, David Altshuler⁴, Jay Shendure¹, Deborah A. Nickerson¹, Michael J. Bamshad¹, Joshua M. Akey¹ - Show less +9 more•Institutions (4)

University of Washington¹, University of Michigan², Baylor College of Medicine³, Broad Institute⁴

10 Jan 2013-Nature

TL;DR: The results better delimit the historical details of human protein-coding variation, show the profound effect of recent human history on the burden of deleterious SNVs segregating in contemporary populations, and provide important practical information that can be used to prioritize variants in disease-gene discovery.

...read moreread less

Abstract: Establishing the age of each mutation segregating in contemporary human populations is important to fully understand our evolutionary history and will help to facilitate the development of new approaches for disease-gene discovery. Large-scale surveys of human genetic variation have reported signatures of recent explosive population growth, notable for an excess of rare genetic variants, suggesting that many mutations arose recently. To more quantitatively assess the distribution of mutation ages, we resequenced 15,336 genes in 6,515 individuals of European American and African American ancestry and inferred the age of 1,146,401 autosomal single nucleotide variants (SNVs). We estimate that approximately 73% of all protein-coding SNVs and approximately 86% of SNVs predicted to be deleterious arose in the past 5,000-10,000 years. The average age of deleterious SNVs varied significantly across molecular pathways, and disease genes contained a significantly higher proportion of recently arisen deleterious SNVs than other genes. Furthermore, European Americans had an excess of deleterious variants in essential and Mendelian disease genes compared to African Americans, consistent with weaker purifying selection due to the Out-of-Africa dispersal. Our results better delimit the historical details of human protein-coding variation, show the profound effect of recent human history on the burden of deleterious SNVs segregating in contemporary populations, and provide important practical information that can be used to prioritize variants in disease-gene discovery.

...read moreread less

934 citations

Journal Article•DOI•

Loss-of-function mutations in APOC3, triglycerides, and coronary disease

[...]

Jacy R Crosby¹, Gina M. Peloso², Gina M. Peloso³, Paul L. Auer⁴, David R. Crosslin⁵, Nathan O. Stitziel⁶, Leslie A. Lange⁷, Yingchang Lu⁸, Zheng-Zheng Tang⁷, He Zhang⁹, George Hindy¹⁰, Nicholas G. D. Masca¹¹, Kathleen Stirrups¹², Stavroula Kanoni¹², Ron Do², Ron Do³, Goo Jun⁹, Youna Hu⁹, Hyun Min Kang⁹, Chenyi Xue⁹, Anuj Goel¹³, Martin Farrall¹³, Stefano Duga¹⁴, Pier Angelica Merlini, Rosanna Asselta¹⁴, Domenico Girelli¹⁵, Oliviero Olivieri¹⁵, Nicola Martinelli¹⁵, Wu Yin¹⁶, Dermot F. Reilly¹⁶, Elizabeth K. Speliotes⁹, Caroline S. Fox¹⁷, Kristian Hveem¹⁸, Oddgeir L. Holmen¹⁹, Majid Nikpay²⁰, Deborah N. Farlow², Themistocles L. Assimes²¹, Nora Franceschini⁷, Jennifer G. Robinson²², Kari E. North⁷, Lisa W. Martin²³, Mark A. DePristo², Namrata Gupta², Stefan A. Escher¹⁰, Jan-Håkan Jansson²⁴, Natalie R. van Zuydam²⁵, Colin N. A. Palmer²⁵, Nicholas J. Wareham²⁶, Werner Koch²⁷, Thomas Meitinger²⁷, Annette Peters, Wolfgang Lieb²⁸, Raimund Erbel, Inke R. König²⁹, Jochen Kruppa²⁹, Franziska Degenhardt³⁰, Omri Gottesman⁸, Erwin P. Bottinger⁸, Christopher J. O'Donnell¹⁷, Bruce M. Psaty⁵, Bruce M. Psaty³¹, Christie M. Ballantyne³², Christie M. Ballantyne³³, Gonçalo R. Abecasis⁹, Jose M. Ordovas³⁴, Jose M. Ordovas³⁵, Olle Melander¹⁰, Hugh Watkins¹³, Marju Orho-Melander¹⁰, Diego Ardissino, Ruth J. F. Loos⁸, Ruth McPherson²⁰, Cristen J. Willer⁹, Jeanette Erdmann²⁹, Alistair S. Hall³⁶, Nilesh J. Samani¹¹, Panos Deloukas³⁷, Panos Deloukas³⁸, Panos Deloukas¹², Heribert Schunkert²⁷, James G. Wilson³⁹, Charles Kooperberg⁴⁰, Stephen S. Rich⁴¹, Russell P. Tracy⁴², Danyu Lin⁷, David Altshuler³, David Altshuler², Stacey Gabriel², Deborah A. Nickerson⁵, Gail P. Jarvik⁵, L. Adrienne Cupples²⁶, L. Adrienne Cupples⁴³, Alexander P. Reiner⁴⁰, Alexander P. Reiner⁵, Eric Boerwinkle³³, Sekar Kathiresan², Sekar Kathiresan³ - Show less +93 more•Institutions (43)

University of Texas Health Science Center at Houston¹, Broad Institute², Harvard University³, University of Wisconsin–Milwaukee⁴, University of Washington⁵, Washington University in St. Louis⁶, University of North Carolina at Chapel Hill⁷, Icahn School of Medicine at Mount Sinai⁸, University of Michigan⁹, Lund University¹⁰, University of Leicester¹¹, Queen Mary University of London¹², University of Oxford¹³, University of Milan¹⁴, University of Verona¹⁵, Merck & Co.¹⁶, National Institutes of Health¹⁷, Levanger Hospital¹⁸, Norwegian University of Science and Technology¹⁹, University of Ottawa²⁰, Stanford University²¹, University of Iowa²², George Washington University²³, Umeå University²⁴, University of Dundee²⁵, Cambridge University Hospitals NHS Foundation Trust²⁶, Technische Universität München²⁷, University of Kiel²⁸, University of Lübeck²⁹, University of Bonn³⁰, Group Health Cooperative³¹, Houston Methodist Hospital³², Baylor College of Medicine³³, Tufts University³⁴, IMDEA³⁵, University of Leeds³⁶, Wellcome Trust Sanger Institute³⁷, King Abdulaziz University³⁸, University of Mississippi³⁹, Fred Hutchinson Cancer Research Center⁴⁰, University of Virginia⁴¹, University of Vermont⁴², Boston University⁴³

02 Jul 2014-The New England Journal of Medicine

TL;DR: Rare mutations that disrupt AP OC3 function were associated with lower levels of plasma triglycerides and APOC3, and carriers of these mutations were found to have a reduced risk of coronary heart disease.

...read moreread less

Abstract: Background Plasma triglyceride levels are heritable and are correlated with the risk of coronary heart disease. Sequencing of the protein-coding regions of the human genome (the exome) has the potential to identify rare mutations that have a large effect on phenotype. Methods We sequenced the protein-coding regions of 18,666 genes in each of 3734 participants of European or African ancestry in the Exome Sequencing Project. We conducted tests to determine whether rare mutations in coding sequence, individually or in aggregate within a gene, were associated with plasma triglyceride levels. For mutations associated with triglyceride levels, we subsequently evaluated their association with the risk of coronary heart disease in 110,970 persons. Results An aggregate of rare mutations in the gene encoding apolipoprotein C3 (APOC3) was associated with lower plasma triglyceride levels. Among the four mutations that drove this result, three were loss-of-function mutations: a nonsense mutation (R19X) and two splice-site mutations (IVS2+1G→A and IVS3+1G→T). The fourth was a missense mutation (A43T). Approximately 1 in 150 persons in the study was a heterozygous carrier of at least one of these four mutations. Triglyceride levels in the carriers were 39% lower than levels in noncarriers (P<1×10 − 20 ), and circulating levels of APOC3 in carriers were 46% lower than levels in noncarriers (P = 8×10 − 10 ). The risk of coronary heart disease among 498 carriers of any rare APOC3 mutation was 40% lower than the risk among 110,472 noncarriers (odds ratio, 0.60; 95% confidence interval, 0.47 to 0.75; P = 4×10 − 6 ). Conclusions Rare mutations that disrupt APOC3 function were associated with lower levels of plasma triglycerides and APOC3. Carriers of these mutations were found to have a reduced risk of coronary heart disease. (Funded by the National Heart, Lung, and Blood Institute and others.)

...read moreread less

877 citations

Journal Article•DOI•

Clinical risk factors, DNA variants, and the development of type 2 diabetes.

[...]

Valeriya Lyssenko, Anna Jonsson, Peter Almgren, Nicolo Pulizzi, Bo Isomaa, Tiinamaija Tuomi, Göran Berglund, David Altshuler, Peter M. Nilsson, Leif Groop - Show less +6 more

20 Nov 2008-The New England Journal of Medicine

TL;DR: Variants in 11 genes were significantly associated with the risk of type 2 diabetes independently of clinical risk factors; variants in 8 of these genes were associated with impaired beta-cell function.

...read moreread less

Abstract: Background Type 2 diabetes mellitus is thought to develop from an interaction between environmental and genetic factors. We examined whether clinical or genetic factors or both could predict progression to diabetes in two prospective cohorts. Methods We genotyped 16 single-nucleotide polymorphisms (SNPs) and examined clinical factors in 16,061 Swedish and 2770 Finnish subjects. Type 2 diabetes developed in 2201 (11.7%) of these subjects during a median follow-up period of 23.5 years. We also studied the effect of genetic variants on changes in insulin secretion and action over time. Results Strong predictors of diabetes were a family history of the disease, an increased body-mass index, elevated liver-enzyme levels, current smoking status, and reduced measures of insulin secretion and action. Variants in 11 genes (TCF7L2, PPARG, FTO, KCNJ11, NOTCH2, WFS1, CDKAL1, IGF2BP2, SLC30A8, JAZF1, and HHEX) were significantly associated with the risk of type 2 diabetes independently of clinical risk factors; variants in 8 of these genes were associated with impaired beta-cell function. The addition of specific genetic information to clinical factors slightly improved the prediction of future diabetes, with a slight increase in the area under the receiveroperating-characteristic curve from 0.74 to 0.75; however, the magnitude of the increase was significant (P = 1.0×10 −4 ). The discriminative power of genetic risk factors improved with an increasing duration of follow-up, whereas that of clinical risk factors decreased. Conclusions As compared with clinical risk factors alone, common genetic variants associated with the risk of diabetes had a small effect on the ability to predict the future development of type 2 diabetes. The value of genetic factors increased with an increasing duration of follow-up.

...read moreread less

871 citations

Journal Article•DOI•

The genetic architecture of type 2 diabetes

[...]

Christian Fuchsberger¹, Christian Fuchsberger², Jason Flannick³, Jason Flannick⁴ +346 more•Institutions (77)

11 Jul 2016-Nature

TL;DR: In this paper, the authors performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing for 12,940 individuals from five ancestry groups.

...read moreread less

Abstract: The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of the heritability of this disease. Here, to test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing in 12,940 individuals from five ancestry groups. To increase statistical power, we expanded the sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support the idea that lower-frequency variants have a major role in predisposition to type 2 diabetes.

...read moreread less

866 citations

Journal Article•DOI•

Copy number variation: New insights in genome diversity

[...]

Jennifer L. Freeman¹, George H. Perry, Lars Feuk², Richard Redon³, Steven A. McCarroll⁴, David Altshuler⁴, Hiroyuki Aburatani⁵, Keith W. Jones⁶, Chris Tyler-Smith³, Matthew E. Hurles³, Nigel P. Carter³, Stephen W. Scherer², Charles Lee⁴ - Show less +9 more•Institutions (6)

Brigham and Women's Hospital¹, University of Toronto², Wellcome Trust Sanger Institute³, Harvard University⁴, University of Tokyo⁵, Thermo Fisher Scientific⁶

01 Aug 2006-Genome Research

TL;DR: Current efforts are directed toward a more comprehensive cataloging and characterization of CNVs that will provide the basis for determining how genomic diversity impacts biological function, evolution, and common human diseases.

...read moreread less

Abstract: DNA copy number variation has long been associated with specific chromosomal rearrangements and genomic disorders, but its ubiquity in mammalian genomes was not fully realized until recently. Although our understanding of the extent of this variation is still developing, it seems likely that, at least in humans, copy number variants (CNVs) account for a substantial amount of genetic variation. Since many CNVs include genes that result in differential levels of gene expression, CNVs may account for a significant proportion of normal phenotypic variation. Current efforts are directed toward a more comprehensive cataloging and characterization of CNVs that will provide the basis for determining how genomic diversity impacts biological function, evolution, and common human diseases.

...read moreread less

855 citations

1
2
3
4
5
6
7
…
8
9
10
11
12
13
14
…
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fast gapped-read alignment with Bowtie 2

[...]

Ben Langmead¹, Steven L. Salzberg¹, Steven L. Salzberg², Steven L. Salzberg³•Institutions (3)

University of Maryland, College Park¹, Johns Hopkins University School of Medicine², Johns Hopkins University³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

37,898 citations

Journal Article•DOI•

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

[...]

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹ - Show less +7 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The Gene Set Enrichment Analysis (GSEA) method as discussed by the authors focuses on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

...read moreread less

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

...read moreread less

34,830 citations

Journal Article•DOI•

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

[...]

Shaun Purcell¹, Shaun Purcell², Benjamin M. Neale¹, Benjamin M. Neale³, Kathe Todd-Brown², Lori Thomas², Manuel A. R. Ferreira², David Bender², David Bender¹, Julian Maller¹, Julian Maller², Pamela Sklar², Pamela Sklar¹, Paul I.W. de Bakker², Paul I.W. de Bakker¹, Mark J. Daly², Mark J. Daly¹, Pak C. Sham⁴ - Show less +14 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², University of London³, University of Hong Kong⁴

01 Sep 2007-American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.

...read moreread less

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.

...read moreread less

26,280 citations

Journal Article•DOI•

Initial sequencing and analysis of the human genome.

[...]

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹ +245 more•Institutions (29)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

...read moreread less

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

...read moreread less

22,269 citations

Journal Article•DOI•

limma powers differential expression analyses for RNA-sequencing and microarray studies

[...]

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth⁵, Gordon K. Smyth¹ - Show less +4 more•Institutions (5)

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015-Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

22,147 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse