David Altshuler

Other affiliations: Vertex Pharmaceuticals, Massachusetts Institute of Technology, Broad Institute ...

Bio: David Altshuler is an academic researcher from the University of Michigan. The author has contributed to research on the topics Genome-wide association study and Population. The author has an h-index of 162 and has co-authored 345 publications receiving 201,782 citations. Previous affiliations of David Altshuler include Vertex Pharmaceuticals and Massachusetts Institute of Technology.

Papers published on a yearly basis (chart not shown; years on the axis: 1992, 1993, and 1998 to 2023)

Papers

Journal Article

Whole-exome imputation of sequence variants identified two novel alleles associated with adult body height in African Americans

Mengmeng Du¹,², Paul L. Auer¹,³, Shuo Jiao¹, Jeffrey Haessler¹, David Altshuler⁴, Eric Boerwinkle⁵, Christopher S. Carlson¹, Cara L. Carty¹, Yii-Der Ida Chen⁶, Keith R. Curtis¹, Nora Franceschini⁷, Li Hsu¹, Rebecca D. Jackson⁸, Leslie A. Lange⁷, Guillaume Lettre⁹, Keri L. Monda⁷, Deborah A. Nickerson¹, Alexander P. Reiner¹⁰, Stephen S. Rich¹, Stephanie A. Rosse⁶, Jerome I. Rotter¹¹, Cristen J. Willer¹², James G. Wilson⁷, Kari E. North¹, Charles Kooperberg¹³, Nancy L. Heard-Costa¹, Ulrike Peters¹

Fred Hutchinson Cancer Research Center¹, Brigham and Women's Hospital², University of Wisconsin–Milwaukee³, Broad Institute⁴, University of Texas Health Science Center at Houston⁵, Los Angeles Biomedical Research Institute⁶, University of North Carolina at Chapel Hill⁷, The Ohio State University Wexner Medical Center⁸, Montreal Heart Institute⁹, University of Virginia¹⁰, University of Michigan¹¹, University of Mississippi Medical Center¹², Boston University¹³

15 Jul 2014-Human Molecular Genetics

TL;DR: Whole-exome imputation of sequence variants can identify low-frequency variants and discover novel variants in non-European populations, as shown by a search for associations between height and common or infrequent variants across the exome in African Americans.

Abstract: Adult body height is a quantitative trait for which genome-wide association studies (GWAS) have identified numerous loci, primarily in European populations. These loci, comprising common variants, explain <10% of the phenotypic variance in height. We searched for novel associations between height and common (minor allele frequency, MAF ≥5%) or infrequent (0.5% < MAF < 5%) variants across the exome in African Americans. Using a reference panel of 1692 African Americans and 471 Europeans from the National Heart, Lung, and Blood Institute's (NHLBI) Exome Sequencing Project (ESP), we imputed whole-exome sequence data into 13 719 African Americans with existing array-based GWAS data (discovery). Variants achieving a height-association threshold of P < 5E−06 in the imputed dataset were followed up in an independent sample of 1989 African Americans with whole-exome sequence data (replication). We used P < 2.5E−07 (=0.05/196 779 variants) to define statistically significant associations in meta-analyses combining the discovery and replication sets (N = 15 708). We discovered and replicated three independent loci for association: 5p13.3/C5orf22/rs17410035 (MAF = 0.10, β = 0.64 cm, P = 8.3E−08), 13q14.2/SPRYD7/rs114089985 (MAF = 0.03, β = 1.46 cm, P = 4.8E−10) and 17q23.3/GH2/rs2006123 (MAF = 0.30; β = 0.47 cm; P = 4.7E−09). Conditional analyses suggested 5p13.3 (C5orf22/rs17410035) and 13q14.2 (SPRYD7/rs114089985) may harbor novel height alleles independent of previous GWAS-identified variants (r2 with GWAS loci <0.01); whereas 17q23.3/GH2/rs2006123 was correlated with GWAS-identified variants in European and African populations. Notably, 13q14.2/rs114089985 is infrequent in African Americans (MAF = 3%), extremely rare in European Americans (MAF = 0.03%), and monomorphic in Asian populations, suggesting it may be an African-American-specific height allele. 
Our findings demonstrate that whole-exome imputation of sequence variants can identify low-frequency variants and discover novel variants in non-European populations.
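The significance threshold quoted in the abstract (0.05/196 779 variants) and its two allele-frequency classes can be checked with a few lines of Python. This is only an illustrative sketch: the function names `bonferroni_threshold` and `maf_class` and the label `"other"` are assumptions made for this example, not anything taken from the paper; the numbers are the ones reported in the abstract.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance level for a family-wise error rate alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def maf_class(maf: float) -> str:
    """Frequency classes as defined in the abstract (hypothetical helper).

    common: MAF >= 5%; infrequent: 0.5% < MAF < 5%; anything else: "other".
    """
    if maf >= 0.05:
        return "common"
    if 0.005 < maf < 0.05:
        return "infrequent"
    return "other"


# Values reported in the abstract: (MAF, meta-analysis P value) per variant.
reported = {
    "rs17410035": (0.10, 8.3e-08),
    "rs114089985": (0.03, 4.8e-10),
    "rs2006123": (0.30, 4.7e-09),
}

threshold = bonferroni_threshold(0.05, 196_779)  # about 2.5e-07
for rsid, (maf, p_value) in reported.items():
    print(rsid, maf_class(maf), p_value < threshold)
```

Under these definitions all three reported P values fall below the threshold, and only rs114089985 lands in the infrequent class.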

14 citations

Journal Article

Genetic association analysis of LARS2 with type 2 diabetes

Erwin Reiling¹, Bahram Jafar-Mohammadi²,³, E. van 't Riet⁴, Michael N. Weedon⁵, J. V. van Vliet-Ostaptchouk⁶, Torben Hansen⁷,⁸, Richa Saxena⁹, T.W. van Haeften¹⁰, P. A. Arp¹¹, S. Das², Giel Nijpels⁴, M. J. Groenewoud¹, E. C. van Hove¹, André G. Uitterlinden¹¹, Jan W. A. Smit¹, Andrew D. Morris¹², Alex S. F. Doney¹², Colin N. A. Palmer¹², Candace Guiducci⁹, Andrew T. Hattersley⁵, Timothy M. Frayling⁵, Oluf Pedersen¹³,¹⁴,⁸, P. E. Slagboom¹, David Altshuler¹⁵,⁹, Leif Groop¹⁶,¹⁷, Johannes A. Romijn¹, Johannes A Maassen¹, Marten H. Hofker⁶, J. M. Dekker⁴, Mark I. McCarthy¹⁸,³,², Leen M 't Hart¹

Leiden University Medical Center¹, University of Oxford², National Institute for Health Research³, VU University Medical Center⁴, University of Exeter⁵, University of Groningen⁶, University of Southern Denmark⁷, Steno Diabetes Center⁸, Massachusetts Institute of Technology⁹, Utrecht University¹⁰, Erasmus University Medical Center¹¹, University of Dundee¹², Health Science University¹³, Aarhus University¹⁴, Harvard University¹⁵, Lund University¹⁶, University of Helsinki¹⁷, Wellcome Trust Centre for Human Genetics¹⁸

01 Jan 2010-Diabetologia

TL;DR: In this study, the largest study examining the role of sequence variants in LARS2 in type 2 diabetes susceptibility, no evidence to support previous data indicating a role in type 2 diabetes susceptibility was found.

Abstract: LARS2 has been previously identified as a potential type 2 diabetes susceptibility gene through the low-frequency H324Q (rs71645922) variant (minor allele frequency [MAF] 3.0%). However, this association did not achieve genome-wide levels of significance. The aim of this study was to establish the true contribution of this variant and common variants in LARS2 (MAF > 5%) to type 2 diabetes risk. We combined genome-wide association data (n = 10,128) from the DIAGRAM consortium with independent data derived from a tagging single nucleotide polymorphism (SNP) approach in Dutch individuals (n = 999) and took forward two SNPs of interest to replication in up to 11,163 Dutch participants (rs17637703 and rs952621). In addition, because inspection of genome-wide association study data identified a cluster of low-frequency variants with evidence of type 2 diabetes association, we attempted replication of rs9825041 (a proxy for this group) and the previously identified H324Q variant in up to 35,715 participants of European descent. No association between the common SNPs in LARS2 and type 2 diabetes was found. Our replication studies for the two low-frequency variants, rs9825041 and H324Q, failed to confirm an association with type 2 diabetes in Dutch, Scandinavian and UK samples (OR 1.03 [95% CI 0.95-1.12], p = 0.45, n = 31,962 and OR 0.99 [0.90-1.08], p = 0.78, n = 35,715 respectively). In this study, the largest study examining the role of sequence variants in LARS2 in type 2 diabetes susceptibility, we found no evidence to support previous data indicating a role in type 2 diabetes susceptibility.

14 citations

Journal Article

Association testing of common variants in the insulin receptor substrate-1 gene (IRS1) with type 2 diabetes.

Jose C. Florez¹,², Marketa Sjögren³, Christina M. Agapakis¹,², Noël P. Burtt¹, Peter Almgren³, Ulf Lindblad³, Göran Berglund³, Tiinamaija Tuomi⁴,⁵, Daniel Gaudet, Mark J. Daly¹,², Kristin G. Ardlie¹, Joel N. Hirschhorn²,⁶,¹, David Altshuler²,¹, Leif Groop⁵,³

Broad Institute¹, Harvard University², Lund University³, University of Helsinki⁴, Helsinki University Central Hospital⁵, Boston Children's Hospital⁶

19 Apr 2007-Diabetologia

TL;DR: The data do not support an association of common variants in IRS1 with type 2 diabetes in populations of European descent.

Abstract: Aims/hypothesis: Activation of the insulin receptor substrate-1 (IRS1) is a key initial step in the insulin signalling pathway. Despite several reports of association of the G972R polymorphism in its gene IRS1 with type 2 diabetes, we and others have not observed this association in well-powered samples. However, other nearby variants might account for the putative association signal. Subjects and methods: We characterised the haplotype map of IRS1 and selected 20 markers designed to capture common variations in the region. We genotyped this comprehensive set of markers in several family-based and case-control samples of European descent totalling 12,129 subjects. Results: In an initial sample of 2,235 North American and Polish case-control pairs, the minor allele of the rs934167 polymorphism showed nominal evidence of association with type 2 diabetes (odds ratio [OR] 1.25, 95% CI 1.03-1.51, p=0.03). This association showed a trend in the same direction in 7,659 Scandinavian samples (OR 1.16, 95% CI 0.96-1.39, p=0.059). The combined OR was 1.20 (p=0.008), but statistical correction for the number of variants examined yielded a p value of 0.086. We detected no differences across rs934167 genotypes in insulin-related quantitative traits. Conclusion/interpretation: Our data do not support an association of common variants in IRS1 with type 2 diabetes in populations of European descent.

14 citations

Journal Article

Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans (Nature Genetics (2008) 40, (189-197))

Sekar Kathiresan, Olle Melander, Candace Guiducci, Aarti Surti, Noël P. Burtt, Mark J. Rieder, Gregory M. Cooper, Charlotta Roos, Benjamin F. Voight, Aki S. Havulinna, Björn Wahlstrand, Thomas Hedner, Dolores Corella, E. Shyong Tai, Jose M. Ordovas, Göran Berglund, Erkki Vartiainen, Pekka Jousilahti, Bo Hedblad, Marja-Riitta Taskinen, Christopher Newton-Cheh, Veikko Salomaa, Leena Peltonen, Leif Groop, David Altshuler, Marju Orho-Melander

01 Nov 2008-Nature Genetics

14 citations

Journal Article

European admixture on the Micronesian island of Kosrae: lessons from complete genetic information

Penelope E. Bonnen¹, Jennifer K. Lowe²,³,⁴, David Altshuler, Jan L. Breslow², Markus Stoffel⁵,², Jeffrey M. Friedman⁶,², Itsik Pe'er⁷

Baylor College of Medicine¹, Rockefeller University², Broad Institute³, Harvard University⁴, École Polytechnique Fédérale de Lausanne⁵, Howard Hughes Medical Institute⁶, Columbia University⁷

01 Mar 2010-European Journal of Human Genetics

TL;DR: Novel software that uses SNP data to delineate ancestry for individual segments of the genome is developed and shows the benefit of combining information from autosomal and uniparental polymorphisms and provides new methodology for determining ancestry in a population.

Abstract: The architecture of natural variation present in a contemporary population is a result of multiple population genetic forces, including population bottleneck and expansion, selection, drift, and admixture. We seek to untangle the contribution of admixture to genetic diversity on the Micronesian island of Kosrae. Toward this goal, we used a complete genetic approach by combining a dense genome-wide map of 100 000 single-nucleotide polymorphisms (SNPs) with data from uniparental markers from the mitochondrial genome and the nonrecombining portion of the Y chromosome. These markers were typed in ∼3200 individuals from Kosrae, representing 80% of the adult population of the island. We developed novel software that uses SNP data to delineate ancestry for individual segments of the genome. Through this analysis, we determined that 39% of Kosraens have some European ancestry. However, the vast majority of admixed individuals (77%) have European alleles spanning less than 10% of their genomes. Data from uniparental markers show most of this admixture to be male, introduced in the late nineteenth century. Furthermore, pedigree analysis shows that the majority of European admixture on Kosrae is because of the contribution of one individual. This approach shows the benefit of combining information from autosomal and uniparental polymorphisms and provides new methodology for determining ancestry in a population.

14 citations

Cited by

Journal Article

Fast gapped-read alignment with Bowtie 2

Ben Langmead¹, Steven L. Salzberg¹,²,³

University of Maryland, College Park¹, Johns Hopkins University School of Medicine², Johns Hopkins University³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal Article

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: Gene Set Enrichment Analysis (GSEA) is an analytical method for interpreting gene expression data that focuses on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

34,830 citations

Journal Article

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

Shaun Purcell¹,², Benjamin M. Neale¹,³, Kathe Todd-Brown², Lori Thomas², Manuel A. R. Ferreira², David Bender²,¹, Julian Maller¹,², Pamela Sklar²,¹, Paul I.W. de Bakker²,¹, Mark J. Daly²,¹, Pak C. Sham⁴

Massachusetts Institute of Technology¹, Harvard University², University of London³, University of Hong Kong⁴

01 Sep 2007-American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes its five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, focusing on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies.

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
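The identity-by-state idea mentioned in the abstract can be illustrated with a small Python sketch. This is not PLINK's implementation; it is a simplified pairwise calculation under stated assumptions: genotypes are coded as minor-allele counts (0, 1, or 2), missing calls are `None`, and the function name `ibs_proportion` is hypothetical. At each marker two individuals share 2 - |g1 - g2| alleles identical by state, and the result is the average shared fraction over markers where both are genotyped.

```python
from typing import Optional, Sequence


def ibs_proportion(
    g1: Sequence[Optional[int]], g2: Sequence[Optional[int]]
) -> float:
    """Average proportion of alleles shared identical by state (IBS).

    Genotypes are minor-allele counts (0, 1, 2); None marks a missing call.
    Markers with a missing call in either individual are skipped.
    """
    if len(g1) != len(g2):
        raise ValueError("genotype vectors must have the same length")
    shared = 0
    n_markers = 0
    for a, b in zip(g1, g2):
        if a is None or b is None:
            continue
        shared += 2 - abs(a - b)  # 0, 1, or 2 alleles shared at this marker
        n_markers += 1
    if n_markers == 0:
        raise ValueError("no markers genotyped in both individuals")
    return shared / (2 * n_markers)


# Toy genotypes for two individuals at five markers (one call missing).
ind_a = [0, 1, 2, 2, None]
ind_b = [0, 2, 0, 2, 1]
print(ibs_proportion(ind_a, ind_b))
```

Identity by state only records that alleles match; inferring identity by descent, as the abstract describes, additionally requires allele frequencies and a model of relatedness, which this sketch does not attempt.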

26,280 citations

Journal Article

Initial sequencing and analysis of the human genome.

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹ +245 more•Institutions (29)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal Article

limma powers differential expression analyses for RNA-sequencing and microarray studies

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth⁵,¹

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015-Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

22,147 citations
