Author

David Altshuler

Other affiliations: Vertex Pharmaceuticals, Massachusetts Institute of Technology, Broad Institute ...

Bio: David Altshuler is an academic researcher from the University of Michigan. The author has contributed to research in the topics Genome-wide association study and Population. The author has an h-index of 162 and has co-authored 345 publications receiving 201,782 citations. Previous affiliations of David Altshuler include Vertex Pharmaceuticals and Massachusetts Institute of Technology.

Papers published on a yearly basis (years listed: 1992, 1993, 1998 to 2023)

Papers

Journal Article

Integrated allelic, transcriptional, and phenomic dissection of the cardiac effects of titin truncations in health and disease

Angharad M. Roberts¹,², James S. Ware, Daniel S. Herman³,⁴,⁵, Sebastian Schafer⁶, John Baksi², Alexander G. Bick³,⁴, Rachel Buchan², Roddy Walsh², Shibu John², S Wilkinson², Francesco Mazzarotto²,⁷, Leanne E. Felkin²,⁷, Sungsam Gong², Jacqueline A. L. MacArthur⁸, Fiona Cunningham⁸, Jason Flannick³,⁴, Stacey Gabriel³, David Altshuler³,⁴, Peter S. Macdonald⁹,¹⁰,¹¹, Matthias Heinig⁶, Anne Keogh⁹,¹⁰,¹¹, Christopher S. Hayward⁹,¹⁰,¹¹, Nicholas R. Banner⁷,¹², Dudley J. Pennell²,⁷, Declan P. O'Regan¹, Tan Ru San, Antonio de Marvao¹, Timothy J W Dawes¹, Ankur Gulati², Emma J. Birks⁷,¹³, Magdi H. Yacoub⁷, Michael H. Radke⁶, Michael Gotthardt⁶, James G. Wilson¹⁴, Christopher J. O'Donnell, Sanjay K Prasad², Paul J.R. Barton²,⁷, Diane Fatkin⁹,¹⁰,¹¹, Norbert Hubner⁶,¹⁵, Jonathan G. Seidman⁴, Christine E. Seidman, Stuart A. Cook

Institutions (15):

Medical Research Council¹, National Institute for Health Research², Massachusetts Institute of Technology³, Harvard University⁴, University of Washington⁵, Max Delbrück Center for Molecular Medicine⁶, National Institutes of Health⁷, European Bioinformatics Institute⁸, St. Vincent's Health System⁹, University of New South Wales¹⁰, Victor Chang Cardiac Research Institute¹¹, Harefield Hospital¹², University of Louisville¹³, University of Mississippi Medical Center¹⁴, Charité¹⁵

14 Jan 2015-Science Translational Medicine

TL;DR: This study shows that TTNtv is the most common genetic cause of DCM in ambulant patients in the community, identifies clinically important manifestations of TTNtv-positive DCM, and defines the penetrance and outcomes of TTNtv in the general population.

Abstract: The recent discovery of heterozygous human mutations that truncate full-length titin (TTN, an abundant structural, sensory, and signaling filament in muscle) as a common cause of end-stage dilated cardiomyopathy (DCM) promises new prospects for improving heart failure management. However, realization of this opportunity has been hindered by the burden of TTN-truncating variants (TTNtv) in the general population and uncertainty about their consequences in health or disease. To elucidate the effects of TTNtv, we coupled TTN gene sequencing with cardiac phenotyping in 5267 individuals across the spectrum of cardiac physiology and integrated these data with RNA and protein analyses of human heart tissues. We report diversity of TTN isoform expression in the heart, define the relative inclusion of TTN exons in different isoforms (using the TTN transcript annotations available at http://cardiodb.org/titin), and demonstrate that these data, coupled with the position of the TTNtv, provide a robust strategy to discriminate pathogenic from benign TTNtv. We show that TTNtv is the most common genetic cause of DCM in ambulant patients in the community, identify clinically important manifestations of TTNtv-positive DCM, and define the penetrance and outcomes of TTNtv in the general population. By integrating genetic, transcriptome, and protein analyses, we provide evidence for a length-dependent mechanism of disease. These data inform diagnostic criteria and management strategies for TTNtv-positive DCM patients and for TTNtv that are identified as incidental findings.

341 citations

Journal Article

Genome coverage and sequence fidelity of ϕ29 polymerase‐based multiple strand displacement whole genome amplification

J. Guillermo Paez¹, Ming Lin, Rameen Beroukhim, Jeffrey C. Lee, Xiaojun Zhao, Daniel J. Richter, Stacey Gabriel, Paula Herman, Hidefumi Sasaki, David Altshuler, Cheng Li, Matthew Meyerson, William R. Sellers

Institutions (1):

Harvard University¹

01 May 2004-Nucleic Acids Research

TL;DR: In this article, the fidelity and genome representation of ϕ29 polymerase-based genome amplification (ϕ29MDA) were tested using direct sequencing and high density oligonucleotide arrays probing >10 000 SNP alleles.

Abstract: Major efforts are underway to systematically define the somatic and germline genetic variations causally associated with disease. Genome-wide genetic analysis of actual clinical samples is, however, limited by the paucity of genomic DNA available. Here we have tested the fidelity and genome representation of ϕ29 polymerase-based genome amplification (ϕ29MDA) using direct sequencing and high density oligonucleotide arrays probing >10 000 SNP alleles. Genome representation was comprehensive and estimated to be 99.82% complete, although six regions encompassing a maximum of 5.62 Mb failed to amplify. There was no degradation in the accuracy of SNP genotyping and, in direct sequencing experiments sampling 500 000 bp, the estimated error rate (9.5 × 10⁻⁶) was the same as in paired unamplified samples. The detection of cancer-associated loss of heterozygosity and copy number changes, including homozygous deletion and gene amplification, was similarly robust. These results suggest that ϕ29MDA yields high fidelity, near-complete genome representation suitable for high resolution genetic analysis.

332 citations

Journal Article

Genetic variants at CD28, PRDM1, and CD2/CD58 are associated with rheumatoid arthritis risk

Soumya Raychaudhuri¹,²,³, Brian Thomson¹, Elaine F. Remmers⁴, Stephen Eyre⁵, Anne Hinks⁵, Candace Guiducci¹, Joseph J. Catanese⁶, Gang Xie⁷, Eli A. Stahl², Robert Chen², Lars Alfredsson⁸, Christopher I. Amos⁹, Kristin G. Ardlie¹, Anne Barton⁵, John Bowes⁵, Noël P. Burtt¹, Monica Chang⁶, Jonathan S. Coblyn², Karen H. Costenbader², Lindsey A. Criswell¹⁰, J. Bart A. Crusius¹¹, Jing Cui², Phillip L. De Jager¹,², Bo Ding⁸, Paul Emery¹², Edward Flynn⁵, Pille Harrison¹³, Lynne J. Hocking¹⁴, Tom W J Huizinga¹⁵, Daniel L. Kastner⁴, Xiayi Ke⁵, Fina A S Kurreeman²,¹⁵, Annette Lee¹⁶, Xiangdong Liu⁷, Yonghong Li⁶, Paul Martin⁵, Ann W. Morgan¹², Leonid Padyukov¹⁷, David M. Reid¹⁴, Mark Seielstad¹⁸, Michael F. Seldin¹⁹, Nancy A. Shadick², Sophia Steer²⁰, Paul P. Tak²¹, Wendy Thomson⁵, Annette H M van der Helm-van Mil¹⁵, Irene E. van der Horst-Bruinsma¹¹, Michael E. Weinblatt², Anthony G. Wilson²², Gertjan Wolbink²¹, Paul Wordsworth¹³, David Altshuler¹,³, Elizabeth W. Karlson², René E. M. Toes¹⁵, Niek de Vries²¹, Ann B. Begovich⁶,²³, Katherine A. Siminovitch⁷, Jane Worthington⁵, Lars Klareskog¹⁷, Peter K. Gregersen¹⁶, Mark J. Daly¹,³, Robert M. Plenge¹,²

Institutions (23):

Broad Institute¹, Brigham and Women's Hospital², Harvard University³, National Institutes of Health⁴, University of Manchester⁵, Celera Corporation⁶, University Health Network⁷, Karolinska Institutet⁸, University of Texas MD Anderson Cancer Center⁹, University of California, San Francisco¹⁰, VU University Medical Center¹¹, National Institute for Health Research¹², University of Oxford¹³, University of Aberdeen¹⁴, Leiden University¹⁵, North Shore-LIJ Health System¹⁶, Karolinska University Hospital¹⁷, Genome Institute of Singapore¹⁸, University of California, Davis¹⁹, King's College²⁰, University of Amsterdam²¹, University of Sheffield²², Hoffmann-La Roche²³

08 Nov 2009-Nature Genetics

TL;DR: To discover new rheumatoid arthritis (RA) risk loci, the authors used Gene Relationships Across Implicated Loci (GRAIL), a computational method that applies statistical text mining to PubMed abstracts, to score 179 independent loci for functional relationships to genes in 16 established RA disease loci.

Abstract: To discover new rheumatoid arthritis (RA) risk loci, we systematically examined 370 SNPs from 179 independent loci with P < 0.001 in a published meta-analysis of RA genome-wide association studies (GWAS) of 3,393 cases and 12,462 controls. We used Gene Relationships Across Implicated Loci (GRAIL), a computational method that applies statistical text mining to PubMed abstracts, to score these 179 loci for functional relationships to genes in 16 established RA disease loci. We identified 22 loci with a significant degree of functional connectivity. We genotyped 22 representative SNPs in an independent set of 7,957 cases and 11,958 matched controls. Three were convincingly validated: CD2-CD58 (rs11586238, P = 1 × 10⁻⁶ replication, P = 1 × 10⁻⁹ overall), CD28 (rs1980422, P = 5 × 10⁻⁶ replication, P = 1 × 10⁻⁹ overall) and PRDM1 (rs548234, P = 1 × 10⁻⁵ replication, P = 2 × 10⁻⁸ overall). An additional four were replicated (P < 0.0023): TAGAP (rs394581, P = 0.0002 replication, P = 4 × 10⁻⁷ overall), PTPRC (rs10919563, P = 0.0003 replication, P = 7 × 10⁻⁷ overall), TRAF6-RAG1 (rs540386, P = 0.0008 replication, P = 4 × 10⁻⁶ overall) and FCGR2A (rs12746613, P = 0.0022 replication, P = 2 × 10⁻⁵ overall). Many of these loci are also associated with other immunologic diseases.

331 citations

Journal Article

Linkage Disequilibrium and Heritability of Copy-Number Polymorphisms within Duplicated Regions of the Human Genome

Devin P. Locke¹, Andrew J. Sharp¹, Steven A. McCarroll²,³, Sean McGrath¹, Tera L. Newman¹, Ze Cheng¹,⁴, Stuart Schwartz⁵, Donna G. Albertson⁶, Daniel Pinkel⁶, David Altshuler, Evan E. Eichler¹,⁴

Institutions (6):

University of Washington¹, Harvard University², Massachusetts Institute of Technology³, Howard Hughes Medical Institute⁴, University of Chicago⁵, University of California, San Francisco⁶

01 Aug 2006-American Journal of Human Genetics

TL;DR: A combination of BAC-based and high-density customized oligonucleotide arrays was used to resolve the molecular basis of structural rearrangements; the results underscore the need for complete maps of genetic variation in duplication-rich regions of the genome.

Abstract: Studies of copy-number variation and linkage disequilibrium (LD) have typically excluded complex regions of the genome that are rich in duplications and prone to rearrangement. In an attempt to assess the heritability and LD of copy-number polymorphisms (CNPs) in duplication-rich regions of the genome, we profiled copy-number variation in 130 putative "rearrangement hotspot regions" among 269 individuals of European, Yoruba, Chinese, and Japanese ancestry analyzed by the International HapMap Consortium. Eighty-four hotspot regions, corresponding to 257 bacterial artificial chromosome (BAC) probes, showed evidence of copy-number differences. Despite a predisposing genetic architecture, no polymorphism was ever observed in the remaining 46 "rearrangement hotspots," and we suggest these represent excellent candidate sites for pathogenic rearrangements. We used a combination of BAC-based and high-density customized oligonucleotide arrays to resolve the molecular basis of structural rearrangements. For common variants (frequency >10%), we observed a distinct bias against copy-number losses, suggesting that deletions are subject to purifying selection. Heritability estimates did not differ significantly from 1.0 among the majority (30 of 34) of loci analyzed, consistent with normal Mendelian inheritance. Some of the CNPs in duplication-rich regions showed strong LD with nearby single-nucleotide polymorphisms (SNPs) and were observed to segregate on ancestral SNP haplotypes. However, LD with the best available SNP markers was weaker than has been reported for deletion polymorphisms in less complex regions of the genome. These observations may be accounted for by a low density of SNP data in duplicated regions, challenges in mapping and typing the CNPs, and the possibility that CNPs in these regions have rearranged on multiple haplotype backgrounds. Our results underscore the need for complete maps of genetic variation in duplication-rich regions of the genome.

330 citations

Journal Article

A locus on 19p13 modifies risk of breast cancer in BRCA1 mutation carriers and is associated with hormone receptor-negative breast cancer in the general population

Antonis C. Antoniou¹, Xianshu Wang², Zachary S. Fredericksen², Lesley McGuffog¹, +179 more; Institutions (79)

01 Oct 2010-Nature Genetics

TL;DR: Five SNPs on 19p13 were associated with breast cancer risk in BRCA1 mutation carriers. Genotyping these SNPs in 6,800 population-based breast cancer cases and 6,613 controls identified a similar association with estrogen receptor-negative breast cancer and an association with estrogen receptor-positive disease in the opposite direction.

Abstract: Germline BRCA1 mutations predispose to breast cancer. To identify genetic modifiers of this risk, we performed a genome-wide association study in 1,193 individuals with BRCA1 mutations who were diagnosed with invasive breast cancer under age 40 and 1,190 BRCA1 carriers without breast cancer diagnosis over age 35. We took forward 96 SNPs for replication in another 5,986 BRCA1 carriers (2,974 individuals with breast cancer and 3,012 unaffected individuals). Five SNPs on 19p13 were associated with breast cancer risk (P-trend = 2.3 × 10⁻⁹ to P-trend = 3.9 × 10⁻⁷), two of which showed independent associations (rs8170, hazard ratio (HR) = 1.26, 95% CI 1.17-1.35; rs2363956 HR = 0.84, 95% CI 0.80-0.89). Genotyping these SNPs in 6,800 population-based breast cancer cases and 6,613 controls identified a similar association with estrogen receptor-negative breast cancer (rs2363956 per-allele odds ratio (OR) = 0.83, 95% CI 0.75-0.92, P-trend = 0.0003) and an association with estrogen receptor-positive disease in the opposite direction (OR = 1.07, 95% CI 1.01-1.14, P-trend = 0.016). The five SNPs were also associated with triple-negative breast cancer in a separate study of 2,301 triple-negative cases and 3,949 controls (P-trend = 1 × 10⁻⁷ to P-trend = 8 × 10⁻⁵; rs2363956 per-allele OR = 0.80, 95% CI 0.74-0.87, P-trend = 1.1 × 10⁻⁷).

330 citations

Cited by

Journal Article

Fast gapped-read alignment with Bowtie 2

Ben Langmead¹, Steven L. Salzberg¹,²,³

Institutions (3):

University of Maryland, College Park¹, Johns Hopkins University School of Medicine², Johns Hopkins University³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal Article

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹

Institutions (4):

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: Gene Set Enrichment Analysis (GSEA) is an analytical method for interpreting gene expression data that derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

34,830 citations

Journal Article

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

Shaun Purcell¹,², Benjamin M. Neale¹,³, Kathe Todd-Brown², Lori Thomas², Manuel A. R. Ferreira², David Bender¹,², Julian Maller¹,², Pamela Sklar¹,², Paul I.W. de Bakker¹,², Mark J. Daly¹,², Pak C. Sham⁴

Institutions (4):

Massachusetts Institute of Technology¹, Harvard University², University of London³, University of Hong Kong⁴

01 Sep 2007-American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes its five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. It focuses on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies.

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.

26,280 citations

Journal Article

Initial sequencing and analysis of the human genome.

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹, +245 more; Institutions (29)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal Article

limma powers differential expression analyses for RNA-sequencing and microarray studies

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth¹,⁵

Institutions (5):

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015-Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

22,147 citations
