David Altshuler

Other affiliations: Vertex Pharmaceuticals, Massachusetts Institute of Technology, Broad Institute ...

Bio: David Altshuler is an academic researcher from the University of Michigan. The author has contributed to research in the following topics: Genome-wide association study and Population. The author has an h-index of 162 and has co-authored 345 publications that have received 201,782 citations. Previous affiliations of David Altshuler include Vertex Pharmaceuticals and Massachusetts Institute of Technology.

Papers published on a yearly basis (chart omitted; the year axis covers 1992, 1993, and 1998 through 2023)

Papers

Journal Article

Demonstrating stratification in a European American population.

Catarina D. Campbell¹, Elizabeth L. Ogburn¹, Kathryn L. Lunetta², Helen N. Lyon¹,³, Matthew L. Freedman³,⁴, Leif Groop⁵, David Altshuler³,⁴, Kristin G. Ardlie, Joel N. Hirschhorn¹,³,⁴

Boston Children's Hospital¹, Boston University², Harvard University³, Broad Institute⁴, Lund University⁵

24 Jul 2005, Nature Genetics

TL;DR: The failure of standard methods to detect stratification in case-control association studies indicates that new methods may be required, and a SNP in the gene LCT that varies widely in frequency across Europe was strongly associated with height.

Abstract: Population stratification occurs in case-control association studies when allele frequencies differ between cases and controls because of ancestry. Stratification may lead to false positive associations, although this issue remains controversial. Empirical studies have found little evidence of stratification in European-derived populations, but potentially significant levels of stratification could not be ruled out. We studied a European American panel discordant for height, a heritable trait that varies widely across Europe. Genotyping 178 SNPs and applying standard analytical methods yielded no evidence of stratification. But a SNP in the gene LCT that varies widely in frequency across Europe was strongly associated with height (P < 10⁻⁶). This apparent association was largely or completely due to stratification; rematching individuals on the basis of European ancestry greatly reduced the apparent association, and no association was observed in Polish or Scandinavian individuals. The failure of standard methods to detect this stratification indicates that new methods may be required.

459 citations

Journal Article

Novel Loci for Adiponectin Levels and Their Influence on Type 2 Diabetes and Metabolic Traits: A Multi-Ethnic Meta-Analysis of 45,891 Individuals

Zari Dastani¹, Hivert M-F.²,³, N J Timpson⁴, +615 more (128 institutions)

29 Mar 2012, PLOS Genetics

TL;DR: A meta-analysis of genome-wide association studies in 39,883 individuals of European ancestry to identify genes associated with metabolic disease identifies novel genetic determinants of adiponectin levels, which, taken together, influence risk of T2D and markers of insulin resistance.

Abstract: Circulating levels of adiponectin, a hormone produced predominantly by adipocytes, are highly heritable and are inversely associated with type 2 diabetes mellitus (T2D) and other metabolic traits. We conducted a meta-analysis of genome-wide association studies in 39,883 individuals of European ancestry to identify genes associated with metabolic disease. We identified 8 novel loci associated with adiponectin levels and confirmed 2 previously reported loci (P = 4.5×10⁻⁸ to 1.2×10⁻⁴³). Using a novel method to combine data across ethnicities (N = 4,232 African Americans, N = 1,776 Asians, and N = 29,347 Europeans), we identified two additional novel loci. Expression analyses of 436 human adipocyte samples revealed that mRNA levels of 18 genes at candidate regions were associated with adiponectin concentrations after accounting for multiple testing (p < 3×10⁻⁴). We next developed a multi-SNP genotypic risk score to test the association of adiponectin-decreasing risk alleles with metabolic traits and diseases using consortia-level meta-analytic data. This risk score was associated with increased risk of T2D (p = 4.3×10⁻³, n = 22,044), increased triglycerides (p = 2.6×10⁻¹⁴, n = 93,440), increased waist-to-hip ratio (p = 1.8×10⁻⁵, n = 77,167), increased glucose two hours post oral glucose tolerance testing (p = 4.4×10⁻³, n = 15,234), increased fasting insulin (p = 0.015, n = 48,238), but with lower HDL-cholesterol concentrations (p = 4.5×10⁻¹³, n = 96,748) and decreased BMI (p = 1.4×10⁻⁴, n = 121,335). These findings identify novel genetic determinants of adiponectin levels, which, taken together, influence risk of T2D and markers of insulin resistance.

456 citations

Journal Article

Identifying Relationships among Genomic Disease Regions: Predicting Genes at Pathogenic SNP Associations and Rare Deletions

Soumya Raychaudhuri¹, Robert M. Plenge¹,²,³, Elizabeth J. Rossin¹,³,⁴, Aylwin Ng³, Shaun Purcell¹,³, Pamela Sklar, Edward M. Scolnick¹,³, Ramnik J. Xavier³, David Altshuler, Mark J. Daly¹,³

Broad Institute¹, Brigham and Women's Hospital², Harvard University³, Massachusetts Institute of Technology⁴

26 Jun 2009, PLOS Genetics

TL;DR: A statistical method that takes a list of disease regions and automatically assesses the degree of relatedness of implicated genes using 250,000 PubMed abstracts, and offers a statistically robust approach to identifying functionally related genes from across multiple disease regions—that likely represent key disease pathways.

Abstract: Translating a set of disease regions into insight about pathogenic mechanisms requires not only the ability to identify the key disease genes within them, but also the biological relationships among those key genes. Here we describe a statistical method, Gene Relationships Among Implicated Loci (GRAIL), that takes a list of disease regions and automatically assesses the degree of relatedness of implicated genes using 250,000 PubMed abstracts. We first evaluated GRAIL by assessing its ability to identify subsets of highly related genes in common pathways from validated lipid and height SNP associations from recent genome-wide studies. We then tested GRAIL by assessing its ability to separate true disease regions from many false positive disease regions in two separate practical applications in human genetics. First, we took 74 nominally associated Crohn's disease SNPs and applied GRAIL to identify a subset of 13 SNPs with highly related genes. Of these, ten convincingly validated in follow-up genotyping; genotyping results for the remaining three were inconclusive. Next, we applied GRAIL to 165 rare deletion events seen in schizophrenia cases (less than one-third of which are contributing to disease risk). We demonstrate that GRAIL is able to identify a subset of 16 deletions containing highly related genes; many of these genes are expressed in the central nervous system and play a role in neuronal synapses. GRAIL offers a statistically robust approach to identifying functionally related genes from across multiple disease regions that likely represent key disease pathways. An online version of this method is available for public use (http://www.broad.mit.edu/mpg/grail/).

455 citations

Journal Article

Three functional variants of IFN regulatory factor 5 (IRF5) define risk and protective haplotypes for human lupus.

Robert R. Graham¹, Chieko Kyogoku², Snaevar Sigurdsson³, Irina A. Vlasova, Leela Davies⁴,⁵, Emily C. Baechler, Robert M. Plenge⁴,⁵, Thearith Koeuth², Ward A. Ortmann²,⁶, Geoffrey Hom²,⁶, Jason W. Bauer², Clarence Gillett², Noël P. Burtt⁴,⁵, Deborah S. Cunninghame Graham⁷, Robert C. Onofrio⁴,⁵, Michelle Petri⁸, Iva Gunnarsson⁹, Elisabet Svenungsson⁹, Lars Rönnblom³, Gunnel Nordmark³, Peter K. Gregersen¹⁰, Kathy L. Moser², Patrick M. Gaffney², Lindsey A. Criswell¹¹, Timothy J. Vyse⁷, Ann-Christine Syvänen³, Paul R. Bohjanen, Mark J. Daly⁴,⁵, Timothy W. Behrens²,⁶, David Altshuler⁴,⁵

Massachusetts Institute of Technology¹, University of Minnesota², Uppsala University³, Broad Institute⁴, Harvard University⁵, Genentech⁶, Hammersmith Hospital⁷, Johns Hopkins University⁸, Karolinska Institutet⁹, North Shore-LIJ Health System¹⁰, University of California, San Francisco¹¹

17 Apr 2007, Proceedings of the National Academy of Sciences of the United States of America

TL;DR: Evidence is found for three functional alleles of IRF5: the previously described exon 1B splice site variant, a 30-bp in-frame insertion/deletion variant of exon 6 that alters a proline-, glutamic acid-, serine- and threonine-rich domain region, and a variant in a conserved polyA+ signal sequence that alters the length of the 3′ UTR and stability of IRF5 mRNAs.

Abstract: Systematic genome-wide studies to map genomic regions associated with human diseases are becoming more practical. Increasingly, efforts will be focused on the identification of the specific functional variants responsible for the disease. The challenges of identifying causal variants include the need for complete ascertainment of genetic variants and the need to consider the possibility of multiple causal alleles. We recently reported that risk of systemic lupus erythematosus (SLE) is strongly associated with a common SNP in IFN regulatory factor 5 (IRF5), and that this variant altered splicing in a way that might provide a functional explanation for the reproducible association to SLE risk. Here, by resequencing and genotyping in patients with SLE, we find evidence for three functional alleles of IRF5: the previously described exon 1B splice site variant, a 30-bp in-frame insertion/deletion variant of exon 6 that alters a proline-, glutamic acid-, serine- and threonine-rich domain region, and a variant in a conserved polyA+ signal sequence that alters the length of the 3' UTR and stability of IRF5 mRNAs. Haplotypes of these three variants define at least three distinct levels of risk to SLE. Understanding how combinations of variants influence IRF5 function may offer etiological and therapeutic insights in SLE; more generally, IRF5 and SLE illustrate how multiple common variants of the same gene can together influence risk of common disease.

441 citations

Journal Article

TXNIP regulates peripheral glucose metabolism in humans

Hemang Parikh¹, Emma Carlsson¹,², William A. Chutkow³, Lovisa Johansson¹, Heidi Storgaard², Pernille Poulsen², Richa Saxena⁴,⁵, Christine Ladd⁵, P. Christian Schulze³, Michael J. Mazzini³, Christine B. Jensen², Anna Krook⁶, Marie Björnholm⁶, Hans Tornqvist⁷, Juleen R. Zierath⁶, Martin Ridderstråle¹, David Altshuler⁴,⁵, Richard T. Lee³, Allan Vaag¹,², Leif Groop¹,⁸, Vamsi K. Mootha⁴,⁵

Lund University¹, Steno Diabetes Center², Brigham and Women's Hospital³, Harvard University⁴, Broad Institute⁵, Karolinska Institutet⁶, Novo Nordisk⁷, University of Helsinki⁸

01 May 2007, PLOS Medicine

TL;DR: The data suggest that TXNIP might play a key role in defective glucose homeostasis preceding overt T2DM, as it regulates both insulin-dependent and insulin-independent pathways of glucose uptake in human skeletal muscle.

Abstract: Background: Type 2 diabetes mellitus (T2DM) is characterized by defects in insulin secretion and action. Impaired glucose uptake in skeletal muscle is believed to be one of the earliest features in the natural history of T2DM, although underlying mechanisms remain obscure. Methods and Findings: We combined human insulin/glucose clamp physiological studies with genome-wide expression profiling to identify thioredoxin interacting protein (TXNIP) as a gene whose expression is powerfully suppressed by insulin yet stimulated by glucose. In healthy individuals, its expression was inversely correlated to total body measures of glucose uptake. Forced expression of TXNIP in cultured adipocytes significantly reduced glucose uptake, while silencing with RNA interference in adipocytes and in skeletal muscle enhanced glucose uptake, confirming that the gene product is also a regulator of glucose uptake. TXNIP expression is consistently elevated in the muscle of prediabetics and diabetics, although in a panel of 4,450 Scandinavian individuals, we found no evidence for association between common genetic variation in the TXNIP gene and T2DM.

437 citations

Cited by

Journal Article

Fast gapped-read alignment with Bowtie 2

Ben Langmead¹, Steven L. Salzberg¹,²,³

University of Maryland, College Park¹, Johns Hopkins University School of Medicine², Johns Hopkins University³

01 Apr 2012, Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal Article

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005, Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The Gene Set Enrichment Analysis (GSEA) method interprets gene expression data by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

34,830 citations

Journal Article

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

Shaun Purcell¹,², Benjamin M. Neale¹,³, Kathe Todd-Brown², Lori Thomas², Manuel A. R. Ferreira², David Bender¹,², Julian Maller¹,², Pamela Sklar¹,², Paul I.W. de Bakker¹,², Mark J. Daly¹,², Pak C. Sham⁴

Massachusetts Institute of Technology¹, Harvard University², University of London³, University of Hong Kong⁴

01 Sep 2007, American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, describes its five main domains of function (data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation), and focuses on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies.

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.

26,280 citations

Journal Article

Initial sequencing and analysis of the human genome.

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹, +245 more (29 institutions)

15 Feb 2001, Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal Article

limma powers differential expression analyses for RNA-sequencing and microarray studies

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth¹,⁵

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015, Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

22,147 citations
