scispace - formally typeset
Search or ask a question
Author

David Altshuler

Bio: David Altshuler is an academic researcher from University of Michigan. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 162, co-authored 345 publications receiving 201782 citations. Previous affiliations of David Altshuler include Vertex Pharmaceuticals & Massachusetts Institute of Technology.


Papers
More filters
01 Jan 2019
TL;DR: The authors used exome-sequencing analyses of a large cohort of patients with Type 2 diabetes and control individuals without diabetes from five ancestries to identify gene-level associations of rare variants that are associated with type 2 diabetes.
Abstract: Protein-coding genetic variants that strongly affect disease risk can yield relevant clues to disease pathogenesis. Here we report exome-sequencing analyses of 20,791 individuals with type 2 diabetes (T2D) and 24,440 non-diabetic control participants from 5 ancestries. We identify gene-level associations of rare variants (with minor allele frequencies of less than 0.5%) in 4 genes at exome-wide significance, including a series of more than 30 SLC30A8 alleles that conveys protection against T2D, and in 12 gene sets, including those corresponding to T2D drug targets (P = 6.1 × 10−3) and candidate genes from knockout mice (P = 5.2 × 10−3). Within our study, the strongest T2D gene-level signals for rare variants explain at most 25% of the heritability of the strongest common single-variant signals, and the gene-level effect sizes of the rare variants that we observed in established T2D drug targets will require 75,000–185,000 sequenced cases to achieve exome-wide significance. We propose a method to interpret these modest rare-variant associations and to incorporate these associations into future target or gene prioritization efforts.Exome-sequencing analyses of a large cohort of patients with type 2 diabetes and control individuals without diabetes from five ancestries are used to identify gene-level associations of rare variants that are associated with type 2 diabetes.

107 citations

Journal ArticleDOI
14 Mar 2013-Nature
TL;DR: Rieder as mentioned in this paper was a member of the Seattle Grand Opportunity group and oversaw data generation and quality control and was one of the pioneers in the development of the GANs.
Abstract: Nature 493, 216–220 (2013); doi:10.1038/nature11690 In this Letter, Mark J. Rieder (Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA) was inadvertently omitted from the author list. He oversaw data generation and quality control and is a member of the Seattle Grand Opportunity group.

99 citations

Journal ArticleDOI
TL;DR: The authors' association analyses identified more comprehensive sets of variants showing equivalent statistical association with type 2 diabetes or myocardial infarction, but did not identify stronger associations than the original GWAS signals.
Abstract: Noncoding variants at human chromosome 9p21 near CDKN2A and CDKN2B are associated with type 2 diabetes, myocardial infarction, aneurysm, vertical cup disc ratio and at least five cancers. Here we compare approaches to more comprehensively assess genetic variation in the region. We carried out targeted sequencing at high coverage in 47 individuals and compared the results to pilot data from the 1000 Genomes Project. We imputed variants into type 2 diabetes and myocardial infarction cohorts directly from targeted sequencing, from a genotyped reference panel derived from sequencing and from 1000 Genomes Project low-coverage data. Polymorphisms with frequency >5% were captured well by all strategies. Imputation of intermediate-frequency polymorphisms required a higher density of tag SNPs in disease samples than is available on first-generation genome-wide association study (GWAS) arrays. Our association analyses identified more comprehensive sets of variants showing equivalent statistical association with type 2 diabetes or myocardial infarction, but did not identify stronger associations than the original GWAS signals.

99 citations

Journal ArticleDOI
01 Sep 2008-Diabetes
TL;DR: It is unable to replicate the GWAS findings regarding diabetes risk in the DPP, but did observe genotype associations with differences in baseline insulin secretion at the HHEX locus and a possible pharmacogenetic interaction at CDKNA2/B.
Abstract: OBJECTIVE: Genome-wide association scans (GWASs) have identified novel diabetes-associated genes. We evaluated how these variants impact diabetes incidence, quantitative glycemic traits, and respon ...

97 citations

Journal ArticleDOI
TL;DR: This study diagnosed clinically unapparent cholesterol ester storage disease in the affected individuals from this kindred and addressed an outstanding question about risk of cardiovascular disease in LIPA E8SJM heterozygous carriers.
Abstract: Objective— Autosomal recessive hypercholesterolemia is a rare inherited disorder, characterized by extremely high total and low-density lipoprotein cholesterol levels, that has been previously linked to mutations in LDLRAP1 . We identified a family with autosomal recessive hypercholesterolemia not explained by mutations in LDLRAP1 or other genes known to cause monogenic hypercholesterolemia. The aim of this study was to identify the molecular pathogenesis of autosomal recessive hypercholesterolemia in this family. Approach and Results— We used exome sequencing to assess all protein-coding regions of the genome in 3 family members and identified a homozygous exon 8 splice junction mutation (c.894G>A, also known as E8SJM) in LIPA that segregated with the diagnosis of hypercholesterolemia. Because homozygosity for mutations in LIPA is known to cause cholesterol ester storage disease, we performed directed follow-up phenotyping by noninvasively measuring hepatic cholesterol content. We observed abnormal hepatic accumulation of cholesterol in the homozygote individuals, supporting the diagnosis of cholesterol ester storage disease. Given previous suggestions of cardiovascular disease risk in heterozygous LIPA mutation carriers, we genotyped E8SJM in >27 000 individuals and found no association with plasma lipid levels or risk of myocardial infarction, confirming a true recessive mode of inheritance. Conclusions— By integrating observations from Mendelian and population genetics along with directed clinical phenotyping, we diagnosed clinically unapparent cholesterol ester storage disease in the affected individuals from this kindred and addressed an outstanding question about risk of cardiovascular disease in LIPA E8SJM heterozygous carriers.

97 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal ArticleDOI
TL;DR: The Gene Set Enrichment Analysis (GSEA) method as discussed by the authors focuses on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.
Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

34,830 citations

Journal ArticleDOI
TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.
Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.

26,280 citations

Journal ArticleDOI
Eric S. Lander1, Lauren Linton1, Bruce W. Birren1, Chad Nusbaum1  +245 moreInstitutions (29)
15 Feb 2001-Nature
TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.
Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal ArticleDOI
TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

22,147 citations