scispace - formally typeset
Search or ask a question
Journal ArticleDOI

A global reference for human genetic variation.

Adam Auton1, Gonçalo R. Abecasis2, David Altshuler3, Richard Durbin4  +514 moreInstitutions (90)
01 Oct 2015-Nature (Nature Publishing Group)-Vol. 526, Iss: 7571, pp 68-74
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
Citations
More filters
Journal ArticleDOI
TL;DR: Genome-wide single-cell arrays and high-throughput sequencing analyses are dramatically increasing the ability to detect embryonic and fetal genetic lesions, and have substantially improved embryo selection for in vitro fertilization.
Abstract: This Timeline article reviews the evolution of both prenatal and pre-implantation genetic testing, from their historic origins to ongoing development and implementation of tools for genome-wide single-cell and cell-free fetal DNA analysis. Future developments and some of the ethical issues that arise from these new technologies are also discussed. The past decade has seen the development of technologies that have revolutionized prenatal genetic testing; that is, genetic testing from conception until birth. Genome-wide single-cell arrays and high-throughput sequencing analyses are dramatically increasing our ability to detect embryonic and fetal genetic lesions, and have substantially improved embryo selection for in vitro fertilization (IVF). Moreover, both invasive and non-invasive mutation scanning of the genome are helping to identify the genetic causes of prenatal developmental disorders. These advances are changing clinical practice and pose novel challenges for genetic counselling and prenatal care.

149 citations

Journal ArticleDOI
TL;DR: In this paper, a critical review focused upon the ADH and ALDH genes as they affect AUDs was presented, and the likely mechanism of protection against heavy drinking and AUDs in both cases is alteration in the rate of metabolism of EtOH that at least transiently elevates acetaldehyde.
Abstract: Alcohol use disorders (AUDs) are complex traits, meaning that variations in many genes contribute to the risk, as does the environment. Although the total genetic contribution to risk is substantial, most individual variations make only very small contributions. By far the strongest contributors are functional variations in 2 genes involved in alcohol (ethanol [EtOH]) metabolism. A functional variant in alcohol dehydrogenase 1B (ADH1B) is protective in people of European and Asian descent, and a different functional variant in the same gene is protective in those of African descent. A strongly protective variant in aldehyde dehydrogenase 2 (ALDH2) is essentially only found in Asians. This highlights the need to study a wide range of populations. The likely mechanism of protection against heavy drinking and AUDs in both cases is alteration in the rate of metabolism of EtOH that at least transiently elevates acetaldehyde. Other ADH and ALDH variants, including functional variations in ADH1C, have also been implicated in affecting drinking behavior and risk for alcoholism. The pattern of linkage disequilibrium in the ADH region and the differences among populations complicate analyses, particularly of regulatory variants. This critical review focuses upon the ADH and ALDH genes as they affect AUDs.

148 citations

Journal ArticleDOI
TL;DR: Application of SuRE reporter technology to survey the effect of 5.9 million SNPs in the human genome on enhancer and promoter activity identifies over 30,000 SNPs that alter the activity of putative regulatory elements.
Abstract: Most of the millions of SNPs in the human genome are non-coding, and many overlap with putative regulatory elements. Genome-wide association studies (GWAS) have linked many of these SNPs to human traits or to gene expression levels, but rarely with sufficient resolution to identify the causal SNPs. Functional screens based on reporter assays have previously been of insufficient throughput to test the vast space of SNPs for possible effects on regulatory element activity. Here we leveraged the throughput and resolution of the survey of regulatory elements (SuRE) reporter technology to survey the effect of 5.9 million SNPs, including 57% of the known common SNPs, on enhancer and promoter activity. We identified more than 30,000 SNPs that alter the activity of putative regulatory elements, partially in a cell-type-specific manner. Integration of this dataset with GWAS results may help to pinpoint SNPs that underlie human traits.

148 citations

Journal ArticleDOI
02 Sep 2021-Cell
TL;DR: A genome-wide association study meta-analysis across 826,690 individuals and identifies 100 independently associated risk variants across 11 osteoarthritis phenotypes, 52 of which have not been associated with the disease before.

148 citations

Journal ArticleDOI
TL;DR: Analysis of genomic data obtained by sequencing hominin fossils with particular emphasis on the unique information that ancient DNA can provide about the demographic history of humans and the authors' closest relatives reveals historical demographic patterns in a way that could be resolved by analyzing present-day genomes alone.
Abstract: We review studies of genomic data obtained by sequencing hominin fossils with particular emphasis on the unique information that ancient DNA (aDNA) can provide about the demographic history of humans and our closest relatives. We concentrate on nuclear genomic sequences that have been published in the past few years. In many cases, particularly in the Arctic, the Americas, and Europe, aDNA has revealed historical demographic patterns in a way that could not be resolved by analyzing present-day genomes alone. Ancient DNA from archaic hominins has revealed a rich history of admixture between early modern humans, Neanderthals, and Denisovans, and has allowed us to disentangle complex selective processes. Information from aDNA studies is nowhere near saturation, and we believe that future aDNA sequences will continue to change our understanding of hominin history.

148 citations

References
More filters
Journal ArticleDOI
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

88,255 citations

Journal ArticleDOI
TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.
Abstract: Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: [email protected]

45,957 citations

Journal ArticleDOI
TL;DR: A new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.
Abstract: Motivation: Testing for correlations between different sets of genomic features is a fundamental task in genomics research. However, searching for overlaps between features with existing webbased methods is complicated by the massive datasets that are routinely produced with current sequencing technologies. Fast and flexible tools are therefore required to ask complex questions of these data in an efficient manner. Results: This article introduces a new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. BEDTools can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets. Availability and implementation: BEDTools was written in C++. Source code and a comprehensive user manual are freely available at http://code.google.com/p/bedtools

18,858 citations

Journal ArticleDOI
06 Sep 2012-Nature
TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.
Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

13,548 citations

Journal ArticleDOI
TL;DR: VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.
Abstract: Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: [email protected]

10,164 citations