scispace - formally typeset
Open AccessJournal ArticleDOI

A global reference for human genetic variation.

Adam Auton, +517 more
- 01 Oct 2015 - 
- Vol. 526, Iss: 7571, pp 68-74
TLDR
The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract
The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

read more

Citations
More filters
Journal ArticleDOI

Analysis of protein-coding genetic variation in 60,706 humans

Monkol Lek, +106 more
- 18 Aug 2016 - 
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Journal ArticleDOI

Genetic effects on gene expression across human tissues.

TL;DR: It is found that local genetic variation affects gene expression levels for the majority of genes, and inter-chromosomal genetic effects for 93 genes and 112 loci are identified, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.
Journal ArticleDOI

Coming of age: ten years of next-generation sequencing technologies

TL;DR: These and other strategies are providing researchers and clinicians a variety of tools to probe genomes in greater depth, leading to an enhanced understanding of how genome sequence variants underlie phenotype and disease.
References
More filters
Journal ArticleDOI

BreakDancer: An algorithm for high resolution mapping of genomic structural variation

TL;DR: The algorithm BreakDancer predicts a wide variety of structural variants including insertion-deletions (indels), inversions and translocations and sensitively and accurately detected indels ranging from 10 base pairs to 1 megabase pair that are difficult to detect via a single conventional approach.
Journal ArticleDOI

CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing

TL;DR: By genotyping CNVs in the CEPH, Yoruba, and Chinese-Japanese populations, it is estimated that at least 11% of all CNV loci involve complex, multi-allelic events, a considerably higher estimate than reported earlier.
Journal ArticleDOI

fastSTRUCTURE: Variational Inference of Population Structure in Large SNP Data Sets

TL;DR: Developing efficient algorithms for approximate inference of the model underlying the STRUCTURE program using a variational Bayesian framework and proposing useful heuristic scores to identify the number of populations represented in a data set and a new hierarchical prior to detect weak population structure in the data.
Journal ArticleDOI

Improved whole-chromosome phasing for disease and population genetic studies.

TL;DR: The computational performance of SHAPEIT2 is competitive compared to other methods and had the property that SER decreases as sample size increases, illustrating that the SHAPEit2 model can adapt to data sets with very high SNP density.
Related Papers (5)