Showing papers by "Richard Durbin published in 2015"

PDF

Open Access

Journal Article•DOI•

A global reference for human genetic variation.

[...]

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.

...read moreread less

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

...read moreread less

12,661 citations

A global reference for human genetic variation

[...]

Adam Auton, Gonçalo R. Abecasis, David Altshuler, Richard Durbin +476 more

01 Oct 2015

TL;DR: The 1000 Genomes Project as mentioned in this paper provided a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and reported the completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole genome sequencing, deep exome sequencing and dense microarray genotyping.

...read moreread less

3,247 citations

Journal Article•DOI•

The UK10K project identifies rare variants in health and disease

[...]

Klaudia Walter¹, J L Min², Jie Huang¹, Lucy Crooks, Yasin Memari³, Shane A. McCarthy³, Perry Jrb.⁴, ChangJiang Xu⁴, Marta Futema⁵, Daniel Lawson², Valentina Iotchkova, Stephan Schiffels³, Audrey E. Hendricks⁶, Petr Danecek³, R Li¹, James A B Floyd⁷, Louise V. Wain², Louise V. Wain⁸, Inês Barroso³, Steve E. Humphries⁵, Matthew E. Hurles³, Eleftheria Zeggini³, Jeffrey C. Barrett³, Vincent Plagnol⁵, J. B. Richards⁴, Greenwood Cmt.², Nicholas J. Timpson², Richard Durbin³, Nicole Soranzo⁹ - Show less +25 more•Institutions (9)

Max Planck Society¹, University of Bristol², Wellcome Trust Sanger Institute³, McGill University⁴, University College London⁵, University of Colorado Denver⁶, Queen Mary University of London⁷, University of Leicester⁸, University of Cambridge⁹

01 Oct 2015-Nature

TL;DR: In extensively phenotyped cohorts, insights from sequencing whole genomes or exomes of nearly 10,000 individuals from population-based and disease collections are described and population structure and functional annotation of rare and low-frequency variants are described.

...read moreread less

Abstract: The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results.

...read moreread less

948 citations

The UK10K project identifies rare variants in health and disease

[...]

Klaudia Walter, Josine L. Min, Jie Huang, Lucy Crooks +238 more

01 Jan 2015

TL;DR: The contribution of rare and low-frequency variants to human traits is largely unexplored as mentioned in this paper, but the contribution of these variants to the human traits has not yet been fully explored.

...read moreread less

824 citations

Journal Article•DOI•

Genomic evidence for the Pleistocene and recent population history of Native Americans

[...]

Maanasa Raghavan¹, Matthias Steinrücken², Matthias Steinrücken³, Kelley Harris², Stephan Schiffels⁴, Simon Rasmussen⁵, Michael DeGiorgio⁶, Anders Albrechtsen¹, Cristina Valdiosera¹, Cristina Valdiosera⁷, María C. Ávila-Arcos¹, María C. Ávila-Arcos⁸, Anna-Sapfo Malaspinas¹, Anders Eriksson⁹, Anders Eriksson¹⁰, Ida Moltke¹, Mait Metspalu¹¹, Mait Metspalu¹², Julian R. Homburger⁸, Jeffrey D. Wall¹³, Omar E. Cornejo¹⁴, J. Víctor Moreno-Mayar¹, Thorfinn Sand Korneliussen¹, Tracey Pierre¹, Morten Rasmussen⁸, Morten Rasmussen¹, Paula F. Campos¹, Paula F. Campos¹⁵, Peter de Barros Damgaard¹, Morten E. Allentoft¹, John Lindo¹⁶, Ene Metspalu¹², Ene Metspalu¹¹, Ricardo Rodríguez-Varela¹⁷, Josefina Mansilla, Celeste Henrickson¹⁸, Andaine Seguin-Orlando¹, Helena Malmström¹⁹, Thomas W. Stafford²⁰, Thomas W. Stafford¹, Suyash Shringarpure⁸, Andrés Moreno-Estrada⁸, Monika Karmin¹², Monika Karmin¹¹, Kristiina Tambets¹¹, Anders Bergström⁴, Yali Xue⁴, Vera Warmuth²¹, Andrew D. Friend⁹, Joy S. Singarayer²², Paul J. Valdes²³, Francois Balloux, Ilán Leboreiro, Jose Luis Vera, Héctor Rangel-Villalobos²⁴, Davide Pettener²⁵, Donata Luiselli²⁵, Loren G. Davis²⁶, Evelyne Heyer²⁷, Christoph P. E. Zollikofer²⁸, Marcia S. Ponce de León²⁸, Colin Smith⁷, Vaughan Grimes²⁹, Vaughan Grimes³⁰, Kelly-Anne Pike²⁹, Michael Deal²⁹, Benjamin T. Fuller³¹, Bernardo Arriaza³², Vivien G. Standen³², Maria F. Luz, Francois Ricaut³³, Niede Guidon, Ludmila P. Osipova³⁴, Ludmila P. Osipova³⁵, Mikhail Voevoda³⁵, Mikhail Voevoda³⁴, Olga L. Posukh³⁴, Olga L. Posukh³⁵, Oleg Balanovsky, Maria Lavryashina³⁶, Yuri Bogunov, Elza Khusnutdinova³⁴, Elza Khusnutdinova³⁷, Marina Gubina, Elena Balanovska, Sardana A. Fedorova³⁸, Sergey Litvinov³⁴, Sergey Litvinov¹¹, Boris Malyarchuk³⁴, Miroslava Derenko³⁴, M. J. Mosher³⁹, David Archer⁴⁰, Jerome S. Cybulski⁴¹, Jerome S. Cybulski⁴², Barbara Petzelt, Joycelynn Mitchell, Rosita Worl, Paul Norman⁸, Peter Parham⁸, Brian M. Kemp¹⁴, Toomas Kivisild⁹, Toomas Kivisild¹¹, Chris Tyler-Smith⁴, Manjinder S. Sandhu⁴, Manjinder S. Sandhu⁴³, Michael H. Crawford⁴⁴, Richard Villems¹², Richard Villems¹¹, David Glenn Smith⁴⁵, Michael R. Waters⁴⁶, Ted Goebel⁴⁶, John R. Johnson⁴⁷, Ripan S. Malhi¹⁶, Mattias Jakobsson¹⁹, David J. Meltzer¹, David J. Meltzer⁴⁸, Andrea Manica⁹, Richard Durbin⁴, Carlos Bustamante⁸, Yun S. Song², Rasmus Nielsen², Eske Willerslev¹ - Show less +118 more•Institutions (48)

University of Copenhagen¹, University of California, Berkeley², University of Massachusetts Amherst³, Wellcome Trust Sanger Institute⁴, Technical University of Denmark⁵, Pennsylvania State University⁶, La Trobe University⁷, Stanford University⁸, University of Cambridge⁹, King Abdullah University of Science and Technology¹⁰, Estonian Biocentre¹¹, University of Tartu¹², University of California, San Francisco¹³, Washington State University¹⁴, University of Porto¹⁵, University of Illinois at Urbana–Champaign¹⁶, Carlos III Health Institute¹⁷, University of Utah¹⁸, Science for Life Laboratory¹⁹, Aarhus University²⁰, University College London²¹, University of Reading²², University of Bristol²³, University of Guadalajara²⁴, University of Bologna²⁵, Oregon State University²⁶, University of Paris²⁷, University of Zurich²⁸, St. John's University²⁹, Max Planck Society³⁰, University of California, Irvine³¹, University of Tarapacá³², University of Toulouse³³, Russian Academy of Sciences³⁴, Novosibirsk State University³⁵, Kemerovo State University³⁶, Bashkir State University³⁷, North-Eastern Federal University³⁸, Western Washington University³⁹, Northwest Community College⁴⁰, University of Western Ontario⁴¹, Simon Fraser University⁴², Laboratory of Molecular Biology⁴³, University of Kansas⁴⁴, University of California, Davis⁴⁵, Texas A&M University⁴⁶, Santa Barbara Museum of Natural History⁴⁷, Southern Methodist University⁴⁸

21 Aug 2015-Science

TL;DR: The results suggest that there has been gene flow between some Native Americans from both North and South America and groups related to East Asians and Australo-Melanesians, the latter possibly through an East Asian route that might have included ancestors of modern Aleutian Islanders.

...read moreread less

Abstract: How and when the Americas were populated remains contentious. Using ancient and modern genome-wide data, we found that the ancestors of all present-day Native Americans, including Athabascans and Amerindians, entered the Americas as a single migration wave from Siberia no earlier than 23 thousand years ago (ka) and after no more than an 8000-year isolation period in Beringia. After their arrival to the Americas, ancestral Native Americans diversified into two basal genetic branches around 13 ka, one that is now dispersed across North and South America and the other restricted to North America. Subsequent gene flow resulted in some Native Americans sharing ancestry with present-day East Asians (including Siberians) and, more distantly, Australo-Melanesians. Putative "Paleoamerican" relict populations, including the historical Mexican Pericues and South American Fuego-Patagonians, are not directly related to modern Australo-Melanesians as suggested by the Paleoamerican Model.

...read moreread less

459 citations

Journal Article•DOI•

Whole-genome sequencing identifies EN1 as a determinant of bone density and fracture.

[...]

Hou-Feng Zheng¹, Vincenzo Forgetta¹, Yi-Hsiang Hsu², Yi-Hsiang Hsu³ +171 more•Institutions (55)

01 Oct 2015-Nature

TL;DR: Evidence is provided that low‐frequency non‐coding variants have large effects on BMD and fracture, thereby providing rationale for whole‐genome sequencing and improved imputation reference panels to study the genetic architecture of complex traits and disease in the general population.

...read moreread less

Abstract: The extent to which low-frequency (minor allele frequency (MAF) between 1-5%) and rare (MAF ≤ 1%) variants contribute to complex traits and disease in the general population is mainly unknown. Bone mineral density (BMD) is highly heritable, a major predictor of osteoporotic fractures, and has been previously associated with common genetic variants, as well as rare, population-specific, coding variants. Here we identify novel non-coding genetic variants with large effects on BMD (ntotal = 53,236) and fracture (ntotal = 508,253) in individuals of European ancestry from the general population. Associations for BMD were derived from whole-genome sequencing (n = 2,882 from UK10K (ref. 10); a population-based genome sequencing consortium), whole-exome sequencing (n = 3,549), deep imputation of genotyped samples using a combined UK10K/1000 Genomes reference panel (n = 26,534), and de novo replication genotyping (n = 20,271). We identified a low-frequency non-coding variant near a novel locus, EN1, with an effect size fourfold larger than the mean of previously reported common variants for lumbar spine BMD (rs11692564(T), MAF = 1.6%, replication effect size = +0.20 s.d., Pmeta = 2 × 10(-14)), which was also associated with a decreased risk of fracture (odds ratio = 0.85; P = 2 × 10(-11); ncases = 98,742 and ncontrols = 409,511). Using an En1(cre/flox) mouse model, we observed that conditional loss of En1 results in low bone mass, probably as a consequence of high bone turnover. We also identified a novel low-frequency non-coding variant with large effects on BMD near WNT16 (rs148771817(T), MAF = 1.2%, replication effect size = +0.41 s.d., Pmeta = 1 × 10(-11)). In general, there was an excess of association signals arising from deleterious coding and conserved non-coding variants. These findings provide evidence that low-frequency non-coding variants have large effects on BMD and fracture, thereby providing rationale for whole-genome sequencing and improved imputation reference panels to study the genetic architecture of complex traits and disease in the general population.

...read moreread less

410 citations

Journal Article•DOI•

Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel

[...]

Jie Huang¹, Bryan Howie, Shane A. McCarthy¹, Yasin Memari¹, Klaudia Walter¹, Josine L. Min², Petr Danecek¹, Giovanni Malerba³, Elisabetta Trabetti³, Hou-Feng Zheng⁴, Hou-Feng Zheng⁵, Giovanni Gambaro⁶, J. Brent Richards, Richard Durbin¹, Nicholas J. Timpson², Jonathan Marchini⁷, Jonathan Marchini⁸, Nicole Soranzo⁹, Nicole Soranzo¹ - Show less +15 more•Institutions (9)

Wellcome Trust Sanger Institute¹, University of Bristol², University of Verona³, McGill University⁴, Jewish General Hospital⁵, The Catholic University of America⁶, Wellcome Trust Centre for Human Genetics⁷, University of Oxford⁸, University of Cambridge⁹

14 Sep 2015-Nature Communications

TL;DR: It is shown that large increases in imputation accuracy can be achieved by re-phasing WGS reference panels after initial genotype calling, and a method for combining WGS panels to improve variant coverage and downstream imputations accuracy is presented.

...read moreread less

Abstract: Imputing genotypes from reference panels created by whole-genome sequencing (WGS) provides a cost-effective strategy for augmenting the single-nucleotide polymorphism (SNP) content of genome-wide arrays. The UK10K Cohorts project has generated a data set of 3,781 whole genomes sequenced at low depth (average 7x), aiming to exhaustively characterize genetic variation down to 0.1% minor allele frequency in the British population. Here we demonstrate the value of this resource for improving imputation accuracy at rare and low-frequency variants in both a UK and an Italian population. We show that large increases in imputation accuracy can be achieved by re-phasing WGS reference panels after initial genotype calling. We also present a method for combining WGS panels to improve variant coverage and downstream imputation accuracy, which we illustrate by integrating 7,562 WGS haplotypes from the UK10K project with 2,184 haplotypes from the 1000 Genomes Project. Finally, we introduce a novel approximation that maintains speed without sacrificing imputation accuracy for rare variants.

...read moreread less

318 citations

Journal Article•DOI•

Genomic islands of speciation separate cichlid ecomorphs in an East African crater lake.

[...]

Milan Malinsky¹, Milan Malinsky², Richard Challis³, Alexandra M. Tyers³, Stephan Schiffels², Yohey Terai, Benjamin P. Ngatunga, Eric A. Miska¹, Eric A. Miska², Richard Durbin², Martin J. Genner⁴, George F. Turner³ - Show less +8 more•Institutions (4)

University of Cambridge¹, Wellcome Trust Sanger Institute², Bangor University³, University of Bristol⁴

18 Dec 2015-Science

TL;DR: The discovery and detailed characterization of early-stage adaptive divergence of two cichlid fish ecomorphs in a small crater lake in Tanzania are reported and mechanisms and genomic regions that may play a role in the closely related mega-radiation of Lake Malawi are suggested.

...read moreread less

Abstract: The genomic causes and effects of divergent ecological selection during speciation are still poorly understood. Here we report the discovery and detailed characterization of early-stage adaptive divergence of two cichlid fish ecomorphs in a small (700 meters in diameter) isolated crater lake in Tanzania. The ecomorphs differ in depth preference, male breeding color, body shape, diet, and trophic morphology. With whole-genome sequences of 146 fish, we identified 98 clearly demarcated genomic “islands” of high differentiation and demonstrated the association of genotypes across these islands with divergent mate preferences. The islands contain candidate adaptive genes enriched for functions in sensory perception (including rhodopsin and other twilight-vision–associated genes), hormone signaling, and morphogenesis. Our study suggests mechanisms and genomic regions that may play a role in the closely related mega-radiation of Lake Malawi.

...read moreread less

316 citations

Journal Article•DOI•

Gene-gene and gene-environment interactions detected by transcriptome sequence analysis in twins

[...]

Alfonso Buil¹, Andrew A. Brown², Tuuli Lappalainen¹, Ana Viñuela³, Matthew N. Davies³, Hou-Feng Zheng⁴, J. Brent Richards³, Daniel Glass³, Kerrin S. Small³, Richard Durbin², Tim D. Spector³, Emmanouil T. Dermitzakis¹ - Show less +8 more•Institutions (4)

Swiss Institute of Bioinformatics¹, Wellcome Trust Sanger Institute², King's College London³, McGill University⁴

01 Jan 2015-Nature Genetics

TL;DR: A model where ASE requires genetic variability in cis, a difference in the sequence of both alleles, but where the magnitude of the ASE effect depends on trans genetic and environmental factors that interact with the cis genetic variants is proposed.

...read moreread less

Abstract: Understanding the genetic architecture of gene expression is an intermediate step in understanding the genetic architecture of complex diseases. RNA sequencing technologies have improved the quantification of gene expression and allow measurement of allele-specific expression (ASE). ASE is hypothesized to result from the direct effect of cis regulatory variants, but a proper estimation of the causes of ASE has not been performed thus far. In this study, we take advantage of a sample of twins to measure the relative contributions of genetic and environmental effects to ASE, and we find substantial effects from gene × gene (G×G) and gene × environment (G×E) interactions. We propose a model where ASE requires genetic variability in cis, a difference in the sequence of both alleles, but where the magnitude of the ASE effect depends on trans genetic and environmental factors that interact with the cis genetic variants.

...read moreread less

200 citations

Journal Article•DOI•

The genomic and phenotypic diversity of Schizosaccharomyces pombe

[...]

Daniel C. Jeffares¹, Charalampos Rallis¹, Adrien Rieux¹, Doug Speed¹, Martin Převorovský², Tobias Mourier³, Francesc Xavier Marsellach¹, Zamin Iqbal⁴, Winston Lau¹, Tammy M. K. Cheng⁵, Rodrigo Pracana¹, Michael Mülleder⁶, Jonathan L.D. Lawson⁶, Anatole Chessel⁶, Sendu Bala⁷, Garrett Hellenthal¹, Brendan D. O'Fallon⁸, Thomas M. Keane⁷, Jared T. Simpson⁷, Leanne Bischof⁹, Bartlomiej Tomiczek¹, Danny A. Bitton¹, Theodora C. Sideri¹, Sandra Codlin¹, Josephine E. E. U. Hellberg¹, Laurent van Trigt¹, Linda Jeffery⁵, Juan Juan Li⁵, Sophie R. Atkinson¹, Malte Thodberg³, Melanie Febrer¹⁰, Kirsten McLay¹⁰, Nizar Drou¹⁰, William Brown¹¹, Jacqueline Hayles⁵, Rafael E. Carazo Salas⁶, Markus Ralser¹², Nikolas Maniatis¹, David J. Balding¹, Francois Balloux¹, Richard Durbin⁷, Jürg Bähler¹ - Show less +38 more•Institutions (12)

University College London¹, Charles University in Prague², University of Copenhagen³, Wellcome Trust Centre for Human Genetics⁴, London Research Institute⁵, University of Cambridge⁶, Wellcome Trust Sanger Institute⁷, University of Utah⁸, Commonwealth Scientific and Industrial Research Organisation⁹, Norwich University¹⁰, University of Nottingham¹¹, National Institute for Medical Research¹²

01 Mar 2015-Nature Genetics

TL;DR: The fission yeast Schizosaccharomyces pombe is an important model for eukaryotic biology, but researchers typically use one standard laboratory strain, so this analysis represents a rich resource to examine genotype-phenotype relationships in a tractable model.

...read moreread less

Abstract: Natural variation within species reveals aspects of genome evolution and function. The fission yeast Schizosaccharomyces pombe is an important model for eukaryotic biology, but researchers typically use one standard laboratory strain. To extend the usefulness of this model, we surveyed the genomic and phenotypic variation in 161 natural isolates. We sequenced the genomes of all strains, finding moderate genetic diversity (π = 3 × 10(-3) substitutions/site) and weak global population structure. We estimate that dispersal of S. pombe began during human antiquity (∼340 BCE), and ancestors of these strains reached the Americas at ∼1623 CE. We quantified 74 traits, finding substantial heritable phenotypic diversity. We conducted 223 genome-wide association studies, with 89 traits showing at least one association. The most significant variant for each trait explained 22% of the phenotypic variance on average, with indels having larger effects than SNPs. This analysis represents a rich resource to examine genotype-phenotype relationships in a tractable model.

...read moreread less

174 citations

Journal Article•DOI•

Extending reference assembly models

[...]

Deanna M. Church, Valerie A. Schneider¹, Karyn Meltz Steinberg², Michael C. Schatz³, Aaron R. Quinlan, Chen-Shan Chin⁴, Paul Kitts¹, Bronwen Aken⁵, Gabor T. Marth⁶, Michael M. Hoffman⁷, Michael M. Hoffman⁸, Javier Herrero⁹, M. Lisandra Zepeda Mendoza¹⁰, Richard Durbin¹¹, Paul Flicek⁵ - Show less +11 more•Institutions (11)

National Institutes of Health¹, Washington University in St. Louis², Cold Spring Harbor Laboratory³, Pacific Biosciences⁴, European Bioinformatics Institute⁵, University of Utah⁶, University of Toronto⁷, Princess Margaret Cancer Centre⁸, University College London⁹, University of Copenhagen¹⁰, Wellcome Trust¹¹

24 Jan 2015-Genome Biology

TL;DR: The models and analysis assumptions that underlie the current assembly need revising to fully represent human sequence diversity and improved analysis tools and updated data reporting formats are required.

...read moreread less

Abstract: The human genome reference assembly is crucial for aligning and analyzing sequence data, and for genome annotation, among other roles. However, the models and analysis assumptions that underlie the current assembly need revising to fully represent human sequence diversity. Improved analysis tools and updated data reporting formats are also required.

...read moreread less

Journal Article•DOI•

Tracing the Route of Modern Humans out of Africa by Using 225 Human Genome Sequences from Ethiopians and Egyptians

[...]

Luca Pagani¹, Luca Pagani², Luca Pagani³, Stephan Schiffels², Deepti Gurdasani², Petr Danecek², Aylwyn Scally¹, Yuan Chen², Yali Xue², Marc Haber², Marc Haber⁴, Rosemary Ekong⁵, Tamiru Oljira⁶, Ephrem Mekonnen⁶, Donata Luiselli³, Neil Bradman, Endashaw Bekele⁶, Pierre Zalloua⁴, Pierre Zalloua⁷, Richard Durbin², Toomas Kivisild¹, Chris Tyler-Smith² - Show less +18 more•Institutions (7)

University of Cambridge¹, Wellcome Trust Sanger Institute², University of Bologna³, Lebanese American University⁴, University College London⁵, Addis Ababa University⁶, Harvard University⁷

04 Jun 2015-American Journal of Human Genetics

TL;DR: Both the haplotype and MSMC analyses suggest a predominant northern route out of Africa via Egypt, pointing to Egypt as the more likely gateway in the exodus to the rest of the world.

...read moreread less

Abstract: The predominantly African origin of all modern human populations is well established, but the route taken out of Africa is still unclear Two alternative routes, via Egypt and Sinai or across the Bab el Mandeb strait into Arabia, have traditionally been proposed as feasible gateways in light of geographic, paleoclimatic, archaeological, and genetic evidence Distinguishing among these alternatives has been difficult We generated 225 whole-genome sequences (225 at 8× depth, of which 8 were increased to 30×; Illumina HiSeq 2000) from six modern Northeast African populations (100 Egyptians and five Ethiopian populations each represented by 25 individuals) West Eurasian components were masked out, and the remaining African haplotypes were compared with a panel of sub-Saharan African and non-African genomes We showed that masked Northeast African haplotypes overall were more similar to non-African haplotypes and more frequently present outside Africa than were any sets of haplotypes derived from a West African population Furthermore, the masked Egyptian haplotypes showed these properties more markedly than the masked Ethiopian haplotypes, pointing to Egypt as the more likely gateway in the exodus to the rest of the world Using five Ethiopian and three Egyptian high-coverage masked genomes and the multiple sequentially Markovian coalescent (MSMC) approach, we estimated the genetic split times of Egyptians and Ethiopians from non-African populations at 55,000 and 65,000 years ago, respectively, whereas that of West Africans was estimated to be 75,000 years ago Both the haplotype and MSMC analyses thus suggest a predominant northern route out of Africa via Egypt

...read moreread less

Journal Article•DOI•

Immunofluorescence Analysis and Diagnosis of Primary Ciliary Dyskinesia with Radial Spoke Defects

[...]

Adrien Frommer¹, Rim Hjeij¹, Niki T. Loges¹, Christine Edelbusch¹, Charlotte Jahnke¹, Johanna Raidt¹, Claudius Werner¹, Julia Wallmeier¹, Jörg Große-Onnebrink¹, Heike Olbrich¹, Sandra Cindric¹, Martine Jaspers², Mieke Boon², Yasin Memari³, Richard Durbin³, Anja Kolb-Kokocinski³, Sascha Sauer⁴, June K. Marthin⁵, Kim G. Nielsen⁵, Israel Amirav⁶, Nael Elias, Eitan Kerem⁷, David Shoseyov⁷, Karsten Haeffner, Heymut Omran¹ - Show less +21 more•Institutions (7)

Boston Children's Hospital¹, Katholieke Universiteit Leuven², Wellcome Trust Sanger Institute³, Max Planck Society⁴, Copenhagen University Hospital⁵, University of Alberta⁶, Hebrew University of Jerusalem⁷

01 Oct 2015-American Journal of Respiratory Cell and Molecular Biology

TL;DR: Immunofluorescence analysis can improve diagnosis of PCD in patients with loss-of-function mutations as well as missense variants, and performed high-resolution immunofluorescent analysis of human respiratory cilia.

...read moreread less

Abstract: Primary ciliary dyskinesia (PCD) is a genetically heterogeneous recessive disorder caused by several distinct defects in genes responsible for ciliary beating, leading to defective mucociliary clearance often associated with randomization of left/right body asymmetry. Individuals with PCD caused by defective radial spoke (RS) heads are difficult to diagnose owing to lack of gross ultrastructural defects and absence of situs inversus. Thus far, most mutations identified in human radial spoke genes (RSPH) are loss-of-function mutations, and missense variants have been rarely described. We studied the consequences of different RSPH9, RSPH4A, and RSPH1 mutations on the assembly of the RS complex to improve diagnostics in PCD. We report 21 individuals with PCD (16 families) with biallelic mutations in RSPH9, RSPH4A, and RSPH1, including seven novel mutations comprising missense variants, and performed high-resolution immunofluorescence analysis of human respiratory cilia. Missense variants are frequent genetic defects in PCD with RS defects. Absence of RSPH4A due to mutations in RSPH4A results in deficient axonemal assembly of the RS head components RSPH1 and RSPH9. RSPH1 mutant cilia, lacking RSPH1, fail to assemble RSPH9, whereas RSPH9 mutations result in axonemal absence of RSPH9, but do not affect the assembly of the other head proteins, RSPH1 and RSPH4A. Interestingly, our results were identical in individuals carrying loss-of-function mutations, missense variants, or one amino acid deletion. Immunofluorescence analysis can improve diagnosis of PCD in patients with loss-of-function mutations as well as missense variants. RSPH4A is the core protein of the RS head.

...read moreread less

Journal Article•DOI•

Whole-genome sequence-based analysis of thyroid function

[...]

Peter N. Taylor¹, Eleonora Porcu², Shelby Chew³, Purdey J Campbell³, Michela Traglia, Suzanne J. Brown³, Benjamin H. Mullin⁴, Hashem A. Shihab⁵, J L Min⁵, Klaudia Walter⁶, Yasin Memari⁶, Jie Huang⁶, Michael R. Barnes⁷, John Beilby⁴, Pimphen Charoen⁸, Petr Danecek⁶, Frank Dudbridge⁸, Vincenzo Forgetta⁹, Celia M. T. Greenwood⁹, Elin Grundberg⁹, Andrew D Johnson¹⁰, Jennie Hui⁴, Ee Mun Lim³, Shane A. McCarthy⁶, Dawn Muddyman⁶, Vijay Panicker³, John R. B. Perry¹¹, Jordana T. Bell, Wei Yuan, Caroline L Relton⁵, Tom R. Gaunt⁵, David Schlessinger, Gonçalo R. Abecasis², Francesco Cucca¹², Gabriela L. Surdulescu, Wolfram Woltersdorf, Eleftheria Zeggini⁶, Hou-Feng Zheng¹³, Daniela Toniolo, Colin M. Dayan¹, Silvia Naitza, John P. Walsh⁴, Tim D. Spector, George Davey Smith⁵, Richard Durbin⁶, J. Brent Richards¹³, Serena Sanna, Nicole Soranzo⁶, Nicholas J. Timpson⁵, Scott Wilson⁴ - Show less +46 more•Institutions (13)

Cardiff University¹, University of Michigan², Sir Charles Gairdner Hospital³, University of Western Australia⁴, University of Bristol⁵, Wellcome Trust Sanger Institute⁶, Queen Mary University of London⁷, University of London⁸, McGill University⁹, University of Cambridge¹⁰, King's College London¹¹, University of Sassari¹², King Abdulaziz Medical City¹³

06 Mar 2015-Nature Communications

TL;DR: It is demonstrated that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function as well as common variants that explain ≥20% of the variance in TSH and FT4.

...read moreread less

Abstract: Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 10(-9)) and a new independent variant in PDE8B (MAF=10.4%, P=5.94 × 10(-14)). For FT4, we report a low-frequency variant near B4GALT6/SLC25A52 (MAF=3.2%, P=1.27 × 10(-9)) tagging a rare TTR variant (MAF=0.4%, P=2.14 × 10(-11)). All common variants explain ≥20% of the variance in TSH and FT4. Analysis of rare variants (MAF<1%) using sequence kernel association testing reveals a novel association with FT4 in NRG1. Our results demonstrate that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function.

...read moreread less

Journal Article•DOI•

Deficiency of ECHS1 causes mitochondrial encephalopathy with cardiac involvement.

[...]

Tobias B. Haack¹, Christopher B. Jackson², Kei Murayama², Laura S. Kremer¹, André Schaller³, Urania Kotzaeridou⁴, Maaike de Vries⁵, Gudrun Schottmann⁶, Saikat Santra⁷, Boriana Büchner⁸, Thomas Wieland¹, Elisabeth Graf¹, Peter Freisinger, Seila Eggimann², Akira Ohtake⁹, Yasushi Okazaki⁹, Masakazu Kohda⁹, Yoshihito Kishita⁹, Yoshimi Tokuzawa⁹, Sascha Sauer¹⁰, Yasin Memari¹¹, Anja Kolb-Kokocinski¹¹, Richard Durbin¹¹, Oswald Hasselmann², Kirsten Cremer¹², Beate Albrecht¹³, Dagmar Wieczorek¹³, Hartmut Engels¹², Dagmar Hahn², Alexander M. Zink¹², Charlotte L. Alston¹⁴, Robert W. Taylor¹⁴, Richard J. Rodenburg⁵, Regina Trollmann¹⁵, Wolfgang Sperl¹⁶, Tim M. Strom¹, Georg F. Hoffmann⁴, Johannes A. Mayr¹⁶, Thomas Meitinger, Ramona Bolognini³, Markus Schuelke⁶, Jean-Marc Nuoffer², Stefan Kölker⁴, Holger Prokisch¹, Thomas Klopstock¹⁷, Thomas Klopstock⁸ - Show less +42 more•Institutions (17)

Technische Universität München¹, Boston Children's Hospital², University of Bern³, University Hospital Heidelberg⁴, Radboud University Nijmegen⁵, Charité⁶, Children's of Alabama⁷, Ludwig Maximilian University of Munich⁸, Saitama Medical University⁹, Max Planck Society¹⁰, Wellcome Trust Sanger Institute¹¹, University of Bonn¹², University of Duisburg-Essen¹³, Newcastle University¹⁴, University of Erlangen-Nuremberg¹⁵, Paracelsus Private Medical University of Salzburg¹⁶, German Center for Neurodegenerative Diseases¹⁷

13 Mar 2015-Annals of clinical and translational neurology

TL;DR: The broad phenotypic spectrum and pathobiochemistry of individuals with autosomal‐recessive ECHS1 deficiency is described.

...read moreread less

Abstract: OBJECTIVE Short-chain enoyl-CoA hydratase (ECHS1) is a multifunctional mitochondrial matrix enzyme that is involved in the oxidation of fatty acids and essential amino acids such as valine. Here, we describe the broad phenotypic spectrum and pathobiochemistry of individuals with autosomal-recessive ECHS1 deficiency. METHODS Using exome sequencing, we identified ten unrelated individuals carrying compound heterozygous or homozygous mutations in ECHS1. Functional investigations in patient-derived fibroblast cell lines included immunoblotting, enzyme activity measurement, and a palmitate loading assay. RESULTS Patients showed a heterogeneous phenotype with disease onset in the first year of life and course ranging from neonatal death to survival into adulthood. The most prominent clinical features were encephalopathy (10/10), deafness (9/9), epilepsy (6/9), optic atrophy (6/10), and cardiomyopathy (4/10). Serum lactate was elevated and brain magnetic resonance imaging showed white matter changes or a Leigh-like pattern resembling disorders of mitochondrial energy metabolism. Analysis of patients' fibroblast cell lines (6/10) provided further evidence for the pathogenicity of the respective mutations by showing reduced ECHS1 protein levels and reduced 2-enoyl-CoA hydratase activity. While serum acylcarnitine profiles were largely normal, in vitro palmitate loading of patient fibroblasts revealed increased butyrylcarnitine, unmasking the functional defect in mitochondrial β-oxidation of short-chain fatty acids. Urinary excretion of 2-methyl-2,3-dihydroxybutyrate - a potential derivative of acryloyl-CoA in the valine catabolic pathway - was significantly increased, indicating impaired valine oxidation. INTERPRETATION In conclusion, we define the phenotypic spectrum of a new syndrome caused by ECHS1 deficiency. We speculate that both the β-oxidation defect and the block in l-valine metabolism, with accumulation of toxic methacrylyl-CoA and acryloyl-CoA, contribute to the disorder that may be amenable to metabolic treatment approaches.

...read moreread less

Journal Article•DOI•

TCTEX1D2 mutations underlie Jeune asphyxiating thoracic dystrophy with impaired retrograde intraflagellar transport

[...]

Miriam Schmidts¹, Yuqing Hou², Claudio Cortes³, Dorus A. Mans⁴ +183 more•Institutions (37)

05 Jun 2015-Nature Communications

TL;DR: TCTEX1D2 mutations causing Jeune asphyxiating thoracic dystrophy with partially penetrant inheritance are identified and defined as an integral component of the evolutionarily conserved retrograde IFT machinery.

...read moreread less

Abstract: The analysis of individuals with ciliary chondrodysplasias can shed light on sensitive mechanisms controlling ciliogenesis and cell signalling that are essential to embryonic development and survival. Here we identify TCTEX1D2 mutations causing Jeune asphyxiating thoracic dystrophy with partially penetrant inheritance. Loss of TCTEX1D2 impairs retrograde intraflagellar transport (IFT) in humans and the protist Chlamydomonas, accompanied by destabilization of the retrograde IFT dynein motor. We thus define TCTEX1D2 as an integral component of the evolutionarily conserved retrograde IFT machinery. In complex with several IFT dynein light chains, it is required for correct vertebrate skeletal formation but may be functionally redundant under certain conditions.

...read moreread less

Posted Content•DOI•

Health and population effects of rare gene knockouts in adult humans with related parents

[...]

Vagheesh M. Narasimhan¹, Karen A. Hunt², Dan Mason³, Christopher L. Baker, Konrad J. Karczewski⁴, Michael R. Barnes², Anthony H. Barnett⁵, Christopher M. Bates, Srikanth Bellary⁶, Nicholas A. Bockett², Kristina Giorda, Chris Griffiths², Harry Hemingway⁷, Jia Zhilong², Ann M. Kelly⁸, Hajrah A. Khawaja², Monkol Lek⁴, Shaun McCarthy¹, Rosie McEachan³, Kenneth Paigen, Costas Parisinos², Eamonn Sheridan³, Laura Southgate², Louise Tee⁸, Mark G. Thomas¹, Yali Xue¹, Michael Schnall-Levin, Petko M. Petkov, Chris Tyler-Smith¹, Eamonn R. Maher⁹, Richard C. Trembath¹⁰, Daniel G. MacArthur⁴, John Wright³, Richard Durbin¹, David A. van Heel² - Show less +31 more•Institutions (10)

Wellcome Trust Sanger Institute¹, Queen Mary University of London², Blackpool Victoria Hospital³, Broad Institute⁴, Heart of England NHS Foundation Trust⁵, Aston University⁶, University College London⁷, University of Birmingham⁸, University of Cambridge⁹, King's College London¹⁰

14 Nov 2015-bioRxiv

TL;DR: Exome sequenced 3,222 British Pakistani-heritage adults with high parental relatedness, discovering 1,111 rare-variant homozygous likely loss of function (rhLOF) genotypes predicted to disrupt (knockout) 781 genes, and showed meiotic recombination sites localised away from PRDM9-dependent hotspots, demonstratingPRDM9 redundancy in humans.

...read moreread less

Abstract: Complete gene knockouts are highly informative about gene function. We exome sequenced 3,222 British Pakistani-heritage adults with high parental relatedness, discovering 1,111 rare-variant homozygous likely loss of function (rhLOF) genotypes predicted to disrupt (knockout) 781 genes. Based on depletion of rhLOF genotypes, we estimate that 13.6% of knockouts are incompatible with adult life, finding on average 1.6 heterozygous recessive lethal LOF variants per adult. Linking to lifelong health records, we observed no association of rhLOF genotypes with prescription- or doctor-consultation rate, and no disease-related phenotypes in 33 of 42 individuals with rhLOF genotypes in recessive Mendelian disease genes. Phased genome sequencing of a healthy PRDM9 knockout mother, her child and controls, showed meiotic recombination sites localised away from PRDM9-dependent hotspots, demonstrating PRDM9 redundancy in humans.

...read moreread less

Posted Content•DOI•

Iron Age and Anglo-Saxon genomes from East England reveal British migration history

[...]

Stephan Schiffels¹, Wolfgang Haak², Pirita Paajanen¹, Bastien Llamas³, Elizabeth Popescu⁴, Louise Lou⁴, Rachel Clarke⁴, Alice Lyons⁴, Richard Mortimer⁴, Duncan Sayer⁵, Chris Tyler-Smith¹, Alan Cooper³, Richard Durbin¹ - Show less +9 more•Institutions (5)

Wellcome Trust Sanger Institute¹, Max Planck Society², University of Adelaide³, University of Oxford⁴, University of Central Lancashire⁵

17 Jul 2015-bioRxiv

TL;DR: Today’s British are more similar to the Iron Age individuals than to most of the Anglo-Saxon individuals, and it is estimated that the contemporary East English population derives 30% of its ancestry from Anglo- Saxon migrations, with a lower fraction in Wales and Scotland.

...read moreread less

Abstract: British population history has been shaped by a series of immigrations and internal movements, including the early Anglo-Saxon migrations following the breakdown of the Roman administration after 410CE. It remains an open question how these events affected the genetic composition of the current British population. Here, we present whole-genome sequences generated from ten ancient individuals found in archaeological excavations close to Cambridge in the East of England, ranging from 2,300 until 1,200 years before present (Iron Age to Anglo-Saxon period). We use present-day genetic data to characterize the relationship of these ancient individuals to contemporary British and other European populations. By analyzing the distribution of shared rare variants across ancient and modern individuals, we find that today’s British are more similar to the Iron Age individuals than to most of the Anglo-Saxon individuals, and estimate that the contemporary East English population derives 30% of its ancestry from Anglo-Saxon migrations, with a lower fraction in Wales and Scotland. We gain further insight with a new method, rarecoal, which fits a demographic model to the distribution of shared rare variants across a large number of samples, enabling fine scale analysis of subtle genetic differences and yielding explicit estimates of population sizes and split times. Using rarecoal we find that the ancestors of the Anglo-Saxon samples are closest to modern Danish and Dutch populations, while the Iron Age samples share ancestors with multiple Northern European populations including Britain.

...read moreread less

Journal Article•DOI•

Homozygous loss-of-function variants in European cosmopolitan and isolate populations

[...]

Vera B. Kaiser, Victoria Svinti, James G. D. Prendergast¹, You-Ying Chau, Archie Campbell², Inga Patarčić³, Inês Barroso⁴, Peter K. Joshi¹, Nicholas D. Hastie, Ana Miljković³, Martin S. Taylor, Generation Scotland², Uk K⁴, Stefan Enroth⁵, Yasin Memari⁴, Anja Kolb-Kokocinski⁴, Alan F. Wright, Ulf Gyllensten⁵, Richard Durbin⁴, Igor Rudan¹, Harry Campbell¹, Ozren Polasek¹, Ozren Polasek³, Åsa Johansson⁵, Sascha Sauer⁶, David J. Porteous², Ross M. Fraser¹, Camilla Drake, Veronique Vitart, Caroline Hayward, Colin A. Semple, James F. Wilson¹ - Show less +28 more•Institutions (6)

University of Edinburgh¹, Western General Hospital², University of Split³, Wellcome Trust Sanger Institute⁴, Uppsala University⁵, Max Planck Society⁶

14 Jul 2015-Human Molecular Genetics

TL;DR: Overall HLOF genes are enriched for olfactory receptor function and are expressed in testes more often than expected, consistent with reduced purifying selection and incipient pseudogenisation.

...read moreread less

Abstract: Homozygous loss of function (HLOF) variants provide a valuable window on gene function in humans, as well as an inventory of the human genes that are not essential for survival and reproduction. All humans carry at least a few HLOF variants, but the exact number of inactivated genes that can be tolerated is currently unknown—as are the phenotypic effects of losing function for most human genes. Here, we make use of 1432 whole exome sequences from five European populations to expand the catalogue of known human HLOF mutations; after stringent filtering of variants in our dataset, we identify a total of 173 HLOF mutations, 76 (44%) of which have not been observed previously. We find that population isolates are particularly well suited to surveys of novel HLOF genes because individuals in such populations carry extensive runs of homozygosity, which we show are enriched for novel, rare HLOF variants. Further, we make use of extensive phenotypic data to show that most HLOFs, ascertained in population-based samples, appear to have little detectable effect on the phenotype. On the contrary, we document several genes directly implicated in disease that seem to tolerate HLOF variants. Overall HLOF genes are enriched for olfactory receptor function and are expressed in testes more often than expected, consistent with reduced purifying selection and incipient pseudogenisation.

...read moreread less

Posted Content•DOI•

A reference panel of 64,976 haplotypes for genotype imputation

[...]

Shane A. McCarthy¹, Sayantan Das², Warren W. Kretzschmar³, Richard Durbin¹, Gonçalo R. Abecasis², Jonathan Marchini³ - Show less +2 more•Institutions (3)

Wellcome Trust Sanger Institute¹, University of Michigan², University of Oxford³

23 Dec 2015-bioRxiv

TL;DR: A reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole genome sequence data from 20 studies of predominantly European ancestry is described, leading to a large increase in the number of SNPs tested in association studies.

...read moreread less

Abstract: We describe a reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole genome sequence data from 20 studies of predominantly European ancestry. Using this resource leads to accurate genotype imputation at minor allele frequencies as low as 0.1%, a large increase in the number of SNPs tested in association studies and can help to discover and refine causal loci. We describe remote server resources that allow researchers to carry out imputation and phasing consistently and efficiently.

...read moreread less

Journal Article•DOI•

Pathway-Based Factor Analysis of Gene Expression Data Produces Highly Heritable Phenotypes That Associate with Age

[...]

Andrew A. Brown¹, Andrew A. Brown², Zhihao Ding¹, Ana Viñuela³, Daniel Glass³, Leopold Parts¹, Tim D. Spector³, John Winn⁴, Richard Durbin¹ - Show less +5 more•Institutions (4)

Wellcome Trust Sanger Institute¹, Oslo University Hospital², King's College London³, Microsoft⁴

01 May 2015-G3: Genes, Genomes, Genetics

TL;DR: It is demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases the power to discover biologically relevant associations.

...read moreread less

Abstract: Statistical factor analysis methods have previously been used to remove noise components from high-dimensional data prior to genetic association mapping and, in a guided fashion, to summarize biologically relevant sources of variation. Here, we show how the derived factors summarizing pathway expression can be used to analyze the relationships between expression, heritability, and aging. We used skin gene expression data from 647 twins from the MuTHER Consortium and applied factor analysis to concisely summarize patterns of gene expression to remove broad confounding influences and to produce concise pathway-level phenotypes. We derived 930 "pathway phenotypes" that summarized patterns of variation across 186 KEGG pathways (five phenotypes per pathway). We identified 69 significant associations of age with phenotype from 57 distinct KEGG pathways at a stringent Bonferroni threshold ([Formula: see text]). These phenotypes are more heritable ([Formula: see text]) than gene expression levels. On average, expression levels of 16% of genes within these pathways are associated with age. Several significant pathways relate to metabolizing sugars and fatty acids; others relate to insulin signaling. We have demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases our power to discover biologically relevant associations. These phenotypes could also be applied to discover associations with other environmental factors.

...read moreread less

Posted Content•DOI•

Purging of deleterious variants in Italian founder populations with extended autozygosity

[...]

Massimiliano Cocca¹, Marc Pybus², Pier Francesco Palamara³, Erik Garrison⁴, Michela Traglia, Cinzia Sala, Sheila Ulivi, Yasin Memari⁴, Anja Kolb-Kokocinski⁴, Richard Durbin⁴, Paolo Gasparini¹, Daniela Toniolo, Nicole Soranzo⁵, Vincenza Colonna⁶ - Show less +10 more•Institutions (6)

University of Trieste¹, Pompeu Fabra University², Harvard University³, Wellcome Trust⁴, Wellcome Trust Sanger Institute⁵, National Research Council⁶

21 Jul 2015-bioRxiv

TL;DR: This study carried out low-read depth whole-genome sequencing in 568 individuals from three Italian founder populations and compared it to data from other Italian and European populations from the 1000 Genomes Project to conclude that genetic drift and the founder effect should be responsible for the observed purging of deleterious variants.

...read moreread less

Abstract: Purging through inbreeding defines the process through which deleterious alleles can be removed from populations by natural selection when exposed in homozygosis through the occurrence of consanguineous marriage. In this study we carried out low-read depth (4-10x) whole-genome sequencing in 568 individuals from three Italian founder populations, and compared it to data from other Italian and European populations from the 1000 Genomes Project. We show depletion of homozygous genotypes at potentially detrimental sites in the founder populations compared to outbred populations and observe patterns consistent with consanguinity driving the accelerated purging of highly deleterious mutations.

...read moreread less

The Genome 10K Project: A Way Forward Further

[...]

01 Jan 2015

TL;DR: The status of known vertebrate genome projects, recommend standards for pronouncing a genome as sequenced or completed, and the present and future vision of the landscape of Genome 10K are provided.

...read moreread less

Abstract: The Genome 10K Project was established in 2009 by a consortium of biologists and genome scientists determined to facilitate the sequencing and analysis of the complete genomes of 10,000 vertebrate species. Since then the number of selected and initiated species has risen from ∼26 to 277 sequenced or ongoing with funding, an approximately tenfold increase in five years. Here we summarize the advances and commitments that have occurred by mid-2014 and outline the achievements and present challenges of reaching the 10,000-species goal. We summarize the status of known vertebrate genome projects, recommend standards for pronouncing a genome as sequenced or completed, and provide our present and future vision of the landscape of Genome 10K. The endeavor is ambitious, bold, expensive, and uncertain, but together the Genome 10K Consortium of Scientists and the worldwide genomics community are moving toward their goal of delivering to the coming generation the gift of genome empowerment for many vertebrate species.

...read moreread less

Posted Content•DOI•

Pathway based factor analysis of gene expression data produces highly heritable phenotypes that associate with age

[...]

Andrew A. Brown¹, Zhihao Ding¹, Ana Viñuela², Daniel Glass², Leopold Parts³, Tim D. Spector², John Winn⁴, Richard Durbin¹ - Show less +4 more•Institutions (4)

Wellcome Trust Sanger Institute¹, King's College London², University of Toronto³, Microsoft⁴

06 Mar 2015-bioRxiv

...read moreread less

Abstract: Statistical factor analysis methods have previously been used to remove noise components from high dimensional data prior to genetic association mapping, and in a guided fashion to summarise biologically relevant sources of variation. Here we show how the derived factors summarising pathway expression can be used to analyse the relationships between expression, heritability and ageing. We used skin gene expression data from 647 twins from the MuTHER Consortium and applied factor analysis to concisely summarise patterns of gene expression, both to remove broad confounding influences and to produce concise pathway-level phenotypes. We derived 930 "pathway phenotypes" which summarised patterns of variation across 186 KEGG pathways (five phenotypes per pathway). We identified 69 significant associations of age with phenotype from 57 distinct KEGG pathways at a stringent Bonferroni threshold (P<5.38E-5). These phenotypes are more heritable (h^2=0.32) than gene expression levels. On average, expression levels of 16% of genes within these pathways are associated with age. Several significant pathways relate to metabolising sugars and fatty acids, others with insulin signalling. We have demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases our power to discover biologically relevant associations. These phenotypes could also be applied to discover associations with other environmental factors.

...read moreread less

Posted Content•DOI•

Purging of deleterious variants due to drift and founder effect in Italian populations with extended autozygosity

[...]

Massimiliano Cocca¹, Marc Pybus², Pier Francesco Palamara³, Erik Garrison⁴, Michela Traglia, Cinzia Sala, Sheila Ulivi, Yasin Memari⁴, Anja Kolb-Kokocinski⁴, Shane A. McCarthy⁵, Richard Durbin⁴, Paolo Gasparini¹, Daniela Toniolo, Nicole Soranzo⁵, Vincenza Colonna⁶ - Show less +11 more•Institutions (6)

University of Trieste¹, Pompeu Fabra University², Harvard University³, Wellcome Trust⁴, Wellcome Trust Sanger Institute⁵, National Research Council⁶

12 Oct 2015-bioRxiv

TL;DR: In this article, the authors carried out low-read depth (4-10x) whole-genome sequencing in 568 individuals from three Italian founder populations, and compared it to data from other Italian and European populations from the 1000 Genomes Project.

...read moreread less

Abstract: Purging through inbreeding occurs when consanguineous marriages increases the rate at which deleterious alleles are present in a homozygous state. In this study we carried out low-read depth (4-10x) whole-genome sequencing in 568 individuals from three Italian founder populations, and compared it to data from other Italian and European populations from the 1000 Genomes Project. We show extended consanguinity and depletion of homozygous genotypes at potentially detrimental sites in the founder populations compared to outbred populations. However these patterns are not compatible with the hypothesis of consanguinity driving the purging of highly deleterious mutations according to simulations. Therefore we conclude that genetic drift and the founder effect should be responsible for the observed purging of deleterious variants.

...read moreread less

Journal Article•DOI•

Tree consistent PBWT and their application to reconstructing Ancestral Recombination Graphs and demographic inference

[...]

Vladimir Shchur, Richard Durbin

21 May 2015-F1000Research

Journal Article•DOI•

Erratum: Whole-genome sequence-based analysis of thyroid function.

[...]

Peter N. Taylor, Eleonora Porcu, Shelby Chew, Purdey J Campbell, Michela Traglia, Suzanne J. Brown, Benjamin H. Mullin¹, Hashem A. Shihab, J L Min, Klaudia Walter, Yasin Memari, Jie Huang, Michael R. Barnes, John Beilby, Pimphen Charoen, Petr Danecek, Frank Dudbridge, Vincenzo Forgetta, Celia M. T. Greenwood, Elin Grundberg, Andrew D. Johnson, Jennie Hui, Ee Mun Lim, Shane A. McCarthy, Dawn Muddyman, Vijay Panicker, John R. B. Perry, Jordana T. Bell, Wei Yuan, Caroline L Relton, Tom R. Gaunt, David Schlessinger, Gonçalo R. Abecasis, Francesco Cucca, Gabriela L. Surdulescu, Wolfram Woltersdorf, Eleftheria Zeggini, Hou-Feng Zheng, Daniela Toniolo², Colin M. Dayan, Silvia Naitza, John P. Walsh, Tim D. Spector, George Davey Smith, Richard Durbin, J. Brent Richards, Serena Sanna, Nicole Soranzo, Nicholas J. Timpson, Scott Wilson - Show less +46 more•Institutions (2)

University of Western Australia¹, Vita-Salute San Raffaele University²

20 May 2015-Nature Communications

TL;DR: The original version of this article noted incorrect affiliations for members of the UK10K Consortium, and contained typographical errors in the spelling of UK10k Consortium and consortium members Valentina Iotchkova and Michael Quail as discussed by the authors.

...read moreread less

Abstract: Nature Communications 6: Article number: 5681 10.1038/ncomms6681 (2015); Published March062015; Updated May202015 The original version of this Article noted incorrect affiliations for members of the UK10K Consortium, and contained typographical errors in the spelling of the UK10K Consortium and consortium members Valentina Iotchkova and Michael Quail. In addition, the author J. Brent Richards was incorrectly duplicated in the list of consortium members as Brent Richards. These errors have now been corrected in the PDF and HTML versions of this Article.

...read moreread less

Journal Article•DOI•

Correction: Quantitative Genetics of CTCF Binding Reveal Local Sequence Effects and Different Modes of X-Chromosome Association (PLoS Genet, 11, 4, (2015))

[...]

Zhihao Ding, Yunyun Ni, Sander W. Timmer, Bum Kyu Lee, Anna Battenhouse, Sandra Louzada, Fengtang Yang, Ian Dunham, Gregory E. Crawford, Jason D. Lieb, Richard Durbin, Vishwanath R. Iyer, Ewan Birney - Show less +9 more

28 Apr 2015-PLOS Genetics

TL;DR: This research presents a novel probabilistic approach to estimating the response of the immune system to laser-spot assisted, 3D image analysis of central nervous system injury.

...read moreread less

Abstract: [This corrects the article DOI: 10.1371/journal.pgen.1004798.].

...read moreread less