Showing papers by "Richard Durbin" published in 2013

Journal Article

Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species

Keith Bradnam¹, Joseph Fass¹, Anton Alexandrov, Paul Baranay², Michael Bechner, Inanc Birol, Sébastien Boisvert³, Jarrod Chapman⁴, Guillaume Chapuis⁵,⁶, Rayan Chikhi⁵,⁶, Hamidreza Chitsaz⁷, Wen-Chi Chou⁸, Jacques Corbeil³, Cristian Del Fabbro⁹, T. Roderick Docking, Richard Durbin¹⁰, Dent Earl¹¹, Scott J. Emrich¹², Pavel Fedotov, Nuno A. Fonseca¹³, Ganeshkumar Ganapathy¹⁴, Richard A. Gibbs¹⁵, Sante Gnerre¹⁶, Elenie Godzaridis³, Steve Goldstein, Matthias Haimel¹³, Giles Hall¹⁶, David Haussler¹¹, Joseph B. Hiatt¹⁷, Isaac Ho⁴, Jason T. Howard¹⁴, Martin Hunt¹⁰, Shaun D. Jackman, David B. Jaffe¹⁶, Erich D. Jarvis¹⁴, Huaiyang Jiang¹⁵, Sergey Kazakov, Paul J. Kersey¹³, Jacob O. Kitzman¹⁷, James R. Knight, Sergey Koren¹⁸, Tak-Wah Lam, Dominique Lavenier⁵,⁶, François Laviolette³, Yingrui Li, Zhenyu Li, Binghang Liu, Yue Liu¹⁵, Ruibang Luo, Iain MacCallum¹⁶, Matthew D. MacManes¹⁹, Nicolas Maillet⁵, Sergey Melnikov, Bruno Vieira²⁰, Delphine Naquin⁵, Zemin Ning¹⁰, Thomas D. Otto¹⁰, Benedict Paten¹¹, Octávio S. Paulo²⁰, Adam M. Phillippy¹⁸, Francisco Pina-Martins²⁰, Michael Place, Dariusz Przybylski¹⁶, Xiang Qin¹⁵, Carson Qu¹⁵, Filipe J. Ribeiro¹⁶, Stephen Richards¹⁵, Daniel S. Rokhsar⁴,¹⁹, J. Graham Ruby²¹,²², Simone Scalabrin⁹, Michael C. Schatz²³, David C. Schwartz, Alexey Sergushichev, Ted Sharpe¹⁶, Timothy I. Shaw⁸, Jay Shendure¹⁷, Yujian Shi, Jared T. Simpson¹⁰, Henry Song¹⁵, Fedor Tsarev, Francesco Vezzi²⁴, Riccardo Vicedomini⁹, Jun Wang, Kim C. Worley¹⁵, Shuangye Yin¹⁶, Siu-Ming Yiu, Jianying Yuan, Guojie Zhang, Hao Zhang, Shiguo Zhou, Ian F Korf¹

23 Jan 2013-arXiv: Genomics

TL;DR: Assemblathon 2 provided a variety of sequence data for three vertebrate species (a bird, a fish, and a snake) and evaluated the 43 assemblies submitted by 21 participating teams.

Abstract: Background - The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results - In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions - Many current genome assemblers produced useful assemblies, containing a significant representation of their genes, regulatory sequences, and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.

690 citations

Journal Article

Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species

Keith Bradnam, Joseph Fass, Anton Alexandrov, Paul Baranay¹, Michael Bechner, Inanc Birol², Sébastien Boisvert³, Jarrod Chapman⁴, Guillaume Chapuis⁵,⁶, Rayan Chikhi⁵,⁶, Hamidreza Chitsaz⁷, Wen-Chi Chou⁸, Jacques Corbeil³, Cristian Del Fabbro, Roderick R. Docking², Richard Durbin⁹, Dent Earl¹⁰, Scott J. Emrich¹¹, Pavel Fedotov, Nuno A. Fonseca¹², Ganeshkumar Ganapathy¹³, Richard A. Gibbs¹⁴, Sante Gnerre¹⁵, Elenie Godzaridis³, Steve Goldstein, Matthias Haimel¹², Giles Hall¹⁵, David Haussler¹⁰, Joseph B. Hiatt¹⁶, Isaac Ho⁴, Jason T. Howard¹³, Martin Hunt⁹, Shaun D. Jackman², David B. Jaffe¹⁵, Erich D. Jarvis¹³, Huaiyang Jiang¹⁴, Sergey Kazakov, Paul J. Kersey¹², Jacob O. Kitzman¹⁶, James R. Knight, Sergey Koren¹⁷, Tak-Wah Lam¹⁸, Dominique Lavenier⁵,⁶,¹⁹, François Laviolette³, Yingrui Li¹⁸, Zhenyu Li, Binghang Liu, Yue Liu¹⁴, Ruibang Luo¹⁸, Iain MacCallum¹⁵, Matthew D. MacManes²⁰, Nicolas Maillet⁶,¹⁹, Sergey Melnikov, Delphine Naquin⁶,¹⁹, Zemin Ning⁹, Thomas D. Otto⁹, Benedict Paten¹⁰, Octávio S. Paulo²¹, Adam M. Phillippy¹⁷, Francisco Pina-Martins²¹, Michael Place, Dariusz Przybylski¹⁵, Xiang Qin¹⁴, Carson Qu¹⁴, Filipe J. Ribeiro, Stephen Richards¹⁴, Daniel S. Rokhsar⁴,²², J. Graham Ruby²³,²⁴, Simone Scalabrin, Michael C. Schatz²⁵, David C. Schwartz, Alexey Sergushichev, Ted Sharpe¹⁵, Timothy I. Shaw⁸, Jay Shendure¹⁶, Yujian Shi, Jared T. Simpson⁹, Henry Song¹⁴, Fedor Tsarev, Francesco Vezzi²⁶, Riccardo Vicedomini²⁷, Bruno Vieira²¹, Jun Wang, Kim C. Worley¹⁴, Shuangye Yin¹⁵, Siu-Ming Yiu¹⁸, Jianying Yuan, Guojie Zhang, Hao Zhang, Shiguo Zhou, Ian F Korf

Yale University¹, BC Cancer Agency², Laval University³, Joint Genome Institute⁴, École normale supérieure de Cachan⁵, Centre national de la recherche scientifique⁶, Wayne State University⁷, University of Georgia⁸, Wellcome Trust Sanger Institute⁹, University of California, Santa Cruz¹⁰, University of Notre Dame¹¹, European Bioinformatics Institute¹², Duke University¹³, Baylor College of Medicine¹⁴, Broad Institute¹⁵, University of Washington¹⁶, University of Maryland, College Park¹⁷, University of Hong Kong¹⁸, French Institute for Research in Computer Science and Automation¹⁹, California Institute for Quantitative Biosciences²⁰, University of Lisbon²¹, University of California, Berkeley²², Howard Hughes Medical Institute²³, University of California, San Francisco²⁴, Cold Spring Harbor Laboratory²⁵, Royal Institute of Technology²⁶, University of Udine²⁷

22 Jul 2013-GigaScience

TL;DR: Assemblathon 2 provided a variety of sequence data for three vertebrate species (a bird, a fish, and a snake) and evaluated the 43 assemblies submitted by 21 participating teams.

Abstract: Background: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies.

602 citations

Journal Article

Gene expression changes with age in skin, adipose tissue, blood and brain.

Daniel Glass¹,², Ana Viñuela², Matthew N. Davies², Adaikalavan Ramasamy², Leopold Parts³, David A. Knowles⁴, Andrew A. Brown³, Åsa K. Hedman⁵, Kerrin S. Small²,³, Alfonso Buil⁶, Elin Grundberg²,³, Alexandra C. Nica⁶, Paola Di Meglio², Frank O. Nestle², Mina Ryten², Richard Durbin³, Mark I. McCarthy⁵,⁷, Panagiotis Deloukas³, Emmanouil T. Dermitzakis⁶, Michael E. Weale², Veronique Bataille², Tim D. Spector²

Northwick Park Hospital¹, King's College London², Wellcome Trust Sanger Institute³, Stanford University⁴, Wellcome Trust Centre for Human Genetics⁵, University of Geneva⁶, University of Oxford⁷

26 Jul 2013-Genome Biology

TL;DR: Skin showed the most age-related gene expression changes of all the tissues investigated, with many of the genes being previously implicated in fatty acid metabolism, mitochondrial activity, cancer and splicing.

Abstract: Background: Previous studies have demonstrated that gene expression levels change with age. These changes are hypothesized to influence the aging rate of an individual. We analyzed gene expression changes with age in abdominal skin, subcutaneous adipose tissue and lymphoblastoid cell lines in 856 female twins in the age range of 39-85 years. Additionally, we investigated genotypic variants involved in genotype-by-age interactions to understand how the genomic regulation of gene expression alters with age. Results: Using a linear mixed model, differential expression with age was identified in 1,672 genes in skin and 188 genes in adipose tissue. Only two genes expressed in lymphoblastoid cell lines showed significant changes with age. Genes significantly regulated by age were compared with expression profiles in 10 brain regions from 100 postmortem brains aged 16 to 83 years. We identified only one age-related gene common to the three tissues. There were 12 genes that showed differential expression with age in both skin and brain tissue and three common to adipose and brain tissues. Conclusions: Skin showed the most age-related gene expression changes of all the tissues investigated, with many of the genes being previously implicated in fatty acid metabolism, mitochondrial activity, cancer and splicing. A significant proportion of age-related changes in gene expression appear to be tissue-specific with only a few genes sharing an age effect in expression across tissues. More research is needed to improve our understanding of the genetic influences on aging and the relationship with age-related diseases.

262 citations

Journal Article

High-Resolution Mapping of Complex Traits with a Four-Parent Advanced Intercross Yeast Population

Francisco A. Cubillos¹,², Leopold Parts³,⁴, Francisco Salinas⁵, Anders Bergström⁵, Eugenio Scovacricchi², Amin Zia³, Christopher J. R. Illingworth⁴, Ville Mustonen⁴, Sebastian Ibstedt⁶, Jonas Warringer⁶, Edward J. Louis²,⁷, Richard Durbin⁴, Gianni Liti⁵

University of Santiago, Chile¹, University of Nottingham², University of Toronto³, Wellcome Trust Sanger Institute⁴, French Institute of Health and Medical Research⁵, University of Gothenburg⁶, University of Leicester⁷

01 Nov 2013-Genetics

TL;DR: The most parsimonious model for the majority of loci mapped using either approach was the effect of an allele private to one founder, although examples of pleiotropic effects and complex allelic series at a locus were also validated.

Abstract: A large fraction of human complex trait heritability is due to a high number of variants with small marginal effects and their interactions with genotype and environment. Such alleles are more easily studied in model organisms, where environment, genetic makeup, and allele frequencies can be controlled. Here, we examine the effect of natural genetic variation on heritable traits in a very large pool of baker’s yeast from a multiparent 12th generation intercross. We selected four representative founder strains to produce the Saccharomyces Genome Resequencing Project (SGRP)-4X mapping population and sequenced 192 segregants to generate an accurate genetic map. Using these individuals, we mapped 25 loci linked to growth traits under heat stress, arsenite, and paraquat, the majority of which were best explained by a diverging phenotype caused by a single allele in one condition. By sequencing pooled DNA from millions of segregants grown under heat stress, we further identified 34 and 39 regions selected in haploid and diploid pools, respectively, with most of the selection against a single allele. While the most parsimonious model for the majority of loci mapped using either approach was the effect of an allele private to one founder, we could validate examples of pleiotropic effects and complex allelic series at a locus. SGRP-4X is a deeply characterized resource that provides a framework for powerful and high-resolution genetic analysis of yeast phenotypes and serves as a test bed for testing avenues to attack human complex traits.

129 citations

Journal Article

The anatomy of successful computational biology software.

Stephen F. Altschul, Barry Demchak¹, Richard Durbin², Robert Gentleman³, Martin Krzywinski⁴, Heng Li⁵, Anton Nekrutenko⁶, James T. Robinson⁵, Wayne Rasband⁷, James Taylor⁸, Cole Trapnell⁹

University of California, San Diego¹, Wellcome Trust Sanger Institute², Genentech³, BC Cancer Research Centre⁴, Broad Institute⁵, Pennsylvania State University⁶, National Institutes of Health⁷, Emory University⁸, Harvard University⁹

01 Oct 2013-Nature Biotechnology

TL;DR: Creators of software widely used in computational biology discuss the factors that contributed to their success and suggest ideas for future generations of software developers.

Abstract: Creators of software widely used in computational biology discuss the factors that contributed to their success.

35 citations

Journal Article

A genome-wide survey of genetic variation in gorillas using reduced representation sequencing.

Aylwyn Scally¹, Bryndis Yngvadottir¹,², Yali Xue¹, Qasim Ayub¹, Richard Durbin¹, Chris Tyler-Smith¹

Wellcome Trust Sanger Institute¹, University of Cambridge²

04 Jun 2013-PLOS ONE

TL;DR: A genome-wide survey of genetic variation in gorillas using a reduced representation sequencing approach, focusing on the two lowland subspecies, suggests that despite their maintaining an overall level of genetic diversity equal to or greater than that of humans, population decline has been a significant factor in recent and long-term pressures on wild gorilla populations.

Abstract: All non-human great apes are endangered in the wild, and it is therefore important to gain an understanding of their demography and genetic diversity. Whole genome assembly projects have provided an invaluable foundation for understanding genetics in all four genera, but to date genetic studies of multiple individuals within great ape species have largely been confined to mitochondrial DNA and a small number of other loci. Here, we present a genome-wide survey of genetic variation in gorillas using a reduced representation sequencing approach, focusing on the two lowland subspecies. We identify 3,006,670 polymorphic sites in 14 individuals: 12 western lowland gorillas (Gorilla gorilla gorilla) and 2 eastern lowland gorillas (Gorilla beringei graueri). We find that the two species are genetically distinct, based on levels of heterozygosity and patterns of allele sharing. Focusing on the western lowland population, we observe evidence for population substructure, and a deficit of rare genetic variants suggesting a recent episode of population contraction. In western lowland gorillas, there is an elevation of variation towards telomeres and centromeres on the chromosomal scale. On a finer scale, we find substantial variation in genetic diversity, including a marked reduction close to the major histocompatibility locus, perhaps indicative of recent strong selection there. These findings suggest that despite their maintaining an overall level of genetic diversity equal to or greater than that of humans, population decline, perhaps associated with disease, has been a significant factor in recent and long-term pressures on wild gorilla populations.

27 citations

De novo assembly of human genomes with massively parallel short read sequencing

Jared T. Simpson, Richard Durbin

01 Jan 2013

5 citations