Home
/
Authors
/
Marcela Uliano-Silva

Author

Marcela Uliano-Silva

Other affiliations: Leibniz Association, Universidade Federal de Santa Catarina, Federal University of Rio de Janeiro

Bio: Marcela Uliano-Silva is an academic researcher from Wellcome Trust Sanger Institute. The author has contributed to research in topics: Genome & Biology. The author has an hindex of 12, co-authored 29 publications receiving 552 citations. Previous affiliations of Marcela Uliano-Silva include Leibniz Association & Universidade Federal de Santa Catarina.

Topics: Genome, Biology, Reference genome, Medicine, Sequence assembly ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Towards complete and error-free genome assemblies of all vertebrate species

[...]

Arang Rhie¹, Shane A. McCarthy², Shane A. McCarthy³, Olivier Fedrigo⁴, Joana Damas⁵, Giulio Formenti⁴, Sergey Koren¹, Marcela Uliano-Silva⁶, William Chow², Arkarachai Fungtammasan, J. H. Kim⁷, Chul Hee Lee⁷, Byung June Ko⁷, Mark Chaisson⁸, Gregory Gedman⁴, Lindsey J. Cantin⁴, Françoise Thibaud-Nissen¹, Leanne Haggerty⁹, Iliana Bista³, Iliana Bista², Michelle Smith², Bettina Haase⁴, Jacquelyn Mountcastle⁴, Sylke Winkler¹⁰, Sylke Winkler¹¹, Sadye Paez⁴, Jason T. Howard, Sonja C. Vernes¹⁰, Sonja C. Vernes¹², Sonja C. Vernes¹³, Tanya M. Lama¹⁴, Frank Grützner¹⁵, Wesley C. Warren¹⁶, Christopher N. Balakrishnan¹⁷, Dave W Burt¹⁸, Jimin George¹⁹, Matthew T. Biegler⁴, David Iorns, Andrew Digby, Daryl Eason, Bruce C. Robertson²⁰, Taylor Edwards²¹, Mark Wilkinson²², George F. Turner²³, Axel Meyer²⁴, Andreas F. Kautt²⁵, Andreas F. Kautt²⁴, Paolo Franchini²⁴, H. William Detrich²⁶, Hannes Svardal²⁷, Hannes Svardal²⁸, Maximilian Wagner²⁹, Gavin J. P. Naylor³⁰, Martin Pippel¹⁰, Milan Malinsky², Milan Malinsky³¹, Mark Mooney, Maria Simbirsky, Brett T. Hannigan, Trevor Pesout³², Marlys L. Houck³³, Ann C Misuraca³³, Sarah B. Kingan³⁴, Richard Hall³⁴, Zev N. Kronenberg³⁴, Ivan Sović³⁴, Christopher Dunn³⁴, Zemin Ning², Alex Hastie, Joyce V. Lee, Siddarth Selvaraj, Richard E. Green³², Nicholas H. Putnam, Ivo Gut³⁵, Jay Ghurye³⁶, Erik Garrison³², Ying Sims², Joanna Collins², Sarah Pelan², James Torrance², Alan Tracey², Jonathan Wood², Robel E. Dagnew⁸, Dengfeng Guan³⁷, Dengfeng Guan³, Sarah E. London³⁸, David F. Clayton¹⁹, Claudio V. Mello³⁹, Samantha R. Friedrich³⁹, Peter V. Lovell³⁹, Ekaterina Osipova¹⁰, Farooq O. Al-Ajli⁴⁰, Farooq O. Al-Ajli⁴¹, Simona Secomandi⁴², Heebal Kim⁷, Constantina Theofanopoulou⁴, Michael Hiller⁴³, Yang Zhou, Robert S. Harris⁴⁴, Kateryna D. Makova⁴⁴, Paul Medvedev⁴⁴, Jinna Hoffman¹, Patrick Masterson¹, Karen Clark¹, Fergal J. Martin⁹, Kevin L. Howe⁹, Paul Flicek⁹, Brian P. Walenz¹, Woori Kwak, Hiram Clawson³², Mark Diekhans³², Luis R Nassar³², Benedict Paten³², Robert H. S. Kraus¹⁰, Robert H. S. Kraus²⁴, Andrew J. Crawford⁴⁵, M. Thomas P. Gilbert⁴⁶, M. Thomas P. Gilbert⁴⁷, Guojie Zhang, Byrappa Venkatesh⁴⁸, Robert W. Murphy⁴⁹, Klaus-Peter Koepfli⁵⁰, Beth Shapiro³², Beth Shapiro⁵¹, Warren E. Johnson⁵², Warren E. Johnson⁵⁰, Federica Di Palma⁵³, Tomas Marques-Bonet, Emma C. Teeling⁵⁴, Tandy Warnow⁵⁵, Jennifer A. Marshall Graves⁵⁶, Oliver A. Ryder⁵⁷, Oliver A. Ryder³³, David Haussler³², Stephen J. O'Brien⁵⁸, Jonas Korlach³⁴, Harris A. Lewin⁵, Kerstin Howe², Eugene W. Myers¹¹, Eugene W. Myers¹⁰, Richard Durbin³, Richard Durbin², Adam M. Phillippy¹, Erich D. Jarvis⁵¹, Erich D. Jarvis⁴ - Show less +141 more•Institutions (58)

National Institutes of Health¹, Wellcome Trust Sanger Institute², University of Cambridge³, Rockefeller University⁴, University of California, Davis⁵, Leibniz Association⁶, Seoul National University⁷, University of Southern California⁸, European Bioinformatics Institute⁹, Max Planck Society¹⁰, Dresden University of Technology¹¹, Radboud University Nijmegen¹², University of St Andrews¹³, University of Massachusetts Amherst¹⁴, University of Adelaide¹⁵, University of Missouri¹⁶, East Carolina University¹⁷, University of Queensland¹⁸, Clemson University¹⁹, University of Otago²⁰, University of Arizona²¹, Natural History Museum²², Bangor University²³, University of Konstanz²⁴, Harvard University²⁵, Northeastern University²⁶, University of Antwerp²⁷, National Museum of Natural History²⁸, University of Graz²⁹, University of Florida³⁰, University of Basel³¹, University of California, Santa Cruz³², Zoological Society of San Diego³³, Pacific Biosciences³⁴, Pompeu Fabra University³⁵, University of Maryland, College Park³⁶, Harbin Institute of Technology³⁷, University of Chicago³⁸, Oregon Health & Science University³⁹, Monash University Malaysia Campus⁴⁰, Qatar Airways⁴¹, University of Milan⁴², Goethe University Frankfurt⁴³, Pennsylvania State University⁴⁴, University of Los Andes⁴⁵, University of Copenhagen⁴⁶, Norwegian University of Science and Technology⁴⁷, Agency for Science, Technology and Research⁴⁸, Royal Ontario Museum⁴⁹, Smithsonian Institution⁵⁰, Howard Hughes Medical Institute⁵¹, Walter Reed Army Institute of Research⁵², University of East Anglia⁵³, University College Dublin⁵⁴, University of Illinois at Urbana–Champaign⁵⁵, La Trobe University⁵⁶, University of California, San Diego⁵⁷, Nova Southeastern University⁵⁸

28 Apr 2021-Nature

TL;DR: The Vertebrate Genomes Project (VGP) as mentioned in this paper is an international effort to generate high quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.

...read moreread less

Abstract: High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1-4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.

...read moreread less

647 citations

Posted Content•DOI•

Towards complete and error-free genome assemblies of all vertebrate species

[...]

Arang Rhie¹, Shane A. McCarthy², Olivier Fedrigo³, Joana Damas⁴, Giulio Formenti³, Sergey Koren¹, Marcela Uliano-Silva², William Chow², Arkarachai Fungtammasan, Gregory Gedman³, Lindsey J. Cantin³, Françoise Thibaud-Nissen¹, Leanne Haggerty⁵, Chul Hee Lee⁶, Byung June Ko⁶, J. H. Kim⁶, Iliana Bista², Michelle Smith², Bettina Haase³, Jacquelyn Mountcastle³, Sylke Winkler⁷, Sadye Paez³, Jason T. Howard⁸, Sonja C. Vernes⁷, Tanya M. Lama⁹, Frank Grützner¹⁰, Wesley C. Warren¹¹, Christopher N. Balakrishnan¹², Dave W Burt¹³, Jimin George¹⁴, Matthew T. Biegler³, David Iorns¹⁵, Andrew Digby, Daryl Eason, Taylor Edwards¹⁶, Mark Wilkinson¹⁷, George F. Turner¹⁸, Axel Meyer¹⁹, Andreas F. Kautt¹⁹, Paolo Franchini¹⁹, H. William Detrich²⁰, Hannes Svardal²¹, Maximilian Wagner²², Gavin J. P. Naylor²³, Martin Pippel⁷, Milan Malinsky², Mark Mooney, Maria Simbirsky, Brett T. Hannigan, Trevor Pesout²⁴, Marlys L. Houck, Ann C Misuraca, Sarah B. Kingan²⁵, Richard Hall²⁵, Zev N. Kronenberg²⁵, Jonas Korlach²⁵, Ivan Sović²⁵, Christopher Dunn²⁵, Zemin Ning², Alex Hastie, Joyce V. Lee, Siddarth Selvaraj, Richard E. Green²⁴, Nicholas H. Putnam, Jay Ghurye²⁶, Erik Garrison²⁴, Ying Sims², Joanna Collins², Sarah Pelan², James Torrance², Alan Tracey², Jonathan Wood², Dengfeng Guan²⁷, Sarah E. London²⁸, David F. Clayton¹⁴, Claudio V. Mello²⁹, Samantha R. Friedrich²⁹, Peter V. Lovell²⁹, Ekaterina Osipova⁷, Farooq O. Al-Ajli³⁰, Simona Secomandi³¹, Heebal Kim⁶, Constantina Theofanopoulou³, Yang Zhou³², Robert S. Harris³³, Kateryna D. Makova³³, Paul Medvedev³³, Jinna Hoffman¹, Patrick Masterson¹, Karen Clark¹, Fergal J. Martin⁵, Kevin L. Howe⁵, Paul Flicek⁵, Brian P. Walenz¹, Woori Kwak, Hiram Clawson²⁴, Mark Diekhans²⁴, Luis R Nassar²⁴, Benedict Paten²⁴, Robert H. S. Kraus¹⁹, Harris A. Lewin⁴, Andrew J. Crawford³⁴, M. Thomas P. Gilbert³², Guojie Zhang³², Byrappa Venkatesh³⁵, Robert W. Murphy³⁶, Klaus-Peter Koepfli³⁷, Beth Shapiro²⁴, Warren E. Johnson³⁷, Federica Di Palma³⁸, Tomas Marques-Bonet³⁹, Emma C. Teeling⁴⁰, Tandy Warnow⁴¹, Jennifer A. Marshall Graves⁴², Oliver A. Ryder⁴³, David Haussler²⁴, Stephen J. O'Brien⁴⁴, Kerstin Howe², Eugene W. Myers⁴⁵, Richard Durbin², Adam M. Phillippy¹, Erich D. Jarvis³ - Show less +118 more•Institutions (45)

National Institutes of Health¹, Wellcome Trust Sanger Institute², Rockefeller University³, University of California, Davis⁴, European Bioinformatics Institute⁵, Seoul National University⁶, Max Planck Society⁷, Durham University⁸, University of Massachusetts Amherst⁹, University of Adelaide¹⁰, University of Missouri¹¹, East Carolina University¹², University of Queensland¹³, Queen Mary University of London¹⁴, Wellington Management Company¹⁵, University of Arizona¹⁶, Natural History Museum¹⁷, Bangor University¹⁸, University of Konstanz¹⁹, Northeastern University²⁰, Naturalis²¹, University of Graz²², Florida Museum of Natural History²³, University of California, Santa Cruz²⁴, Pacific Biosciences²⁵, University of Maryland, College Park²⁶, Harbin Institute of Technology²⁷, University of Chicago²⁸, Oregon Health & Science University²⁹, Monash University Malaysia Campus³⁰, University of Milan³¹, University of Copenhagen³², Pennsylvania State University³³, University of Los Andes³⁴, Agency for Science, Technology and Research³⁵, Royal Ontario Museum³⁶, Smithsonian Conservation Biology Institute³⁷, University of East Anglia³⁸, Pompeu Fabra University³⁹, University College Dublin⁴⁰, University of Illinois at Urbana–Champaign⁴¹, La Trobe University⁴², University of California, San Diego⁴³, UPRRP College of Natural Sciences⁴⁴, Dresden University of Technology⁴⁵

23 May 2020-bioRxiv

TL;DR: The Vertebrate Genomes Project is embarked on, an effort to generate high-quality, complete reference genomes for all ~70,000 extant vertebrate species and help enable a new era of discovery across the life sciences.

...read moreread less

Abstract: High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are only available for a few non-microbial species. To address this issue, the international Genome 10K (G10K) consortium has worked over a five-year period to evaluate and develop cost-effective methods for assembling the most accurate and complete reference genomes to date. Here we summarize these developments, introduce a set of quality standards, and present lessons learned from sequencing and assembling 16 species representing major vertebrate lineages (mammals, birds, reptiles, amphibians, teleost fishes and cartilaginous fishes). We confirm that long-read sequencing technologies are essential for maximizing genome quality and that unresolved complex repeats and haplotype heterozygosity are major sources of error in assemblies. Our new assemblies identify and correct substantial errors in some of the best historical reference genomes. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an effort to generate high-quality, complete reference genomes for all ~70,000 extant vertebrate species and help enable a new era of discovery across the life sciences.

...read moreread less

567 citations

Journal Article•DOI•

MitoHiFi: a python pipeline for mitochondrial genome assembly from PacBio high fidelity reads

[...]

Marcela Uliano-Silva, João Gabriel R. N. Ferreira, Ksenia Krasheninnikova, Giulio Formenti, Linelle Ann Lacson Abueg, James Torrance, Eugene W. Myers, Richard Durbin, Mark Blaxter, Shane A. McCarthy - Show less +6 more

10 Jan 2023-bioRxiv

166 citations

DOI•

marcelauliano/MitoHiFi: mitohifi_v2.0

[...]

Marcela Uliano-Silva, Joāo Gabriel Ferreira Nunes, Ksenia Krasheninnikova, Shane A. McCarthy

16 Aug 2021

119 citations

Journal Article•DOI•

The Earth BioGenome Project 2020: Starting the clock

[...]

Harris A. Lewin, Stephen Edward Richards, Erez Lieberman Aiden, Miguel L. Allende, John Archibald, Miklós Bálint, Katharine Barker, Bridget L. Baumgartner, Katherine Belov, Giorgio Bertorelle, Mark Blaxter, Jing Cai, Nicolette Caperello, Keith Thor Carlson, Juan Carlos Castilla‐Rubio, Shu-Miaw Chaw, Li Chen, Anna K. Childers, Jonathan A. Coddington, Dalia Amor Conde, Montserrat Gorchs Corominas, Keith A. Crandall, Andrew J. Crawford, F J DiPalma, Richard Rep Durbin, ThankGod Ebenezer, Scott V. Edwards, Olivier Fedrigo, Paul Flicek, Giulio Formenti, Richard A. Gibbs, M. Thomas P. Gilbert, Melissa M. Goldstein, J. M. Graves, Henry T. Greely, Igor V. Grigoriev, Kevin J. Hackett, Neil Hall, David Haussler, Kristofer M. Helgen, Carolyn J. Hogg, Sachiko Isobe, Kjetill S. Jakobsen, Axel Janke, Erich D. Jarvis, Warren Johnson, Steven J.M. Jones, Elinor K. Karlsson, Paul J. Kersey, Jin Hyoung Kim, W. John Kress, Shigehiro Kuraku, Mara K. N. Lawniczak, Jim Leebens-Mack, Xueyan Liu, Kerstin Lindblad-Toh, Xin Liu, Jose V. Lopez, Tomas Marques-Bonet, Sophie Mazard, Jonna A. K. Mazet, Camila J. Mazzoni, Eugene W. Myers, Rachel J. O’Neill, Sadye Paez, Hyun Ju Park, Gene E. Robinson, Cristina Roquet, Oliver A. Ryder, Jamal S. M. Sabir, H. Bradley Shaffer, Timothy M. Shank, Jacob S. Sherkow, Pamela S. Soltis, Bo-Ping Tang, Leho Tedersoo, Marcela Uliano-Silva, Kun Wang, Xiaofeng Wei, Regina Wetzer, Julia Wilson, Xun Xu, Huanming Yang, Anne D. Yoder, Guojie Zhang - Show less +81 more

18 Jan 2022-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: This dissertation aims to provide a history of web exceptionalism from 1989 to 2002, a period chosen in order to explore its roots as well as specific cases up to and including the year in which descriptions of “Web 2.0” began to circulate.

...read moreread less

Abstract: Harris A. Lewin , Stephen Richards , Erez Lieberman Aiden, Miguel L. Allende , John M. Archibald, Mikl os B alint, Katharine B. Barker, Bridget Baumgartner, Katherine Belov, Giorgio Bertorelle, Mark L. Blaxter , Jing Cai, Nicolette D. Caperello, Keith Carlson, Juan Carlos Castilla-Rubio, Shu-Miaw Chaw, Lei Chen, Anna K. Childers, Jonathan A. Coddington , Dalia A. Conde , Montserrat Corominas , Keith A. Crandall , Andrew J. Crawford, Federica DiPalma, Richard Durbin , ThankGod E. Ebenezer, Scott V. Edwards , Olivier Fedrigo, Paul Flicek, Giulio Formenti, Richard A. Gibbs, M. Thomas P. Gilbert , Melissa M. Goldstein, Jennifer Marshall Graves , Henry T. Greely , Igor V. Grigoriev , Kevin J. Hackett, Neil Hall, David Haussler, Kristofer M. Helgen, Carolyn J. Hogg , Sachiko Isobe, Kjetill Sigurd Jakobsen , Axel Janke , Erich D. Jarvis, Warren E. Johnson , Steven J. M. Jones, Elinor K. Karlsson , Paul J. Kersey, Jin-Hyoung Kim, W. John Kress , Shigehiro Kuraku, Mara K. N. Lawniczak, James H. Leebens-Mack , Xueyan Li, Kerstin Lindblad-Toh , Xin Liu, Jose V. Lopez, Tomas Marques-Bonet , Sophie Mazard, Jonna A. K. Mazet , Camila J. Mazzoni, Eugene W. Myers , Rachel J. O’Neill, Sadye Paez, Hyun Park, Gene E. Robinson , Cristina Roquet , Oliver A. Ryder , Jamal S. M. Sabir , H. Bradley Shaffer , Timothy M. Shank, Jacob S. Sherkow , Pamela S. Soltis , Boping Tang , Leho Tedersoo, Marcela Uliano-Silva, Kun Wang, Xiaofeng Wei, Regina Wetzer, Julia L. Wilson, Xun Xu, Huanming Yang, Anne D. Yoder , and Guojie Zhang

...read moreread less

83 citations

1
2
3
4
…
5
6
7
8

Collapse

Cited by

PDF

Open Access

More filters

Integrative Genomics Viewer

[...]

James T. Robinson¹, Helga Thorvaldsdottir¹, Wendy Winckler¹, Mitchell Guttman¹, Eric S. Lander¹, Eric S. Lander², Gad Getz¹, Jill P. Mesirov¹ - Show less +4 more•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

01 Jan 2011

TL;DR: The sheer volume and scope of data posed by this flood of data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

...read moreread less

Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

...read moreread less

2,187 citations

Journal Article•DOI•

The complete sequence of a human genome

[...]

Sergey Koren¹, Sergey Nurk¹, Mikko Rautiainen¹, B Ren², Weijun Zhu¹, Richard Lawless³, Саидмуродов Мамур Таирович¹ - Show less +3 more•Institutions (3)

National Human Genome Research Institute¹, Howard Hughes Medical Institute², University of California, Santa Cruz³

01 Apr 2022-Science

TL;DR: The T2T-CHM13-T2T Consortium presented a complete 3.055 billion-base pair sequence of a human genome, including gapless assemblies for all chromosomes except Y, corrected errors in the prior references, and introduced nearly 200 million base pairs of sequence containing gene predictions, 99 of which are predicted to be protein coding as discussed by the authors .

...read moreread less

Abstract: Since its initial release in 2000, the human reference genome has covered only the euchromatic fraction of the genome, leaving important heterochromatic regions unfinished. Addressing the remaining 8% of the genome, the Telomere-to-Telomere (T2T) Consortium presents a complete 3.055 billion-base pair sequence of a human genome, T2T-CHM13, that includes gapless assemblies for all chromosomes except Y, corrects errors in the prior references, and introduces nearly 200 million base pairs of sequence containing 1956 gene predictions, 99 of which are predicted to be protein coding. The completed regions include all centromeric satellite arrays, recent segmental duplications, and the short arms of all five acrocentric chromosomes, unlocking these complex regions of the genome to variational and functional studies.

...read moreread less

717 citations

Journal Article•DOI•

Towards complete and error-free genome assemblies of all vertebrate species

[...]

Arang Rhie¹, Shane A. McCarthy², Shane A. McCarthy³, Olivier Fedrigo⁴, Joana Damas⁵, Giulio Formenti⁴, Sergey Koren¹, Marcela Uliano-Silva⁶, William Chow², Arkarachai Fungtammasan, J. H. Kim⁷, Chul Hee Lee⁷, Byung June Ko⁷, Mark Chaisson⁸, Gregory Gedman⁴, Lindsey J. Cantin⁴, Françoise Thibaud-Nissen¹, Leanne Haggerty⁹, Iliana Bista², Iliana Bista³, Michelle Smith², Bettina Haase⁴, Jacquelyn Mountcastle⁴, Sylke Winkler¹⁰, Sylke Winkler¹¹, Sadye Paez⁴, Jason T. Howard, Sonja C. Vernes¹², Sonja C. Vernes¹³, Sonja C. Vernes¹¹, Tanya M. Lama¹⁴, Frank Grützner¹⁵, Wesley C. Warren¹⁶, Christopher N. Balakrishnan¹⁷, Dave W Burt¹⁸, Jimin George¹⁹, Matthew T. Biegler⁴, David Iorns, Andrew Digby, Daryl Eason, Bruce C. Robertson²⁰, Taylor Edwards²¹, Mark Wilkinson²², George F. Turner²³, Axel Meyer²⁴, Andreas F. Kautt²⁴, Andreas F. Kautt²⁵, Paolo Franchini²⁴, H. William Detrich²⁶, Hannes Svardal²⁷, Hannes Svardal²⁸, Maximilian Wagner²⁹, Gavin J. P. Naylor³⁰, Martin Pippel¹¹, Milan Malinsky³¹, Milan Malinsky², Mark Mooney, Maria Simbirsky, Brett T. Hannigan, Trevor Pesout³², Marlys L. Houck³³, Ann C Misuraca³³, Sarah B. Kingan³⁴, Richard Hall³⁴, Zev N. Kronenberg³⁴, Ivan Sović³⁴, Christopher Dunn³⁴, Zemin Ning², Alex Hastie, Joyce V. Lee, Siddarth Selvaraj, Richard E. Green³², Nicholas H. Putnam, Ivo Gut³⁵, Jay Ghurye³⁶, Erik Garrison³², Ying Sims², Joanna Collins², Sarah Pelan², James Torrance², Alan Tracey², Jonathan Wood², Robel E. Dagnew⁸, Dengfeng Guan³⁷, Dengfeng Guan³, Sarah E. London³⁸, David F. Clayton¹⁹, Claudio V. Mello³⁹, Samantha R. Friedrich³⁹, Peter V. Lovell³⁹, Ekaterina Osipova¹¹, Farooq O. Al-Ajli⁴⁰, Farooq O. Al-Ajli⁴¹, Simona Secomandi⁴², Heebal Kim⁷, Constantina Theofanopoulou⁴, Michael Hiller⁴³, Yang Zhou, Robert S. Harris⁴⁴, Kateryna D. Makova⁴⁴, Paul Medvedev⁴⁴, Jinna Hoffman¹, Patrick Masterson¹, Karen Clark¹, Fergal J. Martin⁹, Kevin L. Howe⁹, Paul Flicek⁹, Brian P. Walenz¹, Woori Kwak, Hiram Clawson³², Mark Diekhans³², Luis R Nassar³², Benedict Paten³², Robert H. S. Kraus²⁴, Robert H. S. Kraus¹¹, Andrew J. Crawford⁴⁵, M. Thomas P. Gilbert⁴⁶, M. Thomas P. Gilbert⁴⁷, Guojie Zhang, Byrappa Venkatesh⁴⁸, Robert W. Murphy⁴⁹, Klaus-Peter Koepfli⁵⁰, Beth Shapiro³², Beth Shapiro⁵¹, Warren E. Johnson⁵⁰, Warren E. Johnson⁵², Federica Di Palma⁵³, Tomas Marques-Bonet, Emma C. Teeling⁵⁴, Tandy Warnow⁵⁵, Jennifer A. Marshall Graves⁵⁶, Oliver A. Ryder⁵⁷, Oliver A. Ryder³³, David Haussler³², Stephen J. O'Brien⁵⁸, Jonas Korlach³⁴, Harris A. Lewin⁵, Kerstin Howe², Eugene W. Myers¹¹, Eugene W. Myers¹⁰, Richard Durbin², Richard Durbin³, Adam M. Phillippy¹, Erich D. Jarvis⁵¹, Erich D. Jarvis⁴ - Show less +141 more•Institutions (58)

National Institutes of Health¹, Wellcome Trust Sanger Institute², University of Cambridge³, Rockefeller University⁴, University of California, Davis⁵, Leibniz Association⁶, Seoul National University⁷, University of Southern California⁸, European Bioinformatics Institute⁹, Dresden University of Technology¹⁰, Max Planck Society¹¹, University of St Andrews¹², Radboud University Nijmegen¹³, University of Massachusetts Amherst¹⁴, University of Adelaide¹⁵, University of Missouri¹⁶, East Carolina University¹⁷, University of Queensland¹⁸, Clemson University¹⁹, University of Otago²⁰, University of Arizona²¹, Natural History Museum²², Bangor University²³, University of Konstanz²⁴, Harvard University²⁵, Northeastern University²⁶, National Museum of Natural History²⁷, University of Antwerp²⁸, University of Graz²⁹, University of Florida³⁰, University of Basel³¹, University of California, Santa Cruz³², Zoological Society of San Diego³³, Pacific Biosciences³⁴, Pompeu Fabra University³⁵, University of Maryland, College Park³⁶, Harbin Institute of Technology³⁷, University of Chicago³⁸, Oregon Health & Science University³⁹, Monash University Malaysia Campus⁴⁰, Qatar Airways⁴¹, University of Milan⁴², Goethe University Frankfurt⁴³, Pennsylvania State University⁴⁴, University of Los Andes⁴⁵, University of Copenhagen⁴⁶, Norwegian University of Science and Technology⁴⁷, Agency for Science, Technology and Research⁴⁸, Royal Ontario Museum⁴⁹, Smithsonian Institution⁵⁰, Howard Hughes Medical Institute⁵¹, Walter Reed Army Institute of Research⁵², University of East Anglia⁵³, University College Dublin⁵⁴, University of Illinois at Urbana–Champaign⁵⁵, La Trobe University⁵⁶, University of California, San Diego⁵⁷, Nova Southeastern University⁵⁸

28 Apr 2021-Nature

...read moreread less

647 citations

Journal Article•DOI•

Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies

[...]

Arang Rhie¹, Brian P. Walenz¹, Sergey Koren¹, Adam M. Phillippy¹•Institutions (1)

National Institutes of Health¹

14 Sep 2020-Genome Biology

TL;DR: This work presents Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations, and demonstrates on both human and plant genomes that it is a fast and robust method for assembly validation.

...read moreread less

Abstract: Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to those found in unassembled high-accuracy reads, Merqury estimates base-level accuracy and completeness. For trios, Merqury can also evaluate haplotype-specific accuracy, completeness, phase block continuity, and switch errors. Multiple visualizations, such as k-mer spectrum plots, can be generated for evaluation. We demonstrate on both human and plant genomes that Merqury is a fast and robust method for assembly validation.

...read moreread less

477 citations

Journal Article•DOI•

Significantly improving the quality of genome assemblies through curation

[...]

Kerstin Howe¹, William Chow¹, Joanna Collins¹, Sarah Pelan¹, Damon-Lee Pointon¹, Ying Sims¹, James Torrance¹, Alan Tracey¹, Jonathan Wood¹ - Show less +5 more•Institutions (1)

Wellcome Trust Sanger Institute¹

09 Jan 2021-GigaScience

TL;DR: In this paper, a tried and tested approach for genome curation using gEVAL, the genome evaluation browser, is described and recommended for assembly curation in a GEVAL-independent context to facilitate the uptake of genome curations in the wider community.

...read moreread less

Abstract: Genome sequence assemblies provide the basis for our understanding of biology. Generating error-free assemblies is therefore the ultimate, but sadly still unachieved goal of a multitude of research projects. Despite the ever-advancing improvements in data generation, assembly algorithms and pipelines, no automated approach has so far reliably generated near error-free genome assemblies for eukaryotes. Whilst working towards improved datasets and fully automated pipelines, assembly evaluation and curation is actively used to bridge this shortcoming and significantly reduce the number of assembly errors. In addition to this increase in product value, the insights gained from assembly curation are fed back into the automated assembly strategy and contribute to notable improvements in genome assembly quality. We describe our tried and tested approach for assembly curation using gEVAL, the genome evaluation browser. We outline the procedures applied to genome curation using gEVAL and also our recommendations for assembly curation in a gEVAL-independent context to facilitate the uptake of genome curation in the wider community.

...read moreread less

373 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse