M
Michael DiCuccio
Researcher at National Institutes of Health
Publications - 31
Citations - 12307
Michael DiCuccio is an academic researcher from National Institutes of Health. The author has contributed to research in topics: Genome & RefSeq. The author has an hindex of 19, co-authored 28 publications receiving 8752 citations. Previous affiliations of Michael DiCuccio include Immersion Corporation.
Papers
More filters
Journal ArticleDOI
Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation
Nuala A. O'Leary,Mathew W. Wright,J. Rodney Brister,Stacy Ciufo,Diana Haddad,Richard McVeigh,Bhanu Rajput,Barbara Robbertse,Brian Smith-White,Danso Ako-adjei,Alexander Astashyn,Azat Badretdin,Yiming Bao,Olga Blinkova,Vyacheslav Brover,Vyacheslav Chetvernin,Jinna Choi,Eric Cox,Olga Ermolaeva,Catherine M. Farrell,Tamara Goldfarb,Tripti Gupta,Daniel H. Haft,Eneida L. Hatcher,Wratko Hlavina,Vinita Joardar,Vamsi K. Kodali,Wenjun Li,Donna Maglott,Patrick Masterson,Kelly M. McGarvey,Michael R. Murphy,Kathleen O'Neill,Shashikant Pujar,Sanjida H. Rangwala,Daniel Rausch,Lillian D. Riddick,Conrad L. Schoch,Andrei Shkeda,Susan S. Storz,Hanzhen Sun,Françoise Thibaud-Nissen,Igor Tolstoy,Raymond E. Tully,Anjana R. Vatsan,Craig Wallin,David Webb,Wendy Wu,Melissa J. Landrum,Avi Kimchi,Tatiana Tatusova,Michael DiCuccio,Paul Kitts,Terence Murphy,Kim D. Pruitt +54 more
TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.
Journal ArticleDOI
NCBI prokaryotic genome annotation pipeline
Tatiana Tatusova,Michael DiCuccio,Azat Badretdin,Vyacheslav Chetvernin,Eric P. Nawrocki,Leonid Zaslavsky,Alexandre Lomsadze,Kim D. Pruitt,Mark Borodovsky,James Ostell +9 more
TL;DR: The new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies less on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence.
Journal ArticleDOI
RefSeq: an update on mammalian reference sequences
Kim D. Pruitt,Garth Brown,Susan M. Hiatt,Françoise Thibaud-Nissen,Alexander Astashyn,Olga Ermolaeva,Catherine M. Farrell,Jennifer Hart,Melissa J. Landrum,Kelly M. McGarvey,Michael R. Murphy,Nuala A. O'Leary,Shashikant Pujar,Bhanu Rajput,Sanjida H. Rangwala,Lillian D. Riddick,Andrei Shkeda,Hanzhen Sun,Pamela Tamez,Raymond E. Tully,Craig Wallin,David Webb,Janet Weber,Wendy Wu,Michael DiCuccio,Paul Kitts,Donna Maglott,Terence Murphy,James Ostell +28 more
TL;DR: The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of annotated genomic, transcript and protein sequence records derived from data in public sequence archives and from computation, curation and collaboration.
Journal ArticleDOI
RefSeq: an update on prokaryotic genome annotation and curation.
Daniel H. Haft,Michael DiCuccio,Azat Badretdin,Vyacheslav Brover,Vyacheslav Chetvernin,Kathleen O'Neill,Wenjun Li,Farideh Chitsaz,Myra K. Derbyshire,Noreen R. Gonzales,Marc Gwadz,Fu Lu,Gabriele H. Marchler,James S. Song,Narmada Thanki,Roxanne A. Yamashita,Chanjuan Zheng,Françoise Thibaud-Nissen,Lewis Y. Geer,Aron Marchler-Bauer,Kim D. Pruitt +20 more
TL;DR: The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information provides annotation for over 95 000 prokaryotic genomes that meet standards for sequence quality, completeness, and freedom from contamination.
Journal ArticleDOI
The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes
Kim D. Pruitt,Jennifer Harrow,Rachel A. Harte,Craig Wallin,Mark Diekhans,Donna Maglott,Steve Searle,Catherine M. Farrell,Jane E. Loveland,Barbara J. Ruef,Elizabeth M. Hart,Marie-Marthe Suner,Melissa J. Landrum,Bronwen Aken,Sarah Ayling,Robert Baertsch,Julio Fernandez-Banet,Joshua L. Cherry,Val Curwen,Michael DiCuccio,Manolis Kellis,Jennifer M. Lee,Michael F. Lin,Michael Schuster,Andrew Shkeda,Clara Amid,Garth Brown,Oksana Dukhanina,Adam Frankish,Jennifer Hart,Bonnie L. Maidak,Jonathan M. Mudge,Michael R. Murphy,Terence Murphy,Jeena Rajan,Bhanu Rajput,Lillian D. Riddick,Catherine E. Snow,Charles A. Steward,David Webb,Janet Weber,Laurens G. Wilming,Wenyu Wu,Ewan Birney,David Haussler,Tim Hubbard,James Ostell,Richard Durbin,David J. Lipman +48 more
TL;DR: The CCDS database centralizes the function of identifying well-supported, identically-annotated, protein-coding regions and indicates that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS.