Home
/
Authors
/
J. Rodney Brister

Author

J. Rodney Brister

Other affiliations: University of Florida

Bio: J. Rodney Brister is an academic researcher from National Institutes of Health. The author has contributed to research in topics: Virus classification & Genome. The author has an hindex of 23, co-authored 45 publications receiving 6275 citations. Previous affiliations of J. Rodney Brister include University of Florida.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation

[...]

Nuala A. O'Leary¹, Mathew W. Wright¹, J. Rodney Brister¹, Stacy Ciufo¹, Diana Haddad¹, Richard McVeigh¹, Bhanu Rajput¹, Barbara Robbertse¹, Brian Smith-White¹, Danso Ako-adjei¹, Alexander Astashyn¹, Azat Badretdin¹, Yiming Bao¹, Olga Blinkova¹, Vyacheslav Brover¹, Vyacheslav Chetvernin¹, Jinna Choi¹, Eric Cox¹, Olga Ermolaeva¹, Catherine M. Farrell¹, Tamara Goldfarb¹, Tripti Gupta¹, Daniel H. Haft¹, Eneida L. Hatcher¹, Wratko Hlavina¹, Vinita Joardar¹, Vamsi K. Kodali¹, Wenjun Li¹, Donna Maglott¹, Patrick Masterson¹, Kelly M. McGarvey¹, Michael R. Murphy¹, Kathleen O'Neill¹, Shashikant Pujar¹, Sanjida H. Rangwala¹, Daniel Rausch¹, Lillian D. Riddick¹, Conrad L. Schoch¹, Andrei Shkeda¹, Susan S. Storz¹, Hanzhen Sun¹, Françoise Thibaud-Nissen¹, Igor Tolstoy¹, Raymond E. Tully¹, Anjana R. Vatsan¹, Craig Wallin¹, David Webb¹, Wendy Wu¹, Melissa J. Landrum¹, Avi Kimchi¹, Tatiana Tatusova¹, Michael DiCuccio¹, Paul Kitts¹, Terence Murphy¹, Kim D. Pruitt¹ - Show less +51 more•Institutions (1)

National Institutes of Health¹

04 Jan 2016-Nucleic Acids Research

TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.

...read moreread less

Abstract: The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.

...read moreread less

4,104 citations

Journal Article•DOI•

Uniformity of rotavirus strain nomenclature proposed by the Rotavirus Classification Working Group (RCWG).

[...]

Jelle Matthijnssens¹, Max Ciarlet², Sarah M. McDonald³, Houssam Attoui, Krisztián Bányai⁴, J. Rodney Brister³, Javier Buesa⁵, Mathew D. Esona⁶, Mary K. Estes⁷, Jon R. Gentsch⁶, Miren Iturriza-Gomara⁸, Reimar Johne⁹, Carl D. Kirkwood¹⁰, Vito Martella¹¹, Peter P. C. Mertens, Osamu Nakagomi¹², Viviana Parreño¹³, Mustafizur Rahman¹⁴, Franco Maria Ruggeri¹⁵, Linda J. Saif¹⁶, Norma Santos¹⁷, Andrej Steyer¹⁸, Koki Taniguchi¹⁹, John T. Patton³, Ulrich Desselberger²⁰, Marc Van Ranst¹ - Show less +22 more•Institutions (20)

Katholieke Universiteit Leuven¹, Novartis², National Institutes of Health³, Hungarian Academy of Sciences⁴, University of Valencia⁵, Centers for Disease Control and Prevention⁶, Baylor College of Medicine⁷, Health Protection Agency⁸, Federal Institute for Risk Assessment⁹, Royal Children's Hospital¹⁰, University of Bari¹¹, Nagasaki University¹², International Trademark Association¹³, International Centre for Diarrhoeal Disease Research, Bangladesh¹⁴, Istituto Superiore di Sanità¹⁵, Ohio State University¹⁶, Federal University of Rio de Janeiro¹⁷, University of Ljubljana¹⁸, Fujita Health University¹⁹, University of Cambridge²⁰

20 May 2011-Archives of Virology

TL;DR: With increasing numbers of complete RV genome sequences becoming available, a standardized RV strain nomenclature system is needed, and the RCWG proposes that individual RV strains are named as follows: RV group/species of origin/country of identification/common name/year of identification /G- and P-type.

...read moreread less

Abstract: In April 2008, a nucleotide-sequence-based, complete genome classification system was developed for group A rotaviruses (RVs). This system assigns a specific genotype to each of the 11 genome segments of a particular RV strain according to established nucleotide percent cutoff values. Using this approach, the genome of individual RV strains are given the complete descriptor of Gx-P[x]-Ix-Rx-Cx-Mx-Ax-Nx-Tx-Ex-Hx. The Rotavirus Classification Working Group (RCWG) was formed by scientists in the field to maintain, evaluate and develop the RV genotype classification system, in particular to aid in the designation of new genotypes. Since its conception, the group has ratified 51 new genotypes: as of April 2011, new genotypes for VP7 (G20-G27), VP4 (P[28]-P[35]), VP6 (I12-I16), VP1 (R5-R9), VP2 (C6-C9), VP3 (M7-M8), NSP1 (A15-A16), NSP2 (N6-N9), NSP3 (T8-T12), NSP4 (E12-E14) and NSP5/6 (H7-H11) have been defined for RV strains recovered from humans, cows, pigs, horses, mice, South American camelids (guanaco), chickens, turkeys, pheasants, bats and a sugar glider. With increasing numbers of complete RV genome sequences becoming available, a standardized RV strain nomenclature system is needed, and the RCWG proposes that individual RV strains are named as follows: RV group/species of origin/country of identification/common name/year of identification/G- and P-type. In collaboration with the National Center for Biotechnology Information (NCBI), the RCWG is also working on developing a RV-specific resource for the deposition of nucleotide sequences. This resource will provide useful information regarding RV strains, including, but not limited to, the individual gene genotypes and epidemiological and clinical information. Together, the proposed nomenclature system and the NCBI RV resource will offer highly useful tools for investigators to search for, retrieve, and analyze the ever-growing volume of RV genomic data.

...read moreread less

836 citations

Journal Article•DOI•

Consensus statement: Virus taxonomy in the age of metagenomics

[...]

Peter Simmonds¹, Michael J. Adams, Mária Benkő², Mya Breitbart³, J. Rodney Brister⁴, Eric B. Carstens⁵, Andrew J. Davison⁶, Eric Delwart⁷, Eric Delwart⁸, Alexander E. Gorbalenya⁹, Alexander E. Gorbalenya¹⁰, Balázs Harrach², Roger Hull¹¹, Andrew M. Q. King¹², Eugene V. Koonin⁴, Mart Krupovic¹³, Jens H. Kuhn⁴, Elliot J. Lefkowitz¹⁴, Max L. Nibert¹⁵, Richard J. Orton⁶, Marilyn J. Roossinck¹⁶, Sead Sabanadzovic¹⁷, Matthew B. Sullivan¹⁸, Curtis A. Suttle¹⁹, Curtis A. Suttle²⁰, Robert B. Tesh²¹, René van der Vlugt²², Arvind Varsani²³, F. Murilo Zerbini²⁴ - Show less +25 more•Institutions (24)

University of Oxford¹, Hungarian Academy of Sciences², University of South Florida³, National Institutes of Health⁴, Queen's University⁵, Medical Research Council⁶, University of California, San Francisco⁷, Systems Research Institute⁸, Moscow State University⁹, Leiden University Medical Center¹⁰, John Innes Centre¹¹, Institute for Animal Health¹², Pasteur Institute¹³, University of Alabama at Birmingham¹⁴, Harvard University¹⁵, Pennsylvania State University¹⁶, Mississippi State University¹⁷, Ohio State University¹⁸, Canadian Institute for Advanced Research¹⁹, University of British Columbia²⁰, University of Texas Medical Branch²¹, Wageningen University and Research Centre²², Arizona State University²³, Universidade Federal de Viçosa²⁴

01 Mar 2017-Nature Reviews Microbiology

TL;DR: The rationale for why metagenomic sequence data should, and how it can, be incorporated into the ICTV taxonomy is considered, and present proposals that have been endorsed by the Executive Committee of the ITV.

...read moreread less

Abstract: The number and diversity of viral sequences that are identified in metagenomic data far exceeds that of experimentally characterized virus isolates. In a recent workshop, a panel of experts discussed the proposal that, with appropriate quality control, viruses that are known only from metagenomic data can, and should be, incorporated into the official classification scheme of the International Committee on Taxonomy of Viruses (ICTV). Although a taxonomy that is based on metagenomic sequence data alone represents a substantial departure from the traditional reliance on phenotypic properties, the development of a robust framework for sequence-based virus taxonomy is indispensable for the comprehensive characterization of the global virome. In this Consensus Statement article, we consider the rationale for why metagenomic sequence data should, and how it can, be incorporated into the ICTV taxonomy, and present proposals that have been endorsed by the Executive Committee of the ICTV.

...read moreread less

525 citations

Journal Article•DOI•

NCBI Viral Genomes Resource

[...]

J. Rodney Brister¹, Danso Ako-adjei¹, Yiming Bao¹, Olga Blinkova¹•Institutions (1)

National Institutes of Health¹

28 Jan 2015-Nucleic Acids Research

TL;DR: The NCBI Viral Genomes Resource is a reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data.

...read moreread less

Abstract: Recent technological innovations have ignited an explosion in virus genome sequencing that promises to fundamentally alter our understanding of viral biology and profoundly impact public health policy. Yet, any potential benefits from the billowing cloud of next generation sequence data hinge upon well implemented reference resources that facilitate the identification of sequences, aid in the assembly of sequence reads and provide reference annotation sources. The NCBI Viral Genomes Resource is a reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data. The resource can be accessed at http://www.ncbi.nlm.nih.gov/genome/viruses/ and catalogs all publicly available virus genome sequences and curates reference genome sequences. As the number of genome sequences has grown, so too have the difficulties in annotating and maintaining reference sequences. The rapid expansion of the viral sequence universe has forced a recalibration of the data model to better provide extant sequence representation and enhanced reference sequence products to serve the needs of the various viral communities. This, in turn, has placed increased emphasis on leveraging the knowledge of individual scientific communities to identify important viral sequences and develop well annotated reference virus genome sets.

...read moreread less

467 citations

Journal Article•DOI•

Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks

[...]

Ho Bin Jang¹, Benjamin Bolduc¹, Olivier Zablocki¹, Jens H. Kuhn², Simon Roux³, Evelien M. Adriaenssens⁴, Evelien M. Adriaenssens⁵, J. Rodney Brister², Andrew M. Kropinski⁶, Andrew M. Kropinski⁷, Mart Krupovic⁸, Rob Lavigne⁹, Dann Turner¹⁰, Matthew B. Sullivan¹ - Show less +10 more•Institutions (10)

Ohio State University¹, National Institutes of Health², United States Department of Energy³, Norwich Research Park⁴, University of Liverpool⁵, University of Guelph⁶, Ontario Veterinary College⁷, Pasteur Institute⁸, Katholieke Universiteit Leuven⁹, University of the West of England¹⁰

06 May 2019-Nature Biotechnology

TL;DR: This work presents vConTACT v.2.0, a network-based application utilizing whole genome gene-sharing profiles for virus taxonomy that integrates distance-based hierarchical clustering and confidence scores for all taxonomic predictions, and applies it to analyze 15,280 Global Ocean Virome genome fragments.

...read moreread less

Abstract: Microbiomes from every environment contain a myriad of uncultivated archaeal and bacterial viruses, but studying these viruses is hampered by the lack of a universal, scalable taxonomic framework. We present vConTACT v.2.0, a network-based application utilizing whole genome gene-sharing profiles for virus taxonomy that integrates distance-based hierarchical clustering and confidence scores for all taxonomic predictions. We report near-identical (96%) replication of existing genus-level viral taxonomy assignments from the International Committee on Taxonomy of Viruses for National Center for Biotechnology Information virus RefSeq. Application of vConTACT v.2.0 to 1,364 previously unclassified viruses deposited in virus RefSeq as reference genomes produced automatic, high-confidence genus assignments for 820 of the 1,364. We applied vConTACT v.2.0 to analyze 15,280 Global Ocean Virome genome fragments and were able to provide taxonomic assignments for 31% of these data, which shows that our algorithm is scalable to very large metagenomic datasets. Our taxonomy tool can be automated and applied to metagenomes from any environment for virus classification.

...read moreread less

434 citations

1
2
3
4
…
5
6
7
8
9
10

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

KEGG: new perspectives on genomes, pathways, diseases and drugs

[...]

Minoru Kanehisa¹, Miho Furumichi¹, Mao Tanabe¹, Yoko Sato², Kanae Morishima¹ - Show less +1 more•Institutions (2)

Kyoto University¹, Fujitsu²

04 Jan 2017-Nucleic Acids Research

TL;DR: The content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases, and the newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined.

...read moreread less

Abstract: KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an encyclopedia of genes and genomes. Assigning functional meanings to genes and genomes both at the molecular and higher levels is the primary objective of the KEGG database project. Molecular-level functions are stored in the KO (KEGG Orthology) database, where each KO is defined as a functional ortholog of genes and proteins. Higher-level functions are represented by networks of molecular interactions, reactions and relations in the forms of KEGG pathway maps, BRITE hierarchies and KEGG modules. In the past the KO database was developed for the purpose of defining nodes of molecular networks, but now the content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases. The newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined. Furthermore, the DISEASE and DRUG databases have been improved by systematic analysis of drug labels for better integration of diseases and drugs with the KEGG molecular networks. KEGG is moving towards becoming a comprehensive knowledge base for both functional interpretation and practical application of genomic information.

...read moreread less

5,741 citations

Journal Article•DOI•

The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2

[...]

Alexander E. Gorbalenya¹, Susan C. Baker², Ralph S. Baric³, Raoul J. de Groot⁴, Christian Drosten, Anastasia A. Gulyaeva¹, Bart L. Haagmans⁵, Chris Lauber¹, Andrey M. Leontovich⁶, Benjamin W. Neuman⁷, Dmitry Penzar⁶, Stanley Perlman⁸, Leo L.M. Poon⁹, Dmitry V. Samborskiy⁶, Igor A. Sidorov¹, Isabel Sola¹⁰, John Ziebuhr¹¹ - Show less +13 more•Institutions (11)

Leiden University¹, Loyola University Chicago², University of North Carolina at Chapel Hill³, Utrecht University⁴, Erasmus University Rotterdam⁵, Moscow State University⁶, Texas A&M University–Texarkana⁷, University of Iowa⁸, University of Hong Kong⁹, Spanish National Research Council¹⁰, University of Giessen¹¹

02 Mar 2020-Nature microbiology

TL;DR: The independent zoonotic transmission of SARS-CoV and SARS -CoV-2 highlights the need for studying viruses at the species level to complement research focused on individual pathogenic viruses of immediate significance.

...read moreread less

Abstract: The present outbreak of a coronavirus-associated acute respiratory disease called coronavirus disease 19 (COVID-19) is the third documented spillover of an animal coronavirus to humans in only two decades that has resulted in a major epidemic. The Coronaviridae Study Group (CSG) of the International Committee on Taxonomy of Viruses, which is responsible for developing the classification of viruses and taxon nomenclature of the family Coronaviridae, has assessed the placement of the human pathogen, tentatively named 2019-nCoV, within the Coronaviridae. Based on phylogeny, taxonomy and established practice, the CSG recognizes this virus as forming a sister clade to the prototype human and bat severe acute respiratory syndrome coronaviruses (SARS-CoVs) of the species Severe acute respiratory syndrome-related coronavirus, and designates it as SARS-CoV-2. In order to facilitate communication, the CSG proposes to use the following naming convention for individual isolates: SARS-CoV-2/host/location/isolate/date. While the full spectrum of clinical manifestations associated with SARS-CoV-2 infections in humans remains to be determined, the independent zoonotic transmission of SARS-CoV and SARS-CoV-2 highlights the need for studying viruses at the species level to complement research focused on individual pathogenic viruses of immediate significance. This will improve our understanding of virus–host interactions in an ever-changing environment and enhance our preparedness for future outbreaks.

...read moreread less

5,527 citations

Journal Article•DOI•

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation

[...]

National Institutes of Health¹

04 Jan 2016-Nucleic Acids Research

...read moreread less

4,104 citations

Journal Article•DOI•

NCBI prokaryotic genome annotation pipeline

[...]

Tatiana Tatusova¹, Michael DiCuccio¹, Azat Badretdin¹, Vyacheslav Chetvernin¹, Eric P. Nawrocki¹, Leonid Zaslavsky¹, Alexandre Lomsadze², Kim D. Pruitt¹, Mark Borodovsky², James Ostell¹ - Show less +6 more•Institutions (2)

National Institutes of Health¹, Georgia Institute of Technology²

19 Aug 2016-Nucleic Acids Research

TL;DR: The new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies less on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence.

...read moreread less

Abstract: Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/.

...read moreread less

3,902 citations

Journal Article•DOI•

A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at its Core

[...]

Lukas Zimmermann¹, Andrew Stephens¹, Seung-Zin Nam¹, David Rau¹, Jonas M. Kübler¹, Marko Lozajic¹, Felix Gabler¹, Johannes Söding¹, Andrei N. Lupas¹, Vikram Alva¹ - Show less +6 more•Institutions (1)

Max Planck Society¹

01 Dec 2017-Journal of Molecular Biology

TL;DR: The new version of the MPI Bioinformatics Toolkit is introduced, focusing on improved features for the comprehensive analysis of proteins, as well as on promoting teaching.

...read moreread less

1,757 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse