Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation
Citations
5,741 citations
Additional excerpts
...The viruses category is generated from the bi-monthly release of RefSeq, where individual viral genomes are not distinguished and the category is identified by the two-letter code ‘vg’....
[...]
...This category also contains GENOME and GENES, which are derived from RefSeq (3), GenBank (4) and NCBI Taxonomy (5) databases and given KEGG original annotations....
[...]
...The definition field of each GENES entry contains the data source name in parentheses, such as (RefSeq) and (GenBank), indicating that the definition was given by the original database as shown in Figure 1A. KEGG original annotation is given in the following KO subfield, K19188 in this case....
[Table fragment, Database name: Subject. KEGG Cancer: Cancer research; KEGG Pathogen: Infectious diseases, pathogens and antimicrobial resistance; KEGG Virus: Virus research; KEGG Plant: Plant research; KEGG Glycan: Glycobiology research; KEGG Annotation: KO annotation of genes and proteins; KEGG RModule: Architecture of metabolic network]
[...]
...The KEGG organisms category is the main part of GENES consisting of completely or almost completely sequenced genomes taken from RefSeq and GenBank databases....
[...]
1,757 citations
1,517 citations
1,394 citations
1,323 citations
Cites methods from "Reference sequence (RefSeq) databas..."
...Specifically, the miRNA sequences were downloaded from miRBase version 22 (3); target transcript sequences were retrieved from the NCBI RefSeq database (12) and further parsed with BioPerl to extract the 3’-UTR sequences....
[...]
References
22,269 citations
4,705 citations
4,229 citations
"Reference sequence (RefSeq) databas..." cites this paper for background
...Curators generally work from lists of genes with data conflicts identified by quality assurance (QA) tests, some of which were previously described (12)....
[...]
4,116 citations
4,050 citations