scispace - formally typeset
Search or ask a question
Journal ArticleDOI

CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database

TL;DR: A new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes, able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants.
Abstract: The Comprehensive Antibiotic Resistance Database (CARD; https://card.mcmaster.ca) is a curated resource providing reference DNA and protein sequences, detection models and bioinformatics tools on the molecular basis of bacterial antimicrobial resistance (AMR). CARD focuses on providing high-quality reference data and molecular sequences within a controlled vocabulary, the Antibiotic Resistance Ontology (ARO), designed by the CARD biocuration team to integrate with software development efforts for resistome analysis and prediction, such as CARD's Resistance Gene Identifier (RGI) software. Since 2017, CARD has expanded through extensive curation of reference sequences, revision of the ontological structure, curation of over 500 new AMR detection models, development of a new classification paradigm and expansion of analytical tools. Most notably, a new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes. By adding these resistance variants to CARD, we are able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants. Here, we describe updates and recent expansions to CARD and its biocuration process, including new resources for community biocuration of AMR molecular reference data.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: A review of high throughput technologies applied to microbial diversity and antimicrobial resistance studies in aquacultures can be found in this paper, where a snapshot of these high-throughput technologies is presented.
Abstract: The shortage of wild fishery resources and the rising demand for human nutrition has driven a great expansion in aquaculture during the last decades in terms of production and economic value. As such, sustainable aquaculture production is one of the main priorities of the European Union’s 2030 agenda. However, the intensification of seafood farming has resulted in higher risks of disease outbreaks and in the increased use of antimicrobials to control them. The selective pressure exerted by these drugs provides the ideal conditions for the emergence of antimicrobial resistance hotspots in aquaculture facilities. Omics technology is an umbrella term for modern technologies such as genomics, metagenomics, transcriptomics, proteomics, culturomics, and metabolomics. These techniques have received increasing recognition because of their potential to unravel novel mechanisms in biological science. Metagenomics allows the study of genomes in microbial communities contained within a certain environment. The potential uses of metagenomics in aquaculture environments include the study of microbial diversity, microbial functions, and antibiotic resistance genes. A snapshot of these high throughput technologies applied to microbial diversity and antimicrobial resistance studies in aquacultures will be presented in this review.

5 citations

Journal ArticleDOI
TL;DR: In this paper, the authors investigated the genomic context of Type B dihydrofolate reductase (dfrb) genes to gain information on their current distribution in bacterial genomes and found that the intrinsically TMP-resistant dfrbs are a potential emerging threat to public health and justify closer surveillance of these genes.
Abstract: Type B dihydrofolate reductase (dfrb) genes were identified following the introduction of trimethoprim in the 1960s. Although they intrinsically confer resistance to trimethoprim (TMP) that is orders of magnitude greater than through other mechanisms, the distribution and prevalence of these short (237 bp) genes is unknown. Indeed, this knowledge has been hampered by systematic biases in search methodologies. Here, we investigate the genomic context of dfrbs to gain information on their current distribution in bacterial genomes. Upon searching publicly available databases, we identified 61 sequences containing dfrbs within an analyzable genomic context. The majority (70%) of those sequences also harbor virulence genes and 97% of the dfrbs are found near a mobile genetic element, representing a potential risk for antibiotic resistance genes. We further identified and confirmed the TMP-resistant phenotype of two new members of the family, dfrb10 and dfrb11. Dfrbs are found both in Betaproteobacteria and Gammaproteobacteria, a majority (59%) being in Pseudomonas aeruginosa. Previously labelled as strictly plasmid-borne, we found 69% of dfrbs in the chromosome of pathogenic bacteria. Our results demonstrate that the intrinsically TMP-resistant dfrbs are a potential emerging threat to public health and justify closer surveillance of these genes.

5 citations

Posted ContentDOI
08 Oct 2020-bioRxiv
TL;DR: Comparative sequence analysis strongly inferred that pAC1633-1/pAC1530 was derived from two separate plasmids in an IS1006-mediated recombination or transposition event, suggestive of conjugative transfer but mating experiments failed to demonstrate transmissibility under standard laboratory conditions.
Abstract: Carbapenem-resistant Acinetobacter spp. are considered priority drug-resistant human pathogenic bacteria. The genomes of two carbapenem-resistant Acinetobacter spp. clinical isolates obtained from the same tertiary hospital in Terengganu, Malaysia, namely A. baumannii AC1633 and A. nosocomialis AC1530, were sequenced. Both isolates were found to harbor the carbapenemase genes blaNDM-1 and blaOXA-58 in a large (ca. 170 kb) plasmid designated pAC1633-1 and pAC1530, respectively, that also encodes genes that confer resistance to aminoglycosides, sulfonamides, and macrolides. The two plasmids were almost identical except for the insertion of ISAba11 and an IS4 family element in pAC1633-1, and ISAba11 along with relBE toxin-antitoxin genes flanked by inversely orientated pdif (XerC/XerD) recombination sites in pAC1530. The blaNDM-1 gene was encoded in a Tn125 composite transposon structure flanked by ISAba125 whereas blaOXA-58 was flanked by ISAba11 and ISAba3 downstream and a partial ISAba3 element upstream within a pdif module. The presence of conjugative genes in plasmids pAC1633-1/pAC1530 and their discovery in two distinct species of Acinetobacter from the same hospital are suggestive of conjugative transfer but mating experiments failed to demonstrate transmissibility under standard laboratory conditions. Comparative sequence analysis strongly inferred that pAC1633-1/pAC1530 was derived from two separate plasmids in an IS1006-mediated recombination or transposition event. A. baumannii AC1633 also harbored three other plasmids designated pAC1633-2, pAC1633-3 and pAC1633-4. Both pAC1633-3 and pAC1633-4 are cryptic plasmids whereas pAC1633-2 is a 12,651 bp plasmid of the GR8/GR23 Rep3-superfamily group that encodes the tetA(39) tetracycline resistance determinant in a pdif module.

5 citations


Additional excerpts

  • ...ca/) (82), whereas ISFinder (https://isfinder....

    [...]

Journal ArticleDOI
TL;DR: This work describes how to curate the genes, point mutations and blast rules, and hidden Markov models used in NCBI’s AMRFinderPlus, along with the quality-control steps taken to ensure database quality, and discusses how the computed analyses generated by those tools can be accessed through a web interface.
Abstract: Antimicrobial resistance (AMR) is a significant public health threat. Low-cost whole-genome sequencing, which is often used in surveillance programmes, provides an opportunity to assess AMR gene content in these genomes using in silico approaches. A variety of bioinformatic tools have been developed to identify these genomic elements. Most of those tools rely on reference databases of nucleotide or protein sequences and collections of models and rules for analysis. While the tools are critical for the identification of AMR genes, the databases themselves also provide significant utility for researchers, for applications ranging from sequence analysis to information about AMR phenotypes. Additionally, these databases can be evaluated by domain experts and others to ensure their accuracy. Here we describe how we curate the genes, point mutations and blast rules, and hidden Markov models used in NCBI’s AMRFinderPlus, along with the quality-control steps we take to ensure database quality. We also describe the web interfaces that display the full structure of the database and their newly developed cross-browser relationships. Then, using the Reference Gene Catalog as an example, we detail how the databases, rules and models are made publicly available, as well as how to access the software. In addition, as part of the Pathogen Detection system, we have analysed over 1 million publicly available genomes using AMRFinderPlus and its databases. We discuss how the computed analyses generated by those tools can be accessed through a web interface. Finally, we conclude with NCBI’s plans to make these databases accessible over the long-term.

5 citations

Journal ArticleDOI
06 Feb 2020
TL;DR: Complete genome sequences of four Enterococcus faecium isolates are presented, obtained from two patients with apparent vancomycin-resistant Enteritis faecia bacteremia; these isolates also carried two mutations known to be associated with daptomycin resistance.
Abstract: Here, we present complete genome sequences of four Enterococcus faecium isolates, obtained from two patients with apparent vancomycin-resistant Enterococcus faecium bacteremia; these isolates also carried two mutations known to be associated with daptomycin resistance. Sequences were obtained using de novo and hybrid assembly of Oxford Nanopore and Illumina sequence data.

5 citations


Cites methods from "CARD 2020: antibiotic resistome sur..."

  • ...Volume 9 Issue 6 e01380-19 mra.asm.org 2 commonly associated with daptomycin resistance (14) were identified against the CARD database....

    [...]

  • ...The Comprehensive Antibiotic Resistance Database (CARD) (13) was used to complement the detection of antibiotic resistance genes in the assembled genomes....

    [...]

References
More filters
Journal ArticleDOI
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

88,255 citations


"CARD 2020: antibiotic resistome sur..." refers background in this paper

  • ...The latter is described by CARD’s Model Ontology (MO, Supplementary Figure S1), which includes reference nucleotide and protein sequences, as well as additional search parameters including mutations conferring AMR (if applicable) and curated BLAST(P/N) (34,35) bit score cut-offs....

    [...]

Journal ArticleDOI
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

43,862 citations

Journal ArticleDOI
TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations


"CARD 2020: antibiotic resistome sur..." refers methods in this paper

  • ...Metagenomics analysis (i.e. RGI bwt) uses Bowtie2 (40) or BWA (41) mapping of sequencing reads to CARD’s PHM reference sequences only, while annotation of genomes or assembly contigs predicts resistome using four of CARD’s AMR detection models: PHM, PVM, RVM and POM (note: RGI currently only scans for nonsynonymous substitutions; not frameshifts, deletions or insertions)....

    [...]

  • ...RGI bwt) uses Bowtie2 (40) or BWA (41) mapping of sequencing reads to CARD’s PHM reference sequences only, while annotation of genomes or assembly contigs predicts resistome using four of CARD’s AMR detection models: PHM, PVM, RVM and POM (note: RGI currently only scans for nonsynonymous substitutions; not frameshifts, deletions or insertions)....

    [...]

Journal ArticleDOI
TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.
Abstract: The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

34,239 citations


"CARD 2020: antibiotic resistome sur..." refers methods in this paper

  • ...In 2017, we described the CARD*Shark text-mining algorithm (26) for computer-assisted literature triage, which we have expanded based on the new ARO Drug Class classification tags....

    [...]

Journal ArticleDOI
TL;DR: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.
Abstract: Sequence similarity searching is a very important bioinformatics task. While Basic Local Alignment Search Tool (BLAST) outperforms exact methods through its use of heuristics, the speed of the current BLAST software is suboptimal for very long queries or database sequences. There are also some shortcomings in the user-interface of the current command-line applications. We describe features and improvements of rewritten BLAST software and introduce new command-line applications. Long query sequences are broken into chunks for processing, in some cases leading to dramatically shorter run times. For long database sequences, it is possible to retrieve only the relevant parts of the sequence, reducing CPU time and memory usage for searches of short queries against databases of contigs or chromosomes. The program can now retrieve masking information for database sequences from the BLAST databases. A new modular software library can now access subject sequence data from arbitrary data sources. We introduce several new features, including strategy files that allow a user to save and reuse their favorite set of options. The strategy files can be uploaded to and downloaded from the NCBI BLAST web site. The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences. We have also improved the user interface of the command-line applications.

13,223 citations


"CARD 2020: antibiotic resistome sur..." refers background or methods in this paper

  • ...The website also includes a built-in BLAST instance for comparing sequences to CARD reference sequences and a web instance of RGI for resistome prediction with data visualization tools (https:// card.mcmaster.ca/analyze)....

    [...]

  • ...The RVM is functionally similar to the PVM, except it works for rRNA mutations and therefore uses a nucleotide reference sequence and a BLASTN bit score cut-off....

    [...]

  • ...Briefly, RGI algorithmically predicts AMR genes and mutations from submitted genomes using a combination of open reading frame prediction with Prodigal (38), sequence alignment with BLAST (35) or DIAMOND (39), and curated resistance mutations included with the AMR detection model....

    [...]

  • ...In the same time period, the CARD website hosted ∼45 000 BLAST analyses, ∼220 000 RGI analyses, ∼64 000 data file downloads, and ∼10,000 RGI software downloads....

    [...]

  • ...We had determined that the asymptotic nature of the BLAST expectation value (E) gave it very low discriminatory power between different -lactamase gene families (nearly 13 of CARD’s content), but that the linear nature of the BLAST bit score (S′) allowed this level of discrimination....

    [...]