Journal ArticleDOI

CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database

TL;DR: A new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes, able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants.
Abstract: The Comprehensive Antibiotic Resistance Database (CARD; https://card.mcmaster.ca) is a curated resource providing reference DNA and protein sequences, detection models and bioinformatics tools on the molecular basis of bacterial antimicrobial resistance (AMR). CARD focuses on providing high-quality reference data and molecular sequences within a controlled vocabulary, the Antibiotic Resistance Ontology (ARO), designed by the CARD biocuration team to integrate with software development efforts for resistome analysis and prediction, such as CARD's Resistance Gene Identifier (RGI) software. Since 2017, CARD has expanded through extensive curation of reference sequences, revision of the ontological structure, curation of over 500 new AMR detection models, development of a new classification paradigm and expansion of analytical tools. Most notably, a new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes. By adding these resistance variants to CARD, we are able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants. Here, we describe updates and recent expansions to CARD and its biocuration process, including new resources for community biocuration of AMR molecular reference data.


Citations
Journal ArticleDOI
TL;DR: In this paper, the authors collected 29 surface water and 29 sediment samples in the Huangshui River on the Qinghai-Tibet Plateau during the wet and dry seasons, and 11 water samples from wastewater treatment plants and wetlands along the river.

5 citations

Journal ArticleDOI
TL;DR: In this article, the genomes, taxonomy, and phylogenetic relationships with respect to other Bacteroides fragilis genomes of two resistant B. fragilis strains (CNM20180471 and CNM20200206) were examined.
Abstract: Background: Bacteroides fragilis shows high antimicrobial resistance (AMR) rates and possesses numerous AMR mechanisms. Its carbapenem-resistant strains (metallo-β-lactamase cfiA-positive) appear as an emergent, evolving clade. Methods: This work examines the genomes, taxonomy, and phylogenetic relationships with respect to other B. fragilis genomes of two B. fragilis strains (CNM20180471 and CNM20200206) resistant to meropenem+EDTA and other antimicrobial agents. Results: Both strains possessed cfiA genes (cfiA14b and the new cfiA28), along with other AMR mechanisms. The presence of other efflux-pump genes, mexAB/mexJK/mexXY-oprM, acrEF/mdtEF-tolC, and especially cusR, which reduces the entry of carbapenem via the repression of porin OprD, may be related to meropenem–EDTA resistance. None of the detected insertion sequences were located upstream of cfiA. The genomes of these and other B. fragilis strains that clustered together in phylogenetic analyses did not meet the condition of >95% average nucleotide/amino acid identity, or >70% in silico genome-to-genome hybridization similarity, to be deemed members of the same species, although <1% difference in the genomic G+C content was seen with respect to the reference genome B. fragilis NCTC 9343T. Conclusions: Carbapenem-resistant strains may be considered a distinct clonal entity, and their surveillance is recommended given the ease with which they appear to acquire AMR.

5 citations

Posted ContentDOI
05 Aug 2022-bioRxiv
TL;DR: PlaSquid, a dockerized tool developed in Nextflow that expands plasmid detection and improves replicon typing and mobility groups classification schemes, outperforming previously available methods in both precision and sensitivity is presented.
Abstract: Plasmids are mobile genetic elements important for bacterial adaptation. The study of plasmids from sequencing data is challenging because short reads produce fragmented assemblies, requiring subsequent discrimination between chromosome and plasmid sequences. Although circularized assemblies are now possible using long-read data, there is still a need to differentiate plasmids from other circular elements. Here, we present plaSquid, a dockerized tool developed in Nextflow that expands plasmid detection and improves replicon typing and mobility groups classification schemes, outperforming previously available methods in both precision and sensitivity. When applied to ∼10.5 million metagenomic contigs, plaSquid revealed a 2.7-fold increase in plasmid phylogenetic diversity. Also, we used plaSquid to uncover a significant role of plasmids in the widespread distribution of clinically-relevant antimicrobial resistance genes in the built environment, from cities to spacecraft. Together, we present an improved approach to study plasmid biology from fragmented or circularized genomic and metagenomic assemblies.

5 citations

Journal ArticleDOI
TL;DR: The resistome-associated mobilome in 345 publicly available Pasteurellaceae genomes is investigated, finding that MGEs are comparable and dispersed across species and that they also co-occur in genomes, contributing to the family’s ecology via gene transfer.
Abstract: Mobile genetic elements (MGEs) and antimicrobial resistance (AMR) drive important ecological relationships in microbial communities and pathogen-host interaction. In this study, we investigated the resistome-associated mobilome in 345 publicly available Pasteurellaceae genomes, a large family of Gram-negative bacteria including major human and animal pathogens. We generated a comprehensive dataset of the mobilome integrated into genomes, including 10,820 insertion sequences, 2,939 prophages, and 43 integrative and conjugative elements. Also, we assessed plasmid sequences of Pasteurellaceae. Our findings greatly expand the diversity of MGEs for the family, including a description of novel elements. We discovered that MGEs are comparable and dispersed across species and that they also co-occur in genomes, contributing to the family’s ecology via gene transfer. In addition, we investigated the impact of these elements in the dissemination and shaping of AMR genes. A total of 55 different AMR genes were mapped to 721 locations in the dataset. MGEs are linked with 77.6% of AMR genes discovered, indicating their important involvement in the acquisition and transmission of such genes. This study provides an uncharted view of the Pasteurellaceae by demonstrating the global distribution of resistance genes linked with MGEs.

5 citations

Journal ArticleDOI
TL;DR: The pangenome of C. baratii was open, indicating it comprises genetically diverse organisms, and the genomic analysis indicated significant horizontal gene transfer (HGT) events as defined by the presence of prophage genomes.
Abstract: Clostridium baratii strains are rare opportunistic pathogens associated with botulism intoxication. They have been isolated from foods and soil, and can be carried asymptomatically or cause botulism outbreaks. C. baratii is not taxonomically related to Clostridium botulinum, but some strains are equipped with the BoNT/F7 cluster. Despite their relationship with disease, our knowledge of their genomic features and phylogenetic characteristics is limited. We analyzed the pangenome of C. baratii to understand the diversity and genomic features of this species. We compared existing genomes in public databases, metagenomes, and one newly sequenced strain isolated from an asymptomatic subject. The pangenome was open, indicating it comprises genetically diverse organisms. The core genome contained 28.49% of the total genes of the pangenome. Profiling virulence factors confirmed the presence of phospholipase C in some strains, a toxin capable of disrupting eukaryotic cell membranes. Furthermore, the genomic analysis indicated significant horizontal gene transfer (HGT) events as defined by the presence of prophage genomes. Seven strains were equipped with the BoNT/F7 cluster. The active site was conserved in all strains, and a missing 7-aa region was identified upstream of the active site in C. baratii genomes. This analysis could be important to advance our knowledge regarding opportunistic clostridia and better understand their contribution to disease.

5 citations

References
Journal ArticleDOI
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

88,255 citations


"CARD 2020: antibiotic resistome sur..." refers background in this paper

  • ...The latter is described by CARD’s Model Ontology (MO, Supplementary Figure S1), which includes reference nucleotide and protein sequences, as well as additional search parameters including mutations conferring AMR (if applicable) and curated BLAST(P/N) (34,35) bit score cut-offs....

    [...]
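The excerpt above describes a detection model as reference sequences plus search parameters (curated mutations and a bit score cut-off). A minimal Python sketch of that idea follows. The class name, field names, and example values are hypothetical and chosen for illustration; they are not CARD's actual schema or RGI's code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DetectionModel:
    """Hypothetical stand-in for one curated AMR detection model."""
    name: str                # label for the resistance determinant
    reference_sequence: str  # curated protein or nucleotide reference
    bitscore_cutoff: float   # curated BLAST bit score threshold
    mutations: tuple = ()    # curated resistance mutations, if applicable


def passes_cutoff(model: DetectionModel, hit_bitscore: float) -> bool:
    """Return True when an alignment's bit score meets the curated cut-off."""
    return hit_bitscore >= model.bitscore_cutoff


# Made-up example values, not taken from CARD.
example = DetectionModel(
    name="example determinant",
    reference_sequence="MKSAL",
    bitscore_cutoff=500.0,
    mutations=("S3L",),
)
```

Calling `passes_cutoff(example, 512.0)` accepts the hit, while a score just under the cut-off is rejected. Keeping the threshold on the model rather than in the search code reflects the per-model curation that the excerpt describes.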

Journal ArticleDOI
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net

43,862 citations

Journal ArticleDOI
TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations


"CARD 2020: antibiotic resistome sur..." refers methods in this paper

  • ...Metagenomics analysis (i.e. RGI bwt) uses Bowtie2 (40) or BWA (41) mapping of sequencing reads to CARD’s PHM reference sequences only, while annotation of genomes or assembly contigs predicts resistome using four of CARD’s AMR detection models: PHM, PVM, RVM and POM (note: RGI currently only scans for nonsynonymous substitutions; not frameshifts, deletions or insertions)....


    [...]
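The excerpt above notes that RGI scans only for nonsynonymous substitutions, not frameshifts, deletions or insertions. The following Python sketch illustrates what such a substitution-only check can look like. It is an assumption-laden illustration, not RGI's implementation: the function names are hypothetical, the label format (`S3L` for wild-type residue, 1-based position, variant residue) is assumed, and the sequences are made up. It requires ungapped, equal-length protein sequences, which mirrors the stated limitation.

```python
import re


def parse_substitution(label):
    """Split a label such as 'S3L' into (wild type, 1-based position, variant)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", label)
    if match is None:
        raise ValueError(f"not a single-residue substitution: {label!r}")
    wild_type, position, variant = match.groups()
    return wild_type, int(position), variant


def find_curated_substitutions(reference, query, curated):
    """Return the curated substitutions that are present in the query protein.

    Assumes the two sequences align without gaps, so indels and
    frameshifts are out of scope for this sketch.
    """
    if len(reference) != len(query):
        raise ValueError("this sketch only handles ungapped, equal-length sequences")
    found = []
    for label in curated:
        wild_type, position, variant = parse_substitution(label)
        index = position - 1  # convert 1-based position to 0-based index
        if index >= len(reference) or reference[index] != wild_type:
            raise ValueError(f"{label} does not match the reference sequence")
        if query[index] == variant:
            found.append(label)
    return found


# Made-up sequences: position 3 changes from S to L, position 4 is unchanged.
hits = find_curated_substitutions("MKSAL", "MKLAL", ["S3L", "A4T"])
```

Here `hits` contains only `S3L`, because the query carries L at position 3 but keeps the wild-type A at position 4.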

Journal ArticleDOI
TL;DR: This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and plans for the future development of the resource.
Abstract: The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

34,239 citations


"CARD 2020: antibiotic resistome sur..." refers methods in this paper

  • ...In 2017, we described the CARD*Shark text-mining algorithm (26) for computer-assisted literature triage, which we have expanded based on the new ARO Drug Class classification tags....

    [...]

Journal ArticleDOI
TL;DR: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.
Abstract: Sequence similarity searching is a very important bioinformatics task. While Basic Local Alignment Search Tool (BLAST) outperforms exact methods through its use of heuristics, the speed of the current BLAST software is suboptimal for very long queries or database sequences. There are also some shortcomings in the user-interface of the current command-line applications. We describe features and improvements of rewritten BLAST software and introduce new command-line applications. Long query sequences are broken into chunks for processing, in some cases leading to dramatically shorter run times. For long database sequences, it is possible to retrieve only the relevant parts of the sequence, reducing CPU time and memory usage for searches of short queries against databases of contigs or chromosomes. The program can now retrieve masking information for database sequences from the BLAST databases. A new modular software library can now access subject sequence data from arbitrary data sources. We introduce several new features, including strategy files that allow a user to save and reuse their favorite set of options. The strategy files can be uploaded to and downloaded from the NCBI BLAST web site. The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences. We have also improved the user interface of the command-line applications.

13,223 citations


"CARD 2020: antibiotic resistome sur..." refers background or methods in this paper

  • ...The website also includes a built-in BLAST instance for comparing sequences to CARD reference sequences and a web instance of RGI for resistome prediction with data visualization tools (https://card.mcmaster.ca/analyze)....

    [...]

  • ...The RVM is functionally similar to the PVM, except it works for rRNA mutations and therefore uses a nucleotide reference sequence and a BLASTN bit score cut-off....

    [...]

  • ...Briefly, RGI algorithmically predicts AMR genes and mutations from submitted genomes using a combination of open reading frame prediction with Prodigal (38), sequence alignment with BLAST (35) or DIAMOND (39), and curated resistance mutations included with the AMR detection model....

    [...]

  • ...In the same time period, the CARD website hosted ∼45 000 BLAST analyses, ∼220 000 RGI analyses, ∼64 000 data file downloads, and ∼10 000 RGI software downloads....

    [...]

  • ...We had determined that the asymptotic nature of the BLAST expectation value (E) gave it very low discriminatory power between different β-lactamase gene families (nearly 1/3 of CARD’s content), but that the linear nature of the BLAST bit score (S′) allowed this level of discrimination....

    [...]
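The last excerpt above contrasts the BLAST expectation value (E) with the bit score (S′). A small Python sketch can make the contrast concrete. It uses the standard relation E = m · n · 2^(−S′), where m and n are the query and database lengths. The function name and the example lengths are assumptions for illustration (BLAST itself uses effective lengths), and the bit scores are made up rather than taken from CARD.

```python
def expect_value(bit_score, query_length, database_length):
    """Approximate BLAST E-value from a bit score: E = m * n * 2**(-S')."""
    return query_length * database_length * 2.0 ** (-bit_score)


# Hypothetical search space: a 300-residue query against 10 million residues.
QUERY_LENGTH = 300
DATABASE_LENGTH = 10_000_000

# Two made-up strong hits whose bit scores differ by 100.
strong_hit = expect_value(500, QUERY_LENGTH, DATABASE_LENGTH)
stronger_hit = expect_value(600, QUERY_LENGTH, DATABASE_LENGTH)

# A very high bit score underflows double precision entirely.
saturated_hit = expect_value(1100, QUERY_LENGTH, DATABASE_LENGTH)
```

Both `strong_hit` and `stronger_hit` are vanishingly close to zero, and `saturated_hit` is exactly `0.0` in floating point, so E stops separating strong hits from one another. The bit scores themselves (500, 600, 1100) stay well separated on a linear scale, which is the property the excerpt credits with allowing discrimination between closely related gene families.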