scispace - formally typeset
Search or ask a question
Journal ArticleDOI

CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database

TL;DR: A new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes, able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants.
Abstract: The Comprehensive Antibiotic Resistance Database (CARD; https://card.mcmaster.ca) is a curated resource providing reference DNA and protein sequences, detection models and bioinformatics tools on the molecular basis of bacterial antimicrobial resistance (AMR). CARD focuses on providing high-quality reference data and molecular sequences within a controlled vocabulary, the Antibiotic Resistance Ontology (ARO), designed by the CARD biocuration team to integrate with software development efforts for resistome analysis and prediction, such as CARD's Resistance Gene Identifier (RGI) software. Since 2017, CARD has expanded through extensive curation of reference sequences, revision of the ontological structure, curation of over 500 new AMR detection models, development of a new classification paradigm and expansion of analytical tools. Most notably, a new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes. By adding these resistance variants to CARD, we are able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants. Here, we describe updates and recent expansions to CARD and its biocuration process, including new resources for community biocuration of AMR molecular reference data.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: The results suggest that sufficient variation exists in the core non-AMR genes of a species for predicting AMR phenotypes, and small core gene models built from small sets of randomly-selected core genes after removing the AMR genes are suggested.
Abstract: A growing number of studies are using machine learning models to accurately predict antimicrobial resistance (AMR) phenotypes from bacterial sequence data. Although these studies are showing promise, the models are typically trained using features derived from comprehensive sets of AMR genes or whole genome sequences and may not be suitable for use when genomes are incomplete. In this study, we explore the possibility of predicting AMR phenotypes using incomplete genome sequence data. Models were built from small sets of randomly-selected core genes after removing the AMR genes. For Klebsiella pneumoniae, Mycobacterium tuberculosis, Salmonella enterica, and Staphylococcus aureus, we report that it is possible to classify susceptible and resistant phenotypes with average F1 scores ranging from 0.80-0.89 with as few as 100 conserved non-AMR genes, with very major error rates ranging from 0.11-0.23 and major error rates ranging from 0.10-0.20. Models built from core genes have predictive power in cases where the primary AMR mechanisms result from SNPs or horizontal gene transfer. By randomly sampling non-overlapping sets of core genes, we show that F1 scores and error rates are stable and have little variance between replicates. Although these small core gene models have lower accuracies and higher error rates than models built from the corresponding assembled genomes, the results suggest that sufficient variation exists in the core non-AMR genes of a species for predicting AMR phenotypes.

21 citations

Journal ArticleDOI
TL;DR: The soil-root continuum is identified as an interconnected sink through which certain ARGs and pathogens can flow from soil into the plant and be facilitated by a multiplicity of mobile genetic elements.

21 citations

Journal ArticleDOI
TL;DR: As in many other countries, poultry-associated strains were likely a major source of human infection but almost half of local disease cases were attributable to genotypes that are rare outside of Peru, suggesting an important role for host factors in the cryptic epidemiology of campylobacteriosis in LMICs.
Abstract: Campylobacter is the leading bacterial cause of gastroenteritis worldwide and its incidence is especially high in low- and middle-income countries (LMIC). Disease epidemiology in LMICs is different compared to high income countries like the USA or in Europe. Children in LMICs commonly have repeated and chronic infections even in the absence of symptoms, which can lead to deficits in early childhood development. In this study, we sequenced and characterized C. jejuni (n = 62) from a longitudinal cohort study of children under the age of 5 with and without diarrheal symptoms, and contextualized them within a global C. jejuni genome collection. Epidemiological differences in disease presentation were reflected in the genomes, specifically by the absence of some of the most common global disease-causing lineages. As in many other countries, poultry-associated strains were likely a major source of human infection but almost half of local disease cases (15 of 31) were attributable to genotypes that are rare outside of Peru. Asymptomatic infection was not limited to a single (or few) human adapted lineages but resulted from phylogenetically divergent strains suggesting an important role for host factors in the cryptic epidemiology of campylobacteriosis in LMICs.

21 citations

Journal ArticleDOI
TL;DR: Evidence for phage modulation of the gut microbiome is discussed, postulating that phages are pivotal contributors to the gut ecosystem dynamics.
Abstract: Phages, short for bacteriophages, are viruses that specifically infect bacteria and are the most abundant biological entities on earth found in every explored environment, from the deep sea to the Sahara Desert. Phages are abundant within the human biome and are gaining increasing recognition as potential modulators of the gut ecosystem. For example, they have been connected to gastrointestinal diseases and the treatment efficacy of Fecal Microbiota Transplant. The ability of phages to modulate the human gut microbiome has been attributed to the predation of bacteria or the promotion of bacterial survival by the transfer of genes that enhance bacterial fitness upon infection. In addition, phages have been shown to interact with the human immune system with variable outcomes. Despite the increasing evidence supporting the importance of phages in the gut ecosystem, the extent of their influence on the shape of the gut ecosystem is yet to be fully understood. Here, we discuss evidence for phage modulation of the gut microbiome, postulating that phages are pivotal contributors to the gut ecosystem dynamics. We therefore propose novel research questions to further elucidate the role(s) that they have within the human ecosystem and its impact on our health and well-being.

20 citations

Journal ArticleDOI
TL;DR: In this article, a comprehensive insight into the prevalence of mcr-carrying CRE from patients in Thailand was provided, highlighting the importance of strengthening official active surveillance efforts to detect, control, and prevent mCR-harboring CRE and the need for rational drug use in all sectors.
Abstract: Mobile colistin-resistant genes (mcr) have become an increasing public health concern. Since the first report of mcr-1 in Thailand in 2016, perspective surveillance was conducted to explore the genomic characteristics of clinical carbapenem-resistant Enterobacterales (CRE) isolates harboring mcr in 2016-2019. Thirteen (0.28%) out of 4,516 CRE isolates were found to carry mcr genes, including 69.2% (9/13) of E. coli and 30.8% (4/13) of K. pneumoniae isolates. Individual mcr-1.1 was detected in eight E. coli (61.5%) isolates, whereas the co-occurrence of mcr-1.1 and mcr-3.5 was seen in only one E. coli isolate (7.7%). No CRE were detected carrying mcr-2, mcr-4, or mcr-5 through to mcr-9. Analysis of plasmid replicon types carrying mcr revealed that IncX4 was the most common (61.5%; 8/13), followed by IncI2 (15.4%; 2/13). The minimum inhibitory concentration values for colistin were in the range of 4-16 μg/ml for all CRE isolates harboring mcr, suggesting they have 100% colistin resistance. Clermont phylotyping of nine mcr-harboring carbapenem-resistant E. coli isolates demonstrated phylogroup C was predominant in ST410. In contrast, ST336 belonged to CC17, and the KL type 25 was predominant in carbapenem-resistant K. pneumoniae isolates. This report provides a comprehensive insight into the prevalence of mcr-carrying CRE from patients in Thailand. The information highlights the importance of strengthening official active surveillance efforts to detect, control, and prevent mcr-harboring CRE and the need for rational drug use in all sectors.

20 citations

References
More filters
Journal ArticleDOI
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

88,255 citations


"CARD 2020: antibiotic resistome sur..." refers background in this paper

  • ...The latter is described by CARD’s Model Ontology (MO, Supplementary Figure S1), which includes reference nucleotide and protein sequences, as well as additional search parameters including mutations conferring AMR (if applicable) and curated BLAST(P/N) (34,35) bit score cut-offs....

    [...]

Journal ArticleDOI
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

43,862 citations

Journal ArticleDOI
TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations


"CARD 2020: antibiotic resistome sur..." refers methods in this paper

  • ...Metagenomics analysis (i.e. RGI bwt) uses Bowtie2 (40) or BWA (41) mapping of sequencing reads to CARD’s PHM reference sequences only, while annotation of genomes or assembly contigs predicts resistome using four of CARD’s AMR detection models: PHM, PVM, RVM and POM (note: RGI currently only scans for nonsynonymous substitutions; not frameshifts, deletions or insertions)....

    [...]

  • ...RGI bwt) uses Bowtie2 (40) or BWA (41) mapping of sequencing reads to CARD’s PHM reference sequences only, while annotation of genomes or assembly contigs predicts resistome using four of CARD’s AMR detection models: PHM, PVM, RVM and POM (note: RGI currently only scans for nonsynonymous substitutions; not frameshifts, deletions or insertions)....

    [...]

Journal ArticleDOI
TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.
Abstract: The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

34,239 citations


"CARD 2020: antibiotic resistome sur..." refers methods in this paper

  • ...In 2017, we described the CARD*Shark text-mining algorithm (26) for computer-assisted literature triage, which we have expanded based on the new ARO Drug Class classification tags....

    [...]

Journal ArticleDOI
TL;DR: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.
Abstract: Sequence similarity searching is a very important bioinformatics task. While Basic Local Alignment Search Tool (BLAST) outperforms exact methods through its use of heuristics, the speed of the current BLAST software is suboptimal for very long queries or database sequences. There are also some shortcomings in the user-interface of the current command-line applications. We describe features and improvements of rewritten BLAST software and introduce new command-line applications. Long query sequences are broken into chunks for processing, in some cases leading to dramatically shorter run times. For long database sequences, it is possible to retrieve only the relevant parts of the sequence, reducing CPU time and memory usage for searches of short queries against databases of contigs or chromosomes. The program can now retrieve masking information for database sequences from the BLAST databases. A new modular software library can now access subject sequence data from arbitrary data sources. We introduce several new features, including strategy files that allow a user to save and reuse their favorite set of options. The strategy files can be uploaded to and downloaded from the NCBI BLAST web site. The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences. We have also improved the user interface of the command-line applications.

13,223 citations


"CARD 2020: antibiotic resistome sur..." refers background or methods in this paper

  • ...The website also includes a built-in BLAST instance for comparing sequences to CARD reference sequences and a web instance of RGI for resistome prediction with data visualization tools (https:// card.mcmaster.ca/analyze)....

    [...]

  • ...The RVM is functionally similar to the PVM, except it works for rRNA mutations and therefore uses a nucleotide reference sequence and a BLASTN bit score cut-off....

    [...]

  • ...Briefly, RGI algorithmically predicts AMR genes and mutations from submitted genomes using a combination of open reading frame prediction with Prodigal (38), sequence alignment with BLAST (35) or DIAMOND (39), and curated resistance mutations included with the AMR detection model....

    [...]

  • ...In the same time period, the CARD website hosted ∼45 000 BLAST analyses, ∼220 000 RGI analyses, ∼64 000 data file downloads, and ∼10,000 RGI software downloads....

    [...]

  • ...We had determined that the asymptotic nature of the BLAST expectation value (E) gave it very low discriminatory power between different -lactamase gene families (nearly 13 of CARD’s content), but that the linear nature of the BLAST bit score (S′) allowed this level of discrimination....

    [...]