scispace - formally typeset
Search or ask a question
Journal ArticleDOI

CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database

TL;DR: A new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes, able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants.
Abstract: The Comprehensive Antibiotic Resistance Database (CARD; https://card.mcmaster.ca) is a curated resource providing reference DNA and protein sequences, detection models and bioinformatics tools on the molecular basis of bacterial antimicrobial resistance (AMR). CARD focuses on providing high-quality reference data and molecular sequences within a controlled vocabulary, the Antibiotic Resistance Ontology (ARO), designed by the CARD biocuration team to integrate with software development efforts for resistome analysis and prediction, such as CARD's Resistance Gene Identifier (RGI) software. Since 2017, CARD has expanded through extensive curation of reference sequences, revision of the ontological structure, curation of over 500 new AMR detection models, development of a new classification paradigm and expansion of analytical tools. Most notably, a new Resistomes & Variants module provides analysis and statistical summary of in silico predicted resistance variants from 82 pathogens and over 100 000 genomes. By adding these resistance variants to CARD, we are able to summarize predicted resistance using the information included in CARD, identify trends in AMR mobility and determine previously undescribed and novel resistance variants. Here, we describe updates and recent expansions to CARD and its biocuration process, including new resources for community biocuration of AMR molecular reference data.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
08 Sep 2021
TL;DR: In this paper, 154 genomes from public databases and four new Chilean isolates were genomically compared through an in-silico approach to identify genomic divergence in genes associated with carbohydrate consumption and their possible adaptations to different human intestinal niches.
Abstract: Bifidobacterium longum subsp. longum is a prevalent group in the human gut microbiome. Its persistence in the intestinal microbial community suggests a close host-microbe relationship according to age. The subspecies adaptations are related to metabolic capabilities and genomic and functional diversity. In this study, 154 genomes from public databases and four new Chilean isolates were genomically compared through an in silico approach to identify genomic divergence in genes associated with carbohydrate consumption and their possible adaptations to different human intestinal niches. The pangenome of the subspecies was open, which correlates with its remarkable ability to colonize several niches. The new genomes homogenously clustered within subspecies longum, as observed in phylogenetic analysis. B. longum SC664 was different at the sequence level but not in its functions. COG analysis revealed that carbohydrate use is variable among longum subspecies. Glycosyl hydrolases participating in human milk oligosaccharide use were found in certain infant and adult genomes. Predictive genomic analysis revealed that B. longum M12 contained an HMO cluster associated with the use of fucosylated HMOs but only endowed with a GH95, being able to grow in 2-fucosyllactose as the sole carbon source. This study identifies novel genomes with distinct adaptations to HMOs and highlights the plasticity of B. longum subsp. longum to colonize the human gut microbiota.

6 citations

Journal ArticleDOI
TL;DR: Bacillus pumilus 64-1, a bacterial strain isolated from the marine sponge Plakina cyanorosea, exhibits antimicrobial activity against both pathogenic and drug-resistant Gram-positive and Gram-negative bacteria as discussed by the authors.
Abstract: Bacillus pumilus 64-1, a bacterial strain isolated from the marine sponge Plakina cyanorosea, which exhibits antimicrobial activity against both pathogenic and drug-resistant Gram-positive and Gram-negative bacteria. This study aimed to conduct an in-depth genomic analysis of this bioactive sponge-derived strain. The nearly complete genome of strain 64-1 consists of 3.6 Mbp (41.5% GC), which includes 3,705 coding sequences (CDS). An open pangenome was observed when limiting to the type strains of the B. pumilus group and aquatic-derived B. pumilus representatives. The genome appears to encode for at least 12 potential biosynthetic gene clusters (BGCs), including both types I and III polyketide synthases (PKS), non-ribosomal peptide synthetases (NRPS), and one NRPS-T1PKS hybrid, among others. In particular, bacilysin and other bacteriocin-coding genes were found and may be associated with the detected antimicrobial activity. Strain 64-1 also appears to possess a broad repertoire of genes encoding for plant cell wall-degrading carbohydrate-active enzymes (CAZymes). A myriad of genes which may be involved in various process required by the strain in its marine habitat, such as those encoding for osmoprotectory transport systems and the biosynthesis of compatible solutes were also present. Several heavy metal tolerance genes are also present, together with various mobile elements including a region encoding for a type III-B Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) region, four prophage segments and transposase elements. This is the first report on the genomic characterization of a cultivable bacterial member of the Plakina cyanorosea holobiont.

6 citations

Journal ArticleDOI
TL;DR: It is demonstrated that the CRISPR-MVLST might be used as a tool with high discriminatory power, comparable ease of use, ability of tracking the source of Salmonella strains along the food chain and indication of genetic features especially virulence genes.
Abstract: In public health field, high-resolution and efficient typing of the bacterial pathogen is essential, considering source-tracking and risk assessment are fundamental issues. Currently, there are no recommendations for applying molecular characterization methods for Salmonella along the food chain, and a systematic in-field investigation comparing subtyping methods in the context of routine surveillance was partially addressed. ABSTRACT High-resolution and efficient typing for the bacterial pathogen is essential for tracking the sources, detecting or diagnosing variants, and conducting a risk assessment. However, a systematic in-field investigation of Salmonella along the food chain has not been documented. This study assessed 12 typing methods, such as antimicrobial-resistance (AMR) gene profile typing, Core Genome Multilocus Sequence Typing (cgMLST), and CRISPR multi-virulence locus sequence typing (CRISPR-MVLST), to evaluate their effectiveness for use in routine monitoring of foodborne Salmonella transmission along the poultry production chain. During 2015-16, a total of 1,064 samples were collected from poultry production chain, starting from breeding farms and slaughterhouses to the markets of Zhejiang province in China. A total of 61 consecutive unique Salmonella isolates recovered from these samples were selected for genome sequencing and further comparative typing analysis. Traditional typing methods, including serotyping, AMR phenotype-based typing, as well as modern genotyping approaches, were evaluated and compared by their discrimination index (DI). The results showed that the serotyping method identified nine serovars. The gold standard cgMLST method indicated only 18 different types (DI = 0.8541), while the CRISPR-MVLST method detected 30 types (DI = 0.9628), with a higher DI than all examined medium-resolution WGS-based genotyping methods. We demonstrate that the CRISPR-MVLST might be used as a tool with high discriminatory power, comparable ease of use, ability of tracking the source of Salmonella strains along the food chain and indication of genetic features especially virulence genes. The available methods with different purposes and laboratory expertise were also illustrated to assist in rational implementation. IMPORTANCE In public health field, high-resolution and efficient typing of the bacterial pathogen is essential, considering source-tracking and risk assessment are fundamental issues. Currently, there are no recommendations for applying molecular characterization methods for Salmonella along the food chain, and a systematic in-field investigation comparing subtyping methods in the context of routine surveillance was partially addressed. Using 1,064 samples along a poultry production chain with a considerable level of Salmonella contamination, we collected representative isolates for genome sequencing and comparative analysis by using 12 typing techniques, particularly with whole-genome sequence (WGS) based methods and a recently invented CRISPR multi-virulence locus sequence typing (CRISPR-MVLST) method. CRISPR-MVLST is identified as a tool with higher discriminatory power compared with medium-resolution WGS-based typing methods, comparable ease of use and proven ability of tracking Salmonella isolates. Besides, we also offer recommendations for rational choice of subtyping methods to assist in better implementation schemes.

6 citations

Journal ArticleDOI
TL;DR: In this paper , a case-control comparative genomics study was conducted to identify genes associated with progression from colonization to infection in patients colonized by diverse Klebsiella species complex.
Abstract: Abstract Members of the Klebsiella pneumoniae species complex frequently colonize the gut and colonization is associated with subsequent infection. To identify genes associated with progression from colonization to infection, we undertook a case-control comparative genomics study. Concordant cases ( N = 85), where colonizing and invasive isolates were identical strain types, were matched to asymptomatically colonizing controls ( N = 160). Thirty-seven genes are associated with infection, 27 of which remain significant following adjustment for patient variables and bacterial phylogeny. Infection-associated genes are not previously characterized virulence factors, but instead a diverse group of stress resistance, regulatory and antibiotic resistance genes, despite careful adjustment for antibiotic exposure. Many genes are plasmid borne, and for some, the relationship with infection is mediated by gut dominance. Five genes were validated in a geographically-independent cohort of colonized patients. This study identifies several genes reproducibly associated with progression to infection in patients colonized by diverse Klebsiella .

6 citations

Journal ArticleDOI
TL;DR: The results indicated the widespread of EVs carrying ARGs and virulence genes in daily life indoor dust, provided new insights into the status of extracellular DNA, and raised risk concerns on their gene transfer potential.
Abstract: Extracellular vesicles (EVs) are newly recognized as important vectors for carrying and spreading antibiotic resistance genes (ARGs). However, the ARGs harbored by EVs in ambient environments and the transfer potential are still unclear. In this study, the prevalence of ARGs and mobile genetic elements (MGEs) in EVs and their microbial origins were studied in indoor dust from restaurants, kindergarten, dormitories, and vehicles. The amount of EVs ranged from 3.40 × 107 to 1.09 × 1011 particles/g dust. The length of EV-associated DNA fragments was between 21 bp and 9.7 kb. Metagenomic sequencing showed that a total of 241 antibiotic ARG subtypes encoding resistance to 16 common classes were detected in the EVs from all four fields. Multidrug, quinolone, and macrolide resistance genes were the dominant types. 15 ARG subtypes were exclusively carried and even enriched in EVs compared to the indoor microbiome. Moreover, several ARGs showed co-occurrence with MGEs. The EVs showed distinct taxonomic composition with their original dust microbiota. 30.23% of EV-associated DNA was predicted to originate from potential pathogens. Our results indicated the widespread of EVs carrying ARGs and virulence genes in daily life indoor dust, provided new insights into the status of extracellular DNA, and raised risk concerns on their gene transfer potential.

6 citations

References
More filters
Journal ArticleDOI
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

88,255 citations


"CARD 2020: antibiotic resistome sur..." refers background in this paper

  • ...The latter is described by CARD’s Model Ontology (MO, Supplementary Figure S1), which includes reference nucleotide and protein sequences, as well as additional search parameters including mutations conferring AMR (if applicable) and curated BLAST(P/N) (34,35) bit score cut-offs....

    [...]

Journal ArticleDOI
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

43,862 citations

Journal ArticleDOI
TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations


"CARD 2020: antibiotic resistome sur..." refers methods in this paper

  • ...Metagenomics analysis (i.e. RGI bwt) uses Bowtie2 (40) or BWA (41) mapping of sequencing reads to CARD’s PHM reference sequences only, while annotation of genomes or assembly contigs predicts resistome using four of CARD’s AMR detection models: PHM, PVM, RVM and POM (note: RGI currently only scans for nonsynonymous substitutions; not frameshifts, deletions or insertions)....

    [...]

  • ...RGI bwt) uses Bowtie2 (40) or BWA (41) mapping of sequencing reads to CARD’s PHM reference sequences only, while annotation of genomes or assembly contigs predicts resistome using four of CARD’s AMR detection models: PHM, PVM, RVM and POM (note: RGI currently only scans for nonsynonymous substitutions; not frameshifts, deletions or insertions)....

    [...]

Journal ArticleDOI
TL;DR: The goals of the PDB are described, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource are described.
Abstract: The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

34,239 citations


"CARD 2020: antibiotic resistome sur..." refers methods in this paper

  • ...In 2017, we described the CARD*Shark text-mining algorithm (26) for computer-assisted literature triage, which we have expanded based on the new ARO Drug Class classification tags....

    [...]

Journal ArticleDOI
TL;DR: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.
Abstract: Sequence similarity searching is a very important bioinformatics task. While Basic Local Alignment Search Tool (BLAST) outperforms exact methods through its use of heuristics, the speed of the current BLAST software is suboptimal for very long queries or database sequences. There are also some shortcomings in the user-interface of the current command-line applications. We describe features and improvements of rewritten BLAST software and introduce new command-line applications. Long query sequences are broken into chunks for processing, in some cases leading to dramatically shorter run times. For long database sequences, it is possible to retrieve only the relevant parts of the sequence, reducing CPU time and memory usage for searches of short queries against databases of contigs or chromosomes. The program can now retrieve masking information for database sequences from the BLAST databases. A new modular software library can now access subject sequence data from arbitrary data sources. We introduce several new features, including strategy files that allow a user to save and reuse their favorite set of options. The strategy files can be uploaded to and downloaded from the NCBI BLAST web site. The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences. We have also improved the user interface of the command-line applications.

13,223 citations


"CARD 2020: antibiotic resistome sur..." refers background or methods in this paper

  • ...The website also includes a built-in BLAST instance for comparing sequences to CARD reference sequences and a web instance of RGI for resistome prediction with data visualization tools (https:// card.mcmaster.ca/analyze)....

    [...]

  • ...The RVM is functionally similar to the PVM, except it works for rRNA mutations and therefore uses a nucleotide reference sequence and a BLASTN bit score cut-off....

    [...]

  • ...Briefly, RGI algorithmically predicts AMR genes and mutations from submitted genomes using a combination of open reading frame prediction with Prodigal (38), sequence alignment with BLAST (35) or DIAMOND (39), and curated resistance mutations included with the AMR detection model....

    [...]

  • ...In the same time period, the CARD website hosted ∼45 000 BLAST analyses, ∼220 000 RGI analyses, ∼64 000 data file downloads, and ∼10,000 RGI software downloads....

    [...]

  • ...We had determined that the asymptotic nature of the BLAST expectation value (E) gave it very low discriminatory power between different -lactamase gene families (nearly 13 of CARD’s content), but that the linear nature of the BLAST bit score (S′) allowed this level of discrimination....

    [...]