scispace - formally typeset
Search or ask a question
Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match with at least at 80% nucleotide identity all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S . Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: An integrated, high-resolution analysis of both chromosome and plasmid sequences using Klebsiella pneumoniae isolates sampled during a European survey revealed three contrasting modes of dissemination used by carbapenemase genes, which confer resistance to last-linecarbapenems.
Abstract: Molecular and genomic surveillance systems for bacterial pathogens currently rely on tracking clonally evolving lineages. By contrast, plasmids are usually excluded or analyzed with low-resolution techniques, despite being the primary vectors of antibiotic resistance genes across many key pathogens. Here, we used a combination of long- and short-read sequence data of Klebsiella pneumoniae isolates (n = 1,717) from a European survey to perform an integrated, continent-wide study of chromosomal and plasmid diversity. This revealed three contrasting modes of dissemination used by carbapenemase genes, which confer resistance to last-line carbapenems. First, blaOXA-48-like genes have spread primarily via the single epidemic pOXA-48–like plasmid, which emerged recently in clinical settings and spread rapidly to numerous lineages. Second, blaVIM and blaNDM genes have spread via transient associations of many diverse plasmids with numerous lineages. Third, blaKPC genes have transmitted predominantly by stable association with one successful clonal lineage (ST258/512) yet have been mobilized among diverse plasmids within this lineage. We show that these plasmids, which include pKpQIL-like and IncX3 plasmids, have a long association (and are coevolving) with the lineage, although frequent recombination and rearrangement events between them have led to a complex array of mosaic plasmids carrying blaKPC. Taken altogether, these results reveal the diverse trajectories of antibiotic resistance genes in clinical settings, summarized as using one plasmid/multiple lineages, multiple plasmids/multiple lineages, and multiple plasmids/one lineage. Our study provides a framework for the much needed incorporation of plasmid data into genomic surveillance systems, an essential step toward a more comprehensive understanding of resistance spread.

64 citations


Cites background from "In Silico Detection and Typing of P..."

  • ...1 (41) with the PlasmidFinder database (13)....

    [...]

  • ...The assemblies contained one to eight plasmid replicons (median, four), which are sequences used for defining plasmid incompatibility (Inc) groups (13)....

    [...]

Journal ArticleDOI
TL;DR: BTyper, a computational tool that employs a combination of virulence gene-based typing, multilocus sequence typing (MLST), panC clade typing, and rpoB allelic typing to rapidly classify and assess the virulence potential of any isolate using its nucleotide sequencing data, is developed.
Abstract: The Bacillus cereus group comprises nine species, several of which are pathogenic. Differentiating between isolates that may cause disease and those that do not is a matter of public health and economic importance, but it can be particularly challenging due to the high genomic similarity within the group. To this end, we have developed BTyper, a computational tool that employs a combination of (i) virulence gene-based typing, (ii) multilocus sequence typing (MLST), (iii) panC clade typing, and (iv) rpoB allelic typing to rapidly classify B. cereus group isolates using nucleotide sequencing data. BTyper was applied to a set of 662 B. cereus group genome assemblies to (i) identify anthrax-associated genes in non-B. anthracis members of the B. cereus group, and (ii) identify assemblies from B. cereus group strains with emetic potential. With BTyper, the anthrax toxin genes cya, lef, and pagA were detected in 8 genomes classified by the NCBI as B. cereus that clustered into two distinct groups using k-medoids clustering, while either the B. anthracis poly-γ-d-glutamate capsule biosynthesis genes capABCDE or the hyaluronic acid capsule hasA gene was detected in an additional 16 assemblies classified as either B. cereus or Bacillus thuringiensis isolated from clinical, environmental, and food sources. The emetic toxin genes cesABCD were detected in 24 assemblies belonging to panC clades III and VI that had been isolated from food, clinical, and environmental settings. The command line version of BTyper is available at https://github.com/lmc297/BTyper. In addition, BMiner, a companion application for analyzing multiple BTyper output files in aggregate, can be found at https://github.com/lmc297/BMiner. IMPORTANCE Bacillus cereus is a foodborne pathogen that is estimated to cause tens of thousands of illnesses each year in the United States alone. Even with molecular methods, it can be difficult to distinguish nonpathogenic B. cereus group isolates from their pathogenic counterparts, including the human pathogen Bacillus anthracis, which is responsible for anthrax, as well as the insect pathogen B. thuringiensis. By using the variety of typing schemes employed by BTyper, users can rapidly classify, characterize, and assess the virulence potential of any isolate using its nucleotide sequencing data.

64 citations


Cites methods from "In Silico Detection and Typing of P..."

  • ...seconds using assembled genomes (76, 77); (ii) scalability, with the ability to provide...

    [...]

  • ...in isolates using Illumina reads or nucleotide assemblies (77); and VirulenceFinder,...

    [...]

Journal ArticleDOI
TL;DR: The authors combine in silico analysis of 435 genomes of ruminal bacteria and archaea with transcriptomics and in vitro assays to investigate the distribution, expression and mobility of antibiotic resistance genes within the ruminal microbiota, finding that the tet(W) gene is under positive selective pressure.
Abstract: Infections caused by multidrug resistant bacteria represent a therapeutic challenge both in clinical settings and in livestock production, but the prevalence of antibiotic resistance genes among the species of bacteria that colonize the gastrointestinal tract of ruminants is not well characterized. Here, we investigate the resistome of 435 ruminal microbial genomes in silico and confirm representative phenotypes in vitro. We find a high abundance of genes encoding tetracycline resistance and evidence that the tet(W) gene is under positive selective pressure. Our findings reveal that tet(W) is located in a novel integrative and conjugative element in several ruminal bacterial genomes. Analyses of rumen microbial metatranscriptomes confirm the expression of the most abundant antibiotic resistance genes. Our data provide insight into antibiotic resistange gene profiles of the main species of ruminal bacteria and reveal the potential role of mobile genetic elements in shaping the resistome of the rumen microbiome, with implications for human and animal health.

64 citations

Journal ArticleDOI
TL;DR: WGS analysis demonstrated complex transmission dynamics within the burn center at levels of the strain and/or plasmid in association with a transposon, highlighting the versatility of KPC-producing Enterobacteriaceae in their ability to utilize multiple modes to resistance gene propagation.
Abstract: Klebsiella pneumoniae carbapenemase (KPC)-producing Enterobacter cloacae has been recently recognized in the United States. Whole-genome sequencing (WGS) has become a useful tool for analysis of outbreaks and for determining transmission networks of multidrug-resistant organisms in health care settings, including carbapenem-resistant Enterobacteriaceae (CRE). We experienced a prolonged outbreak of CRE E. cloacae and K. pneumoniae over a 3-year period at a large academic burn center despite rigorous infection control measures. To understand the molecular mechanisms that sustained this outbreak, we investigated the CRE outbreak isolates by using WGS. Twenty-two clinical isolates of CRE, including E. cloacae (n = 15) and K. pneumoniae (n = 7), were sequenced and analyzed genetically. WGS revealed that this outbreak, which seemed epidemiologically unlinked, was in fact genetically linked over a prolonged period. Multiple mechanisms were found to account for the ongoing outbreak of KPC-3-producing E. cloacae and K. pneumoniae This outbreak was primarily maintained by a clonal expansion of E. cloacae sequence type 114 (ST114) with distribution of multiple resistance determinants. Plasmid and transposon analyses suggested that the majority of blaKPC-3 was transmitted via an identical Tn4401b element on part of a common plasmid. WGS analysis demonstrated complex transmission dynamics within the burn center at levels of the strain and/or plasmid in association with a transposon, highlighting the versatility of KPC-producing Enterobacteriaceae in their ability to utilize multiple modes to resistance gene propagation.

63 citations


Cites background from "In Silico Detection and Typing of P..."

  • ...3 (51), with a threshold of 95% identity on de novo assemblies....

    [...]

Journal ArticleDOI
TL;DR: The overlap of multi-drug resistant bacteria and diversity of the genotypes carrying CTX-M-15 in the community and hospitals requires an overall approach that addresses social behaviour and activity, rationalization of the antibiotic stewardship policy and a deeper understanding of the ecological factors that lead to persistence and spread of such alleles.
Abstract: Extended-spectrum beta-lactamase (ESBL)-producing Enterobacteriaceae commonly cause infections worldwide. BlaCTX-M-15 has been commonly detected in hospital isolates in Mwanza, Tanzania. Little is known regarding the faecal carriage of ESBL isolates and blaCTX-M-15 allele among humans in the community in developing countries. A cross-sectional study involving 334 humans from the community settings in Mwanza City was conducted between June and September 2014. Stool specimens were collected and processed to detect ESBL producing enterobacteriaceae. ESBL isolates were confirmed using disc approximation method, commercial ESBL plates and VITEK-2 system. A polymerase chain reaction and sequencing based allele typing for CTX-M ESBL genes was performed to 42 confirmed ESBL isolates followed by whole genome sequence of 25 randomly selected isolates to detect phylogenetic groups, sequence types plasmid replicon types. Of 334 humans investigated, 55 (16.5 %) were found to carry ESBL-producing bacteria. Age, history of antibiotic use and history of admission were independent factors found to predict ESBL-carriage. The carriage rate of ESBL-producing Escherichia coli was significantly higher than that of Klebsiella pneumoniae (15.1 % vs. 3.8 %, p = 0.026). Of 42 ESBL isolates, 37 (88.1 %) were found to carry the blaCTX-M-15 allele. Other transferrable resistance genes were aac(6’)Ib-cr, aac(3)-IIa, aac(3)-IId, aadA1, aadA5, strA, strB and qnrS1. Eight multi-locus sequence types (ST) were detected in 25 E. coli isolates subjected to genome sequencing. ST-131 was detected in 6 (24 %), ST-38 in 5 (20 %) and 5 (20 %) clonal complex − 10(ST-617, ST-44) of isolates. The pathogenic phylogenetic groups D and B2 were detected in 8/25 (32 %) and 6/25 (24 %) of isolates respectively. BlaCTX-M-15 was found to be located in multiple IncY and IncF plasmids while in 13/25(52 %) of cases it was chromosomally located. The overlap of multi-drug resistant bacteria and diversity of the genotypes carrying CTX-M-15 in the community and hospitals requires an overall approach that addresses social behaviour and activity, rationalization of the antibiotic stewardship policy and a deeper understanding of the ecological factors that lead to persistence and spread of such alleles.

62 citations

References
More filters
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....

    [...]

Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....

    [...]

Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmid in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully FIG 1 Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family....

    [...]

  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....

    [...]

  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....

    [...]

  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....

    [...]

  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....

    [...]

Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches LIMS functionality of the software enables linkage to and organisation of laboratory samples The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus The BIGSDB source code and documentation are available at http://pubmlstorg/software/database/bigsdb/ Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....

    [...]