scispace - formally typeset
Search or ask a question
Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match with at least at 80% nucleotide identity all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S . Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
31 Oct 2017-Mbio
TL;DR: The data suggest that susceptibility to phages and antibiotics are evolving largely independently, and unlike in experiments with lab strains, negative associations between antibiotic resistance phenotypes in nature are rare, relevant for treatment scenarios where bacteria encounter multiple antibacterial agents.
Abstract: The spread of antibiotic resistance is driving interest in new approaches to control bacterial pathogens. This includes applying multiple antibiotics strategically, using bacteriophages against antibiotic-resistant bacteria, and combining both types of antibacterial agents. All these approaches rely on or are impacted by associations among resistance phenotypes (where bacteria resistant to one antibacterial agent are also relatively susceptible or resistant to others). Experiments with laboratory strains have shown strong associations between some resistance phenotypes, but we lack a quantitative understanding of associations among antibiotic and phage resistance phenotypes in natural and clinical populations. To address this, we measured resistance to various antibiotics and bacteriophages for 94 natural and clinical Escherichia coli isolates. We found several positive associations between resistance phenotypes across isolates. Associations were on average stronger for antibacterial agents of the same type (antibiotic-antibiotic or phage-phage) than different types (antibiotic-phage). Plasmid profiles and genetic knockouts suggested that such associations can result from both colocalization of resistance genes and pleiotropic effects of individual resistance mechanisms, including one case of antibiotic-phage cross-resistance. Antibiotic resistance was predicted by core genome phylogeny and plasmid profile, but phage resistance was predicted only by core genome phylogeny. Finally, we used observed associations to predict genes involved in a previously uncharacterized phage resistance mechanism, which we verified using experimental evolution. Our data suggest that susceptibility to phages and antibiotics are evolving largely independently, and unlike in experiments with lab strains, negative associations between antibiotic resistance phenotypes in nature are rare. This is relevant for treatment scenarios where bacteria encounter multiple antibacterial agents.IMPORTANCE Rising antibiotic resistance is making it harder to treat bacterial infections. Whether resistance to a given antibiotic spreads or declines is influenced by whether it is associated with altered susceptibility to other antibiotics or other stressors that bacteria encounter in nature, such as bacteriophages (viruses that infect bacteria). We used natural and clinical isolates of Escherichia coli, an abundant species and key pathogen, to characterize associations among resistance phenotypes to various antibiotics and bacteriophages. We found associations between some resistance phenotypes, and in contrast to past work with laboratory strains, they were exclusively positive. Analysis of bacterial genome sequences and horizontally transferred genetic elements (plasmids) helped to explain this, as well as our finding that there was no overall association between antibiotic resistance and bacteriophage resistance profiles across isolates. This improves our understanding of resistance evolution in nature, potentially informing new rational therapies that combine different antibacterials, including bacteriophages.

37 citations


Cites methods from "In Silico Detection and Typing of P..."

  • ...We used PlasmidFinder (13, 14) on the genome assemblies for each isolate (with an identity threshold of 75%)....

    [...]

Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper performed an in-silico analysis of 2608 plasmids derived from 814 completely sequenced K. pneumoniae (hv-CRKP) strains.
Abstract: Klebsiella pneumoniae, as a global priority pathogen, is well known for its capability of acquiring mobile genetic elements that carry resistance and/or virulence genes. Its virulence plasmid, previously deemed nonconjugative and restricted within hypervirulent K. pneumoniae (hvKP), has disseminated into classic K. pneumoniae (cKP), particularly carbapenem-resistant K. pneumoniae (CRKP), which poses alarming challenges to public health. However, the mechanism underlying its transfer from hvKP to CRKP is unclear. A total of 28 sequence type (ST) 11 bloodstream infection-causing CRKP strains were collected from Ruijin Hospital in Shanghai, China, and used as recipients in conjugation assays. Transconjugants obtained from conjugation assays were confirmed by XbaI and S1 nuclease pulsed-field gel electrophoresis, PCR detection and/or whole-genome sequencing. The plasmid stability of the transconjugants was evaluated by serial culture. Genetically modified strains and constructed mimic virulence plasmids were employed to investigate the mechanisms underlying mobilization. The level of extracellular polysaccharides was measured by mucoviscosity assays and uronic acid quantification. An in silico analysis of 2608 plasmids derived from 814 completely sequenced K. pneumoniae strains available in GenBank was performed to investigate the distribution of putative helper plasmids and mobilizable virulence plasmids. A nonconjugative virulence plasmid was mobilized by the conjugative plasmid belonging to incompatibility group F (IncF) from the hvKP strain into ST11 CRKP strains under low extracellular polysaccharide-producing conditions or by employing intermediate E. coli strains. The virulence plasmid was mobilized via four modes: transfer alone, cotransfer with the conjugative IncF plasmid, hybrid plasmid formation due to two rounds of single-strand exchanges at specific 28-bp fusion sites or homologous recombination. According to the in silico analysis, 31.8% (242) of the putative helper plasmids and 98.8% (84/85) of the virulence plasmids carry the 28-bp fusion site. All virulence plasmids carry the origin of the transfer site. The nonconjugative virulence plasmid in ST11 CRKP strains is putatively mobilized from hvKP or E. coli intermediates with the help of conjugative IncF plasmids. Our findings emphasize the importance of raising public awareness of the rapid dissemination of virulence plasmids and the consistent emergence of hypervirulent carbapenem-resistant K. pneumoniae (hv-CRKP) strains.

37 citations

Journal ArticleDOI
TL;DR: Despite advertised “probiotic” effects, the results indicate that raw milk microbiota has minimal lactic acid bacteria.
Abstract: It has been estimated that at least 3% of the USA population consumes unpasteurized (raw) milk from animal sources, and the demand to legalize raw milk sales continues to increase However, consumption of raw milk can cause foodborne illness and be a source of bacteria containing transferrable antimicrobial resistance genes (ARGs) To obtain a comprehensive understanding of the microbiome and antibiotic resistome in both raw and processed milk, we systematically analyzed 2034 retail milk samples including unpasteurized milk and pasteurized milk via vat pasteurization, high-temperature-short-time pasteurization, and ultra-pasteurization from the United States using complementary culture-based, 16S rRNA gene, and metagenomic sequencing techniques Raw milk samples had the highest prevalence of viable bacteria which were measured as all aerobic bacteria, coliform, and Escherichia coli counts, and their microbiota was distinct from other types of milk 16S rRNA gene sequencing revealed that Pseudomonadaceae dominated raw milk with limited levels of lactic acid bacteria Among all milk samples, the microbiota remained stable with constant bacterial populations when stored at 4 °C In contrast, storage at room temperature dramatically enriched the bacterial populations present in raw milk samples and, in parallel, significantly increased the richness and abundance of ARGs Metagenomic sequencing indicated raw milk possessed dramatically more ARGs than pasteurized milk, and a conjugation assay documented the active transfer of blaCMY-2, one ceftazidime resistance gene present in raw milk-borne E coli, across bacterial species The room temperature-enriched resistome differed in raw milk from distinct geographic locations, a difference likely associated with regionally distinct milk microbiota Despite advertised “probiotic” effects, our results indicate that raw milk microbiota has minimal lactic acid bacteria In addition, retail raw milk serves as a reservoir of ARGs, populations of which are readily amplified by spontaneous fermentation There is an increased need to understand potential food safety risks from improper transportation and storage of raw milk with regard to ARGs

37 citations


Cites background from "In Silico Detection and Typing of P..."

  • ...0, [47]) to assess the presence of acquired ARGs and plasmids in E....

    [...]

Posted ContentDOI
23 Apr 2020-bioRxiv
TL;DR: Investigation of differences in the replicon distributions of protein-coding genes on a large scale as a new approach to distinguish plasmid-borne from chromosome-borne contigs as well as defining and computed statistical discrimination thresholds for a new metric: the replicons distribution score (RDS).
Abstract: Plasmids are extrachromosomal genetic elements replicating independently of the chromosome which play a vital role in the environmental adaptation of bacteria. Due to potential mobilization or conjugation capabilities, plasmids are important genetic vehicles for antimicrobial resistance genes and virulence factors with huge and increasing clinical implications. They are therefore subject to large genomic studies within the scientific community worldwide. As a result of rapidly improving next generation sequencing methods, the amount of sequenced bacterial genomes is constantly increasing, in turn raising the need for specialized tools to (i) extract plasmid sequences from draft assemblies, (ii) derive their origin and distribution, and (iii) further investigate their genetic repertoire. Recently, several bioinformatic methods and tools have emerged to tackle this issue; however, a combination of both high sensitivity and specificity in plasmid sequence identification is rarely achieved in a taxon-independent manner. In addition, many software tools are not appropriate for large high-throughput analyses or cannot be included into existing software pipelines due to their technical design or software implementation. In this study, we investigated differences in the replicon distributions of protein-coding genes on a large scale as a new approach to distinguish plasmid-borne from chromosome-borne contigs. We defined and computed statistical discrimination thresholds for a new metric: the replicon distribution score (RDS) which achieved an accuracy of 96.6%. The final performance was further improved by the combination of the RDS metric with heuristics exploiting several plasmid specific higher-level contig characterizations. We implemented this workflow in a new high-throughput taxon-independent bioinformatics software tool called Platon for the recruitment and characterization of plasmid-borne contigs from short-read draft assemblies. Compared to PlasFlow, Platon achieved a higher accuracy (97.5%) and more balanced predictions (F1=82.6%) tested on a broad range of bacterial taxa and better or equal performance against the targeted tools PlasmidFinder and PlaScope on sequenced ​ E. coli isolates. Platon is available at: ​ platon.computational.bio

37 citations

Posted ContentDOI
06 Nov 2015-bioRxiv
TL;DR: This study of the largest worldwide collection of sequenced ST131 E. coli isolates to date demonstrates that clonal expansion of two previously recognized antimicrobial-resistant clades started around 25 years ago, consistent with the widespread introduction of fluoroquinolones and extended-spectrum cephalosporins in clinical medicine.
Abstract: Background Escherichia coli sequence type 131 (ST131) has emerged globally as the most predominant lineage within this clinically important species, and its association with fluoroquinolone and extended-spectrum cephalosporin resistance impacts significantly on treatment. The evolutionary histories of this lineage, and of important antimicrobial resistance elements within it, remain unclearly defined. Results This study of the largest worldwide collection (n = 215) of sequenced ST131 E. coli isolates to date demonstrates that clonal expansion of two previously recognized antimicrobial-resistant clades, C1/H30R and C2/H30Rx, started around 25 years ago, consistent with the widespread introduction of fluoroquinolones and extended-spectrum cephalosporins in clinical medicine. These two clades appear to have emerged in the United States, with the expansion of the C2/H30Rx clade driven by the acquisition of a blaCTX-M-15-containing IncFII-like plasmid that has subsequently undergone extensive rearrangement. Several other evolutionary processes influencing the trajectory of this drug-resistant lineage are described, including sporadic acquisitions of CTX-M resistance plasmids, and chromosomal integration of blaCTX-M within sub-clusters followed by vertical evolution. These processes are also occurring for another family of CTX-M gene variants more recently observed amongst ST131, the blaCTX-M-14/14-like group. Conclusions The complexity of the evolutionary history of ST131 has important implications for antimicrobial resistance surveillance, epidemiological analysis, and control of emerging clinical lineages of E. coli. These data also highlight the global imperative to reduce specific antibiotic selection pressures, and demonstrate the important and varied roles played by plasmids and other mobile genetic elements in the perpetuation of antimicrobial resistance within lineages.

37 citations

References
More filters
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....

    [...]

Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....

    [...]

Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmid in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully FIG 1 Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family....

    [...]

  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....

    [...]

  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....

    [...]

  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....

    [...]

  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....

    [...]

Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches LIMS functionality of the software enables linkage to and organisation of laboratory samples The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus The BIGSDB source code and documentation are available at http://pubmlstorg/software/database/bigsdb/ Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....

    [...]