Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match, at a nucleotide identity of at least 80%, all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S. Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.
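The identity cutoff mentioned in the abstract can be illustrated with a minimal sketch. This is not the PlasmidFinder implementation: the `Hit` record, the `filter_hits` function, the replicon names in the usage note, and the coverage threshold are all hypothetical, and the 80% default simply reuses the figure quoted above as an assumed user-facing cutoff. The sketch assumes alignment hits have already been produced by some external search step.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One alignment hit against a replicon database (hypothetical record)."""
    replicon: str    # name of the matched replicon sequence
    identity: float  # percent nucleotide identity of the alignment (0-100)
    coverage: float  # fraction of the replicon covered by the alignment (0-1)


def filter_hits(hits, min_identity=80.0, min_coverage=0.6):
    """Keep the best hit per replicon that passes both thresholds.

    min_identity defaults to the 80% figure from the abstract;
    min_coverage is an assumption made for this illustration only.
    """
    best = {}
    for hit in hits:
        if hit.identity < min_identity or hit.coverage < min_coverage:
            continue  # hit is too divergent or too short to report
        current = best.get(hit.replicon)
        # Rank by identity first, then by coverage.
        if current is None or (hit.identity, hit.coverage) > (current.identity, current.coverage):
            best[hit.replicon] = hit
    # Return hits in a stable, name-sorted order.
    return sorted(best.values(), key=lambda h: h.replicon)
```

For example, given two hits for one replicon and two hits that fail a threshold, only the higher-identity hit for the first replicon would be reported.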

Citations
Journal ArticleDOI
TL;DR: A fusion plasmid that arose during conjugation of a tet(X4)-bearing plasmid probably formed through IS26 homologous recombination, and such recombination events presumably play an important role in the dissemination of tet(X4).
Abstract: Objectives: As the spread of antimicrobial resistance genes becomes an increasing global threat, improved understanding of the genetic structure and transferability of resistance plasmids becomes more critical. The recent description of several plasmid-mediated tet(X) variant genes, tet(X3), tet(X4) and tet(X5), poses a considerable risk for public health. This study aimed to investigate the recombination event that occurred during the conjugation process of a tet(X4)-bearing plasmid. Methods: A Tet(X4)-producing Escherichia coli isolate, 2019XSD11, was subjected to susceptibility testing, S1-PFGE and whole genome sequencing. The genetic features of plasmids and the recombination event were analysed by sequence comparison and annotation. We performed an electrotransformation assay to further test the transferability of the tet(X4)-bearing plasmid. Results: A novel type of fusion tet(X4)-bearing plasmid was discovered from the transconjugant, plasmid p2019XSD11-TC2-284 (∼280 kbp). The sequence of this plasmid consisted of a hybrid episome of two plasmids, p2019XSD11-190 (∼190 kbp) harbouring tet(X4) and p2019XSD11-92 (∼92 kbp) harbouring blaCTX-M-55, both originating from 2019XSD11. The two plasmids were concatenated by IS26 elements. Analyses of the genetic constitution of the plasmids essential for transmission showed that plasmid p2019XSD11-190 lacked an intact type IV secretion system. Beyond this, the origin of transfer region and relaxase genes in plasmid p2019XSD11-190 had no sequence similarity with those in plasmid p2019XSD11-92. Conclusions: The fusion of the two plasmids probably formed through IS26 homologous recombination. Such recombination events presumably play an important role in the dissemination of tet(X4). Molecular surveillance of tet(X) variant genes and genetic structures warrants further investigation to evaluate the underlying public health risk.

29 citations

Journal ArticleDOI
TL;DR: The need to establish appropriate control measures to curb the spread of MDR-MRSA in the food chain is suggested and a human-associated clone, ST612-CC8-t1257-SCCmec_IVd (2B), previously reported in South Africa is suggested.

29 citations

Journal ArticleDOI
22 Jan 2020
TL;DR: Genome analysis suggested that Pantoea agglomerans strain C1 exhibits high biotechnological potential as plant growth-promoting bacterium in heavy metal polluted soils.
Abstract: Distinctive strains of Pantoea are used as soil inoculants for their ability to promote plant growth. Pantoea agglomerans strain C1, previously isolated from the phyllosphere of lettuce, can produce indole-3-acetic acid (IAA), solubilize phosphate, and inhibit plant pathogens, such as Erwinia amylovora. In this paper, the complete genome sequence of strain C1 is reported. In addition, experimental evidence is provided on how the strain tolerates arsenate As (V) up to 100 mM, and on how secreted metabolites like IAA and siderophores act as biostimulants in tomato cuttings. The strain has a circular chromosome and two prophages for a total genome of 4,846,925-bp, with a DNA G+C content of 55.2%. Genes related to plant growth promotion and biocontrol activity, such as those associated with IAA and spermidine synthesis, solubilization of inorganic phosphate, acquisition of ferrous iron, and production of volatile organic compounds, siderophores and GABA, were found in the genome of strain C1. Genome analysis also provided better understanding of the mechanisms underlying strain resistance to multiple toxic heavy metals and transmission of these genes by horizontal gene transfer. Findings suggested that strain C1 exhibits high biotechnological potential as plant growth-promoting bacterium in heavy metal polluted soils.

29 citations

Journal ArticleDOI
TL;DR: The hybrid assembly pipeline of Unicycler is demonstrated as a superior approach for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing.
Abstract: We benchmarked the hybrid assembly approaches of MaSuRCA, SPAdes, and Unicycler for bacterial pathogens using Illumina and Oxford Nanopore sequencing by determining genome completeness and accuracy, antimicrobial resistance (AMR), virulence potential, multilocus sequence typing (MLST), phylogeny, and pan genome. Ten bacterial species (10 strains) were tested for simulated reads of both mediocre- and low-quality, whereas 11 bacterial species (12 strains) were tested for real reads. Unicycler performed the best for achieving contiguous genomes, closely followed by MaSuRCA, while all SPAdes assemblies were incomplete. MaSuRCA was less tolerant of low-quality long reads than SPAdes and Unicycler. The hybrid assemblies of five antimicrobial-resistant strains with simulated reads provided consistent AMR genotypes with the reference genomes. The MaSuRCA assembly of Staphylococcus aureus with real reads contained msr(A) and tet(K), while the reference genome and SPAdes and Unicycler assemblies harbored blaZ. The AMR genotypes of the reference genomes and hybrid assemblies were consistent for the other five antimicrobial-resistant strains with real reads. The numbers of virulence genes in all hybrid assemblies were similar to those of the reference genomes, irrespective of simulated or real reads. The one exception was that the reference genome and hybrid assemblies of Pseudomonas aeruginosa with mediocre-quality long reads carried 241 virulence genes, whereas 184 virulence genes were identified in the hybrid assemblies of low-quality long reads. The MaSuRCA assemblies of Escherichia coli O157:H7 and Salmonella Typhimurium with mediocre-quality long reads contained 126 and 118 virulence genes, respectively, while 110 and 107 virulence genes were detected in their MaSuRCA assemblies of low-quality long reads, respectively. All approaches performed well in our MLST and phylogenetic analyses. The pan genomes of the hybrid assemblies of S. Typhimurium with mediocre-quality long reads were similar to that of the reference genome, while SPAdes and Unicycler were more tolerant of low-quality long reads than MaSuRCA for the pan-genome analysis. All approaches functioned well in the pan-genome analysis of Campylobacter jejuni with real reads. Our research demonstrates the hybrid assembly pipeline of Unicycler as a superior approach for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing.

29 citations

Journal ArticleDOI
TL;DR: This work characterizes a subset of the CRKP isolates and reconstructs the spread of a multi-clone epidemic event that occurred in an Intensive Care Unit in a hospital in Northern Italy from August 2015 to May 2016 and that was associated with seven deaths.
Abstract: The circulation of carbapenem-resistant Klebsiella pneumoniae (CRKP) is a significant problem worldwide. In this work we characterize the isolates and reconstruct the spread of a multi-clone epidemic event that occurred in an Intensive Care Unit in a hospital in Northern Italy. The event took place from August 2015 to May 2016 and involved 23 patients. Twelve of these patients were colonized by CRKP at the gastrointestinal level, while the other 11 were infected at various body sites. We retrospectively collected data on the inpatients and characterized a subset of the CRKP isolates using antibiotic resistance profiling and whole genome sequencing. A SNP-based phylogenetic approach was used to depict the evolutionary context of the obtained genomes, showing that 26 of the 32 isolates belong to three genome clusters, while the remaining six were classified as sporadic. The first genome cluster was composed of multi-resistant isolates of sequence type (ST) 512. Among those, two were resistant to colistin, one of which indicated the emergence of resistance during an infection. One patient hospitalized in this period was colonized by two strains of CRKP, both carrying the blaKPC gene (variant KPC-3). The analysis of the genome contig containing the blaKPC locus indicates that the gene was not transmitted between the two isolates. The second infection cluster comprised four other genomes of ST512, while the third one (ST258) colonized 12 patients, causing five clinical infections and resulting in seven deaths. This cluster presented the highest level of antibiotic resistance, including colistin resistance in all 17 analyzed isolates. The three outbreak clones did not present more virulence genes than the sporadic isolates and had different patterns of antibiotic resistance; however, they were clearly distinct from the sporadic ones in terms of infection status, being the only ones causing overt infections.

29 citations

References
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....


Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....


Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmids in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully... [FIG 1: Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family.]...


  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....


  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....


  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....


  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....


Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms. These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner. The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens. The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences. These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses. Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches. LIMS functionality of the software enables linkage to and organisation of laboratory samples. The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database. Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus. The BIGSDB source code and documentation are available at http://pubmlst.org/software/database/bigsdb/. Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies. BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.
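The last step described in this abstract, deriving a sequence type from the combination of best-matching alleles, can be sketched roughly as follows. This is an illustration under stated assumptions, not the published method: the abstract does not specify the BLAST-based ranking, so ranking candidates by identity and then coverage is a stand-in, and the function names, locus names, and profile table are all hypothetical.

```python
def best_allele(candidates):
    """Pick the top-ranked allele for one locus.

    candidates: list of (allele_number, identity, coverage) tuples.
    Ranking by identity, then coverage, is an assumed stand-in for the
    BLAST-based ranking mentioned in the abstract.
    """
    if not candidates:
        return None  # no allele found for this locus
    return max(candidates, key=lambda c: (c[1], c[2]))[0]


def assign_st(candidates_by_locus, profiles, loci):
    """Determine a sequence type from the best allele at each locus.

    candidates_by_locus: dict mapping locus name to its candidate list.
    profiles: dict mapping a tuple of allele numbers (in `loci` order) to an ST.
    Returns the ST, "novel" for an unlisted allele combination,
    or None if any locus has no allele call.
    """
    calls = [best_allele(candidates_by_locus.get(locus, [])) for locus in loci]
    if any(call is None for call in calls):
        return None
    return profiles.get(tuple(calls), "novel")
```

Keeping the per-locus allele choice separate from the profile lookup means a different ranking rule can be swapped in without touching the ST assignment.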

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....
