scispace - formally typeset
Search or ask a question
Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match with at least at 80% nucleotide identity all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S . Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: The data show that a rise in incidence of MDR E. coli ST73 clinical isolates is not due to a circulating outbreak strain, rather the ST73 circulating strains are distantly related and carry a diverse set of resistance plasmids, which suggests that the evolutionary events behind emergence of drug-resistant E coli differ between lineages.
Abstract: Objectives: To determine the population structure of E. coli ST73 isolated from human bacteraemia and urinary tract infections. Methods: The genomes of 22 E.coli ST73 isolates were sequenced using the Illumina HiSeq platform. High resolution SNP typing was used to create a phylogenetic tree. Comparative genomics were also performed using a pangenome approach. In silico and S1-PFGE plasmid profiling was conducted, and isolates were checked for their ability to survive exposure to human serum. Results: E.coli ST73 isolates circulating in clinically unrelated episodes show a high degree of diversity at a whole genome level, though exhibit conservation in gene content, particularly in virulence associated gene carriage. The isolates also contain a highly diverse plasmid pool that confers multi-drug resistance via carriage of CTX-M genes. All strains are highly serum resistant and uniformly carry genes shown to be essential for serum resistance. Conclusions: Our data shows that a rise in incidence of multi-drug resistant E.coli ST73 clinical isolates is not due to a circulating outbreak strain as in E.coli ST131. Rather the ST73 circulating strains are distantly related and carry a diverse set of resistance plasmids. This suggests that the evolutionary events behind emergence of drug resistant E.coli differ between lineages.

25 citations

Journal ArticleDOI
TL;DR: The prevalence of ESBL/pAmpC-Enterobacteriaceae was similar in hospitalized patients across the Dutch–German border region, whereas VRE prevalence was slightly higher on the German side and the overall prevalence of the studied pathogens was lower in the community than in hospitals in the Northern Netherlands.
Abstract: Objectives: To reveal the prevalence and epidemiology of extended-spectrum β-lactamase (ESBL)- and/or plasmid AmpC (pAmpC)- and carbapenemase (CP) producing Enterobacteriaceae and vancomycin-resistant enterococci (VRE) across the Northern Dutch-German border region. Methods: A point-prevalence study on ESBL/pAmpC/CP producing Enterobacteriaceae and VRE was carried out in hospitalized patients in the Northern Netherlands (n = 445, 2012-2013) and Germany (n = 242, 2012). Healthy individuals from the Dutch community (n = 400, 2010-2012) were also screened. In addition, a genome-wide gene-by-gene approach was applied to study the epidemiology of ESBL-Escherichia coli and VRE. Results: A total of 34 isolates from 27 patients (6.1%) admitted to Dutch hospitals were ESBL/pAmpC positive and 29 ESBL-E. coli, three pAmpC-E. coli, one ESBL-Enterobacter cloacae, and one pAmpC-Proteus mirabilis were found. In the German hospital, 18 isolates (16 E. coli and 2 Klebsiella pneumoniae) from 17 patients (7.7%) were ESBL positive. In isolates from the hospitalized patients CTX-M-15 was the most frequently detected ESBL-gene. In the Dutch community, 11 individuals (2.75%) were ESBL/pAmpC positive: 10 ESBL-E. coli (CTX-M-1 being the most prevalent gene) and one pAmpC E. coli. Six Dutch (1.3%) and four German (3.9%) hospitalized patients were colonized with VRE. Genetic relatedness by core genome multi-locus sequence typing (cgMLST) was found between two ESBL-E. coli isolates from Dutch and German cross-border hospitals and between VRE isolates from different hospitals within the same region. Conclusion: The prevalence of ESBL/pAmpC-Enterobacteriaceae was similar in hospitalized patients across the Dutch-German border region, whereas VRE prevalence was slightly higher on the German side. The overall prevalence of the studied pathogens was lower in the community than in hospitals in the Northern Netherlands. Cross-border transmission of ESBL-E. coli and VRE seems unlikely based on cgMLST analysis, however continuous monitoring is necessary to control their spread and stay informed about their epidemiology.

25 citations


Cites background from "In Silico Detection and Typing of P..."

  • ...…factors (VirulenceFinder), and species confirmation for VRE and ESBL-E. coli (KmerFinder), and serotype (SerotypeFinder) and plasmid replicons (PlasmidFinder) for ESBL-E. coli (Zankari et al., 2012; Carattoli et al., 2014; Hasman et al., 2014; Joensen et al., 2014, 2015; Larsen et al., 2014)....

    [...]

Posted ContentDOI
13 Nov 2017-bioRxiv
TL;DR: The data suggest the major source of K. pneumoniae was the patient’s own gut microbiome, but extended-spectrum beta-lactamase strains were acquired in the referring hospital, highlighting the need for rectal screening for ESBL organisms upon admission to geriatric wards.
Abstract: Background. Klebsiella pneumoniae is a leading cause of extended-spectrum beta-lactamase (ESBL) producing hospital-associated infections, for which elderly patients are at increased risk. Methods. We conducted a 1-year prospective cohort study, in which a third of patients admitted to two geriatric wards in a specialized hospital were recruited and screened for carriage of K. pneumoniae by microbiological culture. Clinical isolates were monitored via the hospital laboratory. Colonizing and clinical isolates were subjected to whole genome sequencing and antimicrobial susceptibility testing. Results. K. pneumoniae throat carriage prevalence was 4.1%, rectal carriage 10.8% and ESBL carriage 1.7%. K. pneumoniae infection incidence was 1.2%. The isolates were diverse, and most patients were colonized or infected with a unique phylogenetic lineage, with no evidence of transmission in the wards. ESBL strains carried blaCTX-M-15 and belonged to clones associated with hospital-acquired ESBL infections in other countries (ST29, ST323, ST340). One also carried the carbapenemase blaIMP-26. Genomic and epidemiological data provided evidence that ESBL strains were acquired in the referring hospital. Nanopore sequencing also identified strain-to-strain transmission of a blaCTX-M-15 FIBK/FIIK plasmid in the referring hospital. Conclusions. The data suggest the major source of K. pneumoniae was the patient9s own gut microbiome, but ESBL strains were acquired in the referring hospital. This highlights the importance of the wider hospital network to understanding K. pneumoniae risk and infection control. Rectal screening for ESBL organisms upon admission to geriatric wards could help inform patient management and infection control in such facilities.

25 citations


Cites methods from "In Silico Detection and Typing of P..."

  • ...Plasmid incompatibility types and subtypes were identified using PlasmidFinder[15] and established methods for IncC subtyping[16,17]....

    [...]

Journal ArticleDOI
TL;DR: Shigellosis trends in these communities provide insight into global movement of drug-resistant Enterobacteriaceae.
Abstract: Shigellae are sensitive indicator species for studying trends in the international transmission of antimicrobial-resistant Enterobacteriaceae. Orthodox Jewish communities (OJCs) are a known risk group for shigellosis; Shigella sonnei is cyclically epidemic in OJCs in Israel, and sporadic outbreaks occur in OJCs elsewhere. We generated whole-genome sequences for 437 isolates of S. sonnei from OJCs and non-OJCs collected over 22 years in Europe (the United Kingdom, France, and Belgium), the United States, Canada, and Israel and analyzed these within a known global genomic context. Through phylogenetic and genomic analysis, we showed that strains from outbreaks in OJCs outside of Israel are distinct from strains in the general population and relate to a single multidrug-resistant sublineage of S. sonnei that prevails in Israel. Further Bayesian phylogenetic analysis showed that this strain emerged approximately 30 years ago, demonstrating the speed at which antimicrobial drug-resistant pathogens can spread widely through geographically dispersed, but internationally connected, communities.

25 citations


Cites methods from "In Silico Detection and Typing of P..."

  • ...Contiguous sequences containing antimicrobial resistance genes were extracted from assemblies and the presence of plasmid incompatibility groups on these contiguous sequences was determined by using PlasmidFinder (33)....

    [...]

Journal ArticleDOI
TL;DR: Long-read assemblers, namely Canu, Flye, Miniasm/Racon, Raven, Redbean, and Shasta, were benchmarked using Oxford Nanopore long reads of bacterial pathogens and Raven was the most robust and accurate assembler.
Abstract: Oxford Nanopore sequencing can be used to achieve complete bacterial genomes. However, the error rates of Oxford Nanopore long reads are greater compared to Illumina short reads. Long-read assemblers using a variety of assembly algorithms have been developed to overcome this deficiency, which have not been benchmarked for genomic analyses of bacterial pathogens using Oxford Nanopore long reads. In this study, long-read assemblers, namely Canu, Flye, Miniasm/Racon, Raven, Redbean, and Shasta, were thus benchmarked using Oxford Nanopore long reads of bacterial pathogens. Ten species were tested for mediocre- and low-quality simulated reads, and 10 species were tested for real reads. Raven was the most robust assembler, obtaining complete and accurate genomes. All Miniasm/Racon and Raven assemblies of mediocre-quality reads provided accurate antimicrobial resistance (AMR) profiles, while the Raven assembly of Klebsiella variicola with low-quality reads was the only assembly with an accurate AMR profile among all assemblers and species. All assemblers functioned well for predicting virulence genes using mediocre-quality and real reads, whereas only the Raven assemblies of low-quality reads had accurate numbers of virulence genes. Regarding multilocus sequence typing (MLST), Miniasm/Racon was the most effective assembler for mediocre-quality reads, while only the Raven assemblies of Escherichia coli O157:H7 and K. variicola with low-quality reads showed positive MLST results. Miniasm/Racon and Raven were the best performers for MLST using real reads. The Miniasm/Racon and Raven assemblies showed accurate phylogenetic inference. For the pan-genome analyses, Raven was the strongest assembler for simulated reads, whereas Miniasm/Racon and Raven performed the best for real reads. Overall, the most robust and accurate assembler was Raven, closely followed by Miniasm/Racon.

25 citations


Cites methods from "In Silico Detection and Typing of P..."

  • ...com/phac-nml/staramr) against known plasmid sequences in the PlasmidFinder database [36] with 98% minimum identity and 60% minimum coverage....

    [...]

References
More filters
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....

    [...]

Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....

    [...]

Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmid in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully FIG 1 Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family....

    [...]

  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....

    [...]

  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....

    [...]

  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....

    [...]

  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....

    [...]

Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches LIMS functionality of the software enables linkage to and organisation of laboratory samples The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus The BIGSDB source code and documentation are available at http://pubmlstorg/software/database/bigsdb/ Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....

    [...]