Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match, with at least 80% nucleotide identity, all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S. Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.
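To make the 80% nucleotide identity criterion from the abstract concrete, here is a minimal Python sketch of how replicon hits might be filtered after a similarity search. It is not the PlasmidFinder implementation. It assumes the hits are available in the standard 12-column BLAST tabular layout (query, subject, percent identity, alignment length, ...), and the function names, the `Hit` type, and the replicon and contig labels are all hypothetical.

```python
from typing import Dict, List, NamedTuple


class Hit(NamedTuple):
    """One alignment between a query contig and a database replicon."""
    query: str       # contig name (hypothetical labels in the example)
    subject: str     # replicon name in the database
    identity: float  # percent nucleotide identity
    length: int      # alignment length in bases


def parse_blast_tabular(text: str) -> List[Hit]:
    """Parse tab-separated hits; only the first four columns are used."""
    hits = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        hits.append(Hit(fields[0], fields[1], float(fields[2]), int(fields[3])))
    return hits


def filter_by_identity(hits: List[Hit], min_identity: float = 80.0) -> List[Hit]:
    """Keep hits at or above the identity threshold (80% as in the abstract)."""
    return [h for h in hits if h.identity >= min_identity]


def best_hit_per_replicon(hits: List[Hit]) -> Dict[str, Hit]:
    """For each replicon, keep the hit with the highest identity, then length."""
    best: Dict[str, Hit] = {}
    for h in hits:
        current = best.get(h.subject)
        if current is None or (h.identity, h.length) > (current.identity, current.length):
            best[h.subject] = h
    return best


if __name__ == "__main__":
    # Invented example rows, not real search output.
    example = (
        "contig_1\trepliconX\t95.0\t300\t15\t0\t1\t300\t1\t300\t1e-100\t400\n"
        "contig_2\trepliconX\t79.9\t310\t62\t0\t1\t310\t1\t310\t1e-50\t200\n"
    )
    kept = filter_by_identity(parse_blast_tabular(example))
    for replicon, hit in best_hit_per_replicon(kept).items():
        print(replicon, hit.query, hit.identity)
```

A real pipeline would probably also apply a minimum alignment coverage; the abstract does not state one, so the sketch leaves it out.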


Citations
Journal ArticleDOI
26 Feb 2019-Mbio
TL;DR: Results from this study confirmed that traits such as metal, antibiotic, and phage resistance along with toxin-antitoxin systems are encoded on abundant circular plasmids, all of which could confer novel and advantageous traits to their hosts.
Abstract: Naturally occurring plasmids constitute a major category of mobile genetic elements responsible for harboring and transferring genes important in survival and fitness. A targeted evaluation of plasmidomes can reveal unique adaptations required by microbial communities. We developed a model system to optimize plasmid DNA isolation procedures targeted to groundwater samples which are typically characterized by low cell density (and likely variations in the plasmid size and copy numbers). The optimized method resulted in successful identification of several hundred circular plasmids, including some large plasmids (11 plasmids more than 50 kb in size, with the largest being 1.7 Mb in size). Several interesting observations were made from the analysis of plasmid DNA isolated in this study. The plasmid pool (plasmidome) was more conserved than the corresponding microbiome distribution (16S rRNA based). The circular plasmids were diverse as represented by the presence of seven plasmid incompatibility groups. The genes carried on these groundwater plasmids were highly enriched in metal resistance. Results from this study confirmed that traits such as metal, antibiotic, and phage resistance along with toxin-antitoxin systems are encoded on abundant circular plasmids, all of which could confer novel and advantageous traits to their hosts. This study confirms the ecological role of the plasmidome in maintaining the latent capacity of a microbiome, enabling rapid adaptation to environmental stresses. IMPORTANCE: Plasmidomes have been typically studied in environments abundant in bacteria, and this is the first study to explore plasmids from an environment characterized by low cell density. We specifically target groundwater, a significant source of water for human/agriculture use. We used samples from a well-studied site and identified hundreds of circular plasmids, including one of the largest sizes reported in plasmidome studies. The striking similarity of the plasmid-borne ORFs in terms of taxonomical and functional classifications across several samples suggests a conserved plasmid pool, in contrast to the observed variability in the 16S rRNA-based microbiome distribution. Additionally, the stress response to environmental factors has stronger conservation via plasmid-borne genes as marked by abundance of metal resistance genes. Last, identification of novel and diverse plasmids enriches the existing plasmid database(s) and serves as a paradigm to increase the repertoire of biological parts that are available for modifying novel environmental strains.

33 citations


Cites background from "In Silico Detection and Typing of P..."

  • ...The “circular_scaffolds” were also categorized into incompatibility groups (58) and the relaxase/MOB types (37)....

    [...]

Journal ArticleDOI
TL;DR: Single molecule real-time sequencing allowed monitoring of the genetic and epigenetic microevolution of MDR OXA-48-producing K. pneumoniae and revealed in addition to SNPs, complex rearrangements of genetic elements.
Abstract: Objectives: Carbapenemase-producing Klebsiella pneumoniae pose an increasing risk for healthcare facilities worldwide. A continuous monitoring of ST distribution and its association with resistance and virulence genes is required for early detection of successful K. pneumoniae lineages. In this study, we used WGS to characterize MDR blaOXA-48-positive K. pneumoniae isolated from inpatients at the University Medical Center Göttingen, Germany, between March 2013 and August 2014. Methods: Closed genomes for 16 isolates of carbapenemase-producing K. pneumoniae were generated by single molecule real-time technology using the PacBio RSII platform. Results: Eight of the 16 isolates showed identical XbaI macrorestriction patterns and shared the same MLST, ST147. The eight ST147 isolates differed by only 1-25 SNPs of their core genome, indicating a clonal origin. Most of the eight ST147 isolates carried four plasmids with sizes of 246.8, 96.1, 63.6 and 61.0 kb and a novel linear plasmid prophage, named pKO2, of 54.6 kb. The blaOXA-48 gene was located on a 63.6 kb IncL plasmid and is part of composite transposon Tn1999.2. The ST147 isolates expressed the yersiniabactin system as a major virulence factor. The comparative whole-genome analysis revealed several rearrangements of mobile genetic elements and losses of chromosomal and plasmidic regions in the ST147 isolates. Conclusions: Single molecule real-time sequencing allowed monitoring of the genetic and epigenetic microevolution of MDR OXA-48-producing K. pneumoniae and revealed, in addition to SNPs, complex rearrangements of genetic elements.

33 citations

Journal ArticleDOI
15 Aug 2018-PLOS ONE
TL;DR: The presence of carbapenem-resistant uropathogenic E. coli clones in community-acquired UTIs in Riyadh, Kingdom of Saudi Arabia is investigated to identify the virulence and resistance structures of the resistant clones and relate the isolates to those circulating globally.
Abstract: Urinary tract infections (UTIs) associated with Escherichia coli are a growing threat with an increase in the prevalence of multidrug resistant (MDR) strains, particularly β-lactamase producers, occurring globally. We investigated the presence of carbapenem-resistant uropathogenic E. coli clones in community-acquired UTIs in Riyadh, Kingdom of Saudi Arabia (KSA) to identify the virulence and resistance structures of the resistant clones and relate the isolates to those circulating globally. A combination of comparative genomics and phenotypic approaches was used to characterize ten MDR-uropathogenic Escherichia coli isolates recovered from UTI patients in Riyadh between November 2014 and January 2015. We report the presence of NDM-1 and 5, and OXA-181 in carbapenem-resistant UPEC strains from Riyadh, KSA. Single nucleotide polymorphism analyses demonstrated that these ten isolates fell into four phylogenetically distinct clades within the UPEC phylogeny. Comparative genomic analyses indicate that these diverse clones could be distinguished according to their multilocus sequencing type (MLST), serology, and virulence and antimicrobial gene architectures. These clones include the blaNDM-1 carrying isolates of the globally predominant MDR ST131 and ST69 types, previously identified as one of the most common UPEC strains in KSA. This is in addition to clones of ST23Cplx (ST410) and ST448Cplx (ST448) that have likely evolved from common intestinal strains, carrying copies of β-lactamase genes including blaNDM-5, blaCTX-M-15, blaTEM-1, blaCMY-42, blaOXA-1 and blaOXA-181. These data have identified an emerging public health concern and highlight the need to use comprehensive approaches to detect the structure of MDR E. coli populations associated with community-acquired UTIs in KSA.

32 citations

Journal ArticleDOI
01 Jun 2018
TL;DR: Detailed genetic data from the whole-genome sequencing of 162 bacteraemic isolates collected in Scotland, UK, in 2013–2015 is linked with clinical data to delineate bacterial and host factors that influence the acquisition in hospital or the community, outcome and antibiotic resistance.
Abstract: Bacteraemia caused by Escherichia coli is a growing problem with a significant mortality. The factors that influence the acquisition and outcome of these infections are not clear. Here, we have linked detailed genetic data from the whole-genome sequencing of 162 bacteraemic isolates collected in Scotland, UK, in 2013–2015, with clinical data in order to delineate bacterial and host factors that influence the acquisition in hospital or the community, outcome and antibiotic resistance. We identified four major sequence types (STs) in these isolates: ST131, ST69, ST73 and ST95. Nearly 50 % of the bacteraemic isolates had a urinary origin. ST69 was genetically distinct from the other STs, with significantly less sharing of accessory genes and with a distinct plasmid population. Virulence genes were widespread and diversely distributed between the dominant STs. ST131 was significantly associated with hospital-associated infections (HAIs), and ST69 with those from the community. However, there was no association of ST with outcome, although patients with HAI had a higher immediate mortality compared to those with community-associated infections (CAIs). Genome-wide association studies revealed genes involved in antibiotic persistence as significantly associated with HAIs and those encoding elements of a type VI secretion system with CAIs. Antibiotic resistance was common, and there were networks of correlated resistance genes and phenotypic antibiotic resistance. This study has revealed the complex interactions between the genotype of E. coli and its ability to cause bacteraemia, and some of the determinants influencing hospital or community acquisition. In part, these are shaped by antibiotic usage, but strain-specific factors are also important.

32 citations

Journal ArticleDOI
TL;DR: A large-scale phylogenomic analysis of a spatiotemporally and clinically diverse set of 907 E. coli isolates elucidates the molecular determinants of severe UTI and has implications for the early detection of this pathogen.
Abstract: Escherichia coli is the leading cause of urinary tract infection, one of the most common bacterial infections in humans. Despite this, a genomic perspective is lacking regarding the phylogenetic distribution of isolates associated with different clinical syndromes. Here, we present a large-scale phylogenomic analysis of a spatiotemporally and clinically diverse set of 907 E. coli isolates, including 722 uropathogenic E. coli (UPEC) isolates. A genome-wide association approach identifies the (P-fimbriae-encoding) papGII locus as the key feature distinguishing invasive UPEC, defined as isolates associated with severe UTI, i.e., kidney infection (pyelonephritis) or urinary-source bacteremia, from non-invasive UPEC, defined as isolates associated with asymptomatic bacteriuria or bladder infection (cystitis). Within the E. coli population, distinct invasive UPEC lineages emerged through repeated horizontal acquisition of diverse papGII-containing pathogenicity islands. Our findings elucidate the molecular determinants of severe UTI and have implications for the early detection of this pathogen.

32 citations

References
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....

    [...]

Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....

    [...]

Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmids in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully ... [FIG 1: Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family.]...

    [...]

  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....

    [...]

  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....

    [...]

  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....

    [...]

  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....

    [...]

Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms. These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner. The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens. The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences. These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses. Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches. LIMS functionality of the software enables linkage to and organisation of laboratory samples. The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database. Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus. The BIGSDB source code and documentation are available at http://pubmlst.org/software/database/bigsdb/. Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies. BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.
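The last step described in this abstract, determining the sequence type from the combination of alleles identified, amounts to a lookup of an allele profile in a table of known profiles. The Python sketch below illustrates that idea only. It is not the authors' code, and the locus names, allele numbers, ST numbers, and function names are invented for the example.

```python
from typing import Dict, Optional, Sequence, Tuple


def build_profile_index(
    loci: Sequence[str],
    profiles: Dict[int, Dict[str, int]],
) -> Dict[Tuple[int, ...], int]:
    """Map each known allele combination (in fixed locus order) to its ST."""
    index: Dict[Tuple[int, ...], int] = {}
    for st, alleles in profiles.items():
        index[tuple(alleles[locus] for locus in loci)] = st
    return index


def assign_st(
    loci: Sequence[str],
    index: Dict[Tuple[int, ...], int],
    called: Dict[str, int],
) -> Optional[int]:
    """Return the ST for the called alleles, or None if the profile is
    incomplete or the combination is not in the table (a possible new ST)."""
    if any(locus not in called for locus in loci):
        return None
    return index.get(tuple(called[locus] for locus in loci))


if __name__ == "__main__":
    # Hypothetical two-locus scheme with two known sequence types.
    loci = ("locusA", "locusB")
    profiles = {
        1: {"locusA": 1, "locusB": 2},
        2: {"locusA": 3, "locusB": 2},
    }
    index = build_profile_index(loci, profiles)
    print(assign_st(loci, index, {"locusA": 3, "locusB": 2}))
```

Returning `None` for an unseen combination keeps new alleles or ST variants visible to the caller instead of forcing them onto the nearest known type.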

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....

    [...]