scispace - formally typeset
Search or ask a question
Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match with at least at 80% nucleotide identity all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S . Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: The unexplained success of some lineages within this pool of highly resistant strains, and the discontinuity between phenotypic resistance and genotype at the macro level, indicate that intrinsic mechanisms contribute to competitive advantage and/or resistance.
Abstract: Klebsiella pneumoniae is a major threat to public health with the emergence of isolates resistant to most, if not all, useful antibiotics. We present an in-depth analysis of 178 extended-spectrum beta-lactamase (ESBL)-producing K. pneumoniae collected from patients resident in a region of Pakistan, during the period 2010-2012, when the now globally-distributed carbapenemase bla-NDM-1 was being acquired by Klebsiella. We observed two dominant lineages, but neither the overall resistance profile nor virulence-associated factors, explain their evolutionary success. Phenotypic analysis of resistance shows few differences between the acquisition of resistance genes and the phenotypic resistance profile, including beta-lactam antibiotics that were used to treat ESBL-positive strains. Resistance against these drugs could be explained by inhibitor-resistant beta-lactamase enzymes, carbapenemases or ampC type beta-lactamases, at least one of which was detected in most, but not all relevant strains analysed. Complete genomes for six selected strains are reported, these provide detailed insights into the mobile elements present in these isolates during the initial spread of NDM-1. The unexplained success of some lineages within this pool of highly resistant strains, and the discontinuity between phenotypic resistance and genotype at the macro level, indicate that intrinsic mechanisms contribute to competitive advantage and/or resistance.

45 citations

Journal ArticleDOI
01 Sep 2015-PLOS ONE
TL;DR: Virulence factors and phenotypic characteristics of MPEC that may play a role in pathogenesis of E. coli mastitis are revealed and showed that the persistent MPEC was the most versatile in terms of nutrients metabolized and acute MPEC the least.
Abstract: Escherichia coli is a major etiological agent of intra-mammary infections (IMI) in cows, leading to acute mastitis and causing great economic losses in dairy production worldwide. Particular strains cause persistent IMI, leading to recurrent mastitis. Virulence factors of mammary pathogenic E. coli (MPEC) involved pathogenesis of mastitis as well as those differentiating strains causing acute or persistent mastitis are largely unknown. This study aimed to identify virulence markers in MPEC through whole genome and phenome comparative analysis. MPEC strains causing acute (VL2874 and P4) or persistent (VL2732) mastitis were compared to an environmental strain (K71) and to the genomes of strains representing different E. coli pathotypes. Intra-mammary challenge in mice confirmed experimentally that the strains studied here have different pathogenic potential, and that the environmental strain K71 is non-pathogenic in the mammary gland. Analysis of whole genome sequences and predicted proteomes revealed high similarity among MPEC, whereas MPEC significantly differed from the non-mammary pathogenic strain K71, and from E. coli genomes from other pathotypes. Functional features identified in MPEC genomes and lacking in the non-mammary pathogenic strain were associated with synthesis of lipopolysaccharide and other membrane antigens, ferric-dicitrate iron acquisition and sugars metabolism. Features associated with cytotoxicity or intra-cellular survival were found specifically in the genomes of strains from severe and acute (VL2874) or persistent (VL2732) mastitis, respectively. MPEC genomes were relatively similar to strain K-12, which was subsequently shown here to be possibly pathogenic in the mammary gland. Phenome analysis showed that the persistent MPEC was the most versatile in terms of nutrients metabolized and acute MPEC the least. Among phenotypes unique to MPEC compared to the non-mammary pathogenic strain were uric acid and D-serine metabolism. This study reveals virulence factors and phenotypic characteristics of MPEC that may play a role in pathogenesis of E. coli mastitis.

45 citations

Journal ArticleDOI
TL;DR: A novel understanding of gene regulation and mechanisms of inulin utilization in probiotic L. plantarum generating opportunities for synbiotic product development is provided.
Abstract: The draft genome of L. plantarum isolated from Asian fermented foods, infant feces and shrimp intestines were sequenced and compared to those of well-studied strains. Among twenty-eight strains of L. plantarum, the variation of genomic features involved in the ecological adaptation were elucidated. The genome sizes ranged from approximately 3.1 to 3.5 Mb, of which about 2,932 to 3,345 of protein-coding sequences (CDS) were predicted. The food derived isolates contained higher number of carbohydrate metabolism associated genes than those from infant feces. This observation correlated to their phenotypic trait of carbohydrate metabolic profile indicating the ability to metabolize the largest range of sugars. Surprisingly, two isolates of P14 and P76 from fermented fish were able to utilize inulin. β-fructosidase, the inulin-degrading enzyme, was detected in both supernatant and cell wall extract of both strains. No activity was observed in the cytoplasmic fraction, indicating that this key enzyme was either membrane-bound or extracellularly secreted. From genomic mining analysis, a predicted inulin operon of fosRABCDX , which encoded β-fructosidase and many fructose transporting proteins were found within the genomes of strains P14 and P76. Moreover, pst1BCA genes encoding sucrose-specific IIBCA components involved in sucrose transport, was also identified. The proteomic analysis revealed the mechanism and functional characteristic of fosRABCDXE operon involved in inulin utilization of L. plantarum . The expression of both fos operon and pst genes were up-regulated at mid-log phase. FosE and the LPXTG-motif cell wall anchored β-fructosidase was induced in high abundance when inulin was present as a carbon source. Importance Inulin is a long-chain carbohydrate that may at as a prebiotic, which provide many health benefits to the host by selectively stimulating the growth and activity of beneficial bacteria in the colon. While certain lactobacilli can catabolize inulin, this has not yet been described for L. plantarum and its associated putative inulin operon has not been reported in this species. By using comparative and functional genomics we showed that two L. plantarum strains were able to utilize inulin, and identified a functional inulin operon in their genomes. The proteogenomic data revealed inulin degrading and uptaking routes widely expressed among L. plantarum strains, which related to fosRABCDXE operon and pstBCA genes. The present work provides novel understanding gene regulation and mechanisms of inulin utilization in probiotic L. plantarum generating opportunities for synbiotic product development.

45 citations

Journal ArticleDOI
TL;DR: It is found that HGT into the recording strain in human clinical fecal samples can be extensive and is driven by different plasmid types, with the IncX type being the most actively transferred.
Abstract: The flow of genetic material between bacteria is central to the adaptation and evolution of bacterial genomes. However, our knowledge about DNA transfer within complex microbiomes is lacking, with most studies of horizontal gene transfer (HGT) relying on bioinformatic analyses of genetic elements maintained on evolutionary timescales or experimental measurements of phenotypically trackable markers. Here, we utilize the CRISPR-Cas spacer acquisition process to detect DNA acquisition events from complex microbiota in real-time and at nucleotide resolution. In this system, an E. coli recording strain is exposed to a microbial sample and spacers are acquired from transferred plasmids and permanently stored in genomic CRISPR arrays. Sequencing and analysis of acquired spacers enables identification of the transferred plasmids. This approach allowed us to identify individual mobile elements without relying on phenotypic markers or post-transfer replication. We found that HGT into the recording strain in human clinical fecal samples can be extensive and is driven by different plasmid types, with the IncX type being the most actively transferred. Horizontal gene transfer (HGT) in complex bacterial communities has been mostly studied using metagenomic analyses. Here, the authors develop an E. coli CRISPR-Cas spacer acquisition platform that allows real-time recording of HGT events at nucleotide-resolution, identifying diverse DNA transfer events in human clinical fecal samples.

45 citations

Journal ArticleDOI
TL;DR: MCRPEC were highly prevalent in the aquaculture supply chain, with the isolates showing resistance to most antibiotics, and genomic analysis suggested mcr-1 could be transferred to humans via the aquatic food chain.

45 citations

References
More filters
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....

    [...]

Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....

    [...]

Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmid in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully FIG 1 Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family....

    [...]

  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....

    [...]

  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....

    [...]

  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....

    [...]

  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....

    [...]

Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches LIMS functionality of the software enables linkage to and organisation of laboratory samples The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus The BIGSDB source code and documentation are available at http://pubmlstorg/software/database/bigsdb/ Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....

    [...]