scispace - formally typeset
Search or ask a question
Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match with at least at 80% nucleotide identity all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S . Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
23 Mar 2022-Mbio
TL;DR: The authors' analyses of Samoan isolates from 1983 to 2020 identified a rare S. Typhi population in Samoa that likely emerged around the early 1970s and evolved into sublineages that are presently dominant, indicating overwhelming susceptibility to antibiotics that are no longer effective in most of South and Southeast Asia.
Abstract: In this study, we used whole genome sequencing and comparative genomics analyses to characterize the population structure, evolutionary origins, and genomic features of S. Typhi associated with decades of endemic typhoid fever in Samoa. ABSTRACT For decades, the remote island nation of Samoa (population ~200,000) has faced endemic typhoid fever despite improvements in water quality, sanitation, and economic development. We recently described the epidemiology of typhoid fever in Samoa from 2008 to 2019 by person, place, and time; however, the local Salmonella enterica serovar Typhi (S. Typhi) population structure, evolutionary origins, and genomic features remained unknown. Herein, we report whole genome sequence analyses of 306 S. Typhi isolates from Samoa collected between 1983 and 2020. Phylogenetics revealed a dominant population of rare genotypes 3.5.4 and 3.5.3, together comprising 292/306 (95.4%) of Samoan versus 2/4934 (0.04%) global S. Typhi isolates. Three distinct 3.5.4 genomic sublineages were identified, and their defining polymorphisms were determined. These dominant Samoan genotypes, which likely emerged in the 1970s, share ancestry with other 3.5 clade isolates from South America, Southeast Asia, and Oceania. Additionally, a 106-kb pHCM2 phenotypically cryptic plasmid, detected in a 1992 Samoan S. Typhi isolate, was identified in 106/306 (34.6%) of Samoan isolates; this is more than double the observed proportion of pHCM2-containing isolates in the global collection. In stark contrast with global S. Typhi trends, resistance-conferring polymorphisms were detected in only 15/306 (4.9%) of Samoan S. Typhi, indicating overwhelming susceptibility to antibiotics that are no longer effective in most of South and Southeast Asia. This country-level genomic framework can help local health authorities in their ongoing typhoid surveillance and control efforts, as well as fill a critical knowledge gap in S. Typhi genomic data from Oceania. IMPORTANCE In this study, we used whole genome sequencing and comparative genomics analyses to characterize the population structure, evolutionary origins, and genomic features of S. Typhi associated with decades of endemic typhoid fever in Samoa. Our analyses of Samoan isolates from 1983 to 2020 identified a rare S. Typhi population in Samoa that likely emerged around the early 1970s and evolved into sublineages that are presently dominant. The dominance of these endemic genotypes in Samoa is not readily explained by genomic content or widespread acquisition of antimicrobial resistance. These data establish the necessary framework for future genomic surveillance of S. Typhi in Samoa for public health benefit.

4 citations

01 Jan 2014
TL;DR: The whole genome sequence and resistome of the outbreak Klebsiella pneumoniae strain MP14 was analyzed and compared it with those of K. pneumoniae carbapenemase- (KPC-) producing isolates that showed high similarity in the NCBI genome database.
Abstract: We analyzed the whole genome sequence and resistome of the outbreak Klebsiella pneumoniae strain MP14 and compared it with those of K. pneumoniae carbapenemase- (KPC-) producing isolates that showed high similarity in the NCBI genome database. A KPC-2-producing multidrug-resistant (MDR) K. pneumoniae clinical isolate was obtained from a patient admitted to a Korean hospital in 2011. The strain MP14 was resistant to all tested β-lactams including monobactam, amikacin, levofloxacin, and cotrimoxazole, but susceptible to tigecycline and colistin. Resistome analysis showed the presence of -lactamase genes including , , , and . MP14 also possessed aac(6′-)Ib, aadA2, and aph(3′-)Ia as aminoglycoside resistance-encoding genes, mph(A) for macrolides, oqxA and oqxB for quinolone, catA1 for phenicol, sul1 for sulfonamide, and dfrA12 for trimethoprim. Both SNP tree and cgMLST analysis showed the close relatedness with the KPC producers (KPNIH strains) isolated from an outbreak in the USA and colistin-resistant strains isolated in Italy. The plasmid-scaffold genes in plasmids pKpQil, pKpQil-IT, pKPN3, or pKPN-IT were identified in MP14, KPNIH, and Italian strains. The KPC-2-producing MDR K. pneumoniae ST258 stain isolated in Korea was highly clonally related with MDR K. pneumoniae strains from the USA and Italy. Global spread of KPC-producing K. pneumoniae is a worrying phenomenon.

4 citations

Journal ArticleDOI
TL;DR: This work demonstrates the advantages associated with coupling genomics technologies with phenotypic data for novel ARG identification and demonstrates the efficacy and feasibility of a published data set from the U.S. National Antimicrobial Resistance Monitoring System.
Abstract: The spread of antibiotic-resistant bacteria presents a global health challenge. Efficient surveillance of bacteria harboring antibiotic resistance genes (ARGs) is a critical aspect to controlling the spread. Increased access to microbial genomic data from many diverse populations informs this surveillance but only when functional ARGs are identifiable within the data set. Current, homology-based approaches are effective at identifying the majority of ARGs within given clinical and nonclinical data sets for several pathogens, yet there are still some whose identities remain elusive. By coupling phenotypic profiling with genotypic data, these unknown ARGs can be identified to strengthen homology-based searches. To prove the efficacy and feasibility of this approach, a published data set from the U.S. National Antimicrobial Resistance Monitoring System (NARMS), for which the phenotypic and genotypic data of 640 Salmonella isolates are available, was subjected to this analysis. Six isolates recovered from the NARMS retail meat program between 2011 and 2013 were identified previously as phenotypically resistant to gentamicin but contained no known gentamicin resistance gene. Using the phenotypic and genotypic data, a comparative genomics approach was employed to identify the gene responsible for the observed resistance in all six of the isolates. This gene, grdA, is harbored on a 9,016-bp plasmid that is transferrable to Escherichia coli, confers gentamicin resistance to E. coli, and has never before been reported to confer gentamicin resistance. Bioinformatic analysis of the encoded protein suggests an ATP binding motif. This work demonstrates the advantages associated with coupling genomics technologies with phenotypic data for novel ARG identification.

4 citations

Journal ArticleDOI
TL;DR: The increase in antimicrobial resistance in food production environments increases the resistance rate of Acinetobacter strains present in meat, inhibits the isolation of Campylobacteria strains, and acts as a medium for the transmission of antimicrobial Resistance in the environment.
Abstract: Acinetobacter strains are widely present in the environment Some antimicrobial-resistant strains of this genus have been implicated in infections acquired in hospitals Genetic similarities have been reported between Acinetobacter strains in nosocomial infections and those isolated from foods However, the antimicrobial resistance of Acinetobacter strains in foods, such as meat, remains unclear This study initially aimed to isolate Campylobacter strains; instead, strains of the genus Acinetobacter were isolated from meat products, and their antimicrobial resistance was investigated In total, 58 Acinetobacter strains were isolated from 381 meat samples Of these, 32 strains (386%) were from beef, 22 (265%) from pork, and 4 (48%) from duck meat Antimicrobial susceptibility tests revealed that 12 strains were resistant to more than one antimicrobial agent, whereas two strains were multidrug-resistant; both strains were resistant to colistin Cephalosporin antimicrobials showed high minimal inhibitory concentration against Acinetobacter strains Resfinder analysis showed that one colistin-resistant strain carried mcr-43; this plasmid type was not confirmed, even when analyzed with PlasmidFinder Analysis of the contig harboring mcr-43 using BLAST confirmed that this contig was related to mcr-43 of Acinetobacter baumannii The increase in antimicrobial resistance in food production environments increases the resistance rate of Acinetobacter strains present in meat, inhibits the isolation of Campylobacter strains, and acts as a medium for the transmission of antimicrobial resistance in the environment Therefore, further investigations are warranted to prevent the spread of antimicrobial resistance in food products

4 citations

Journal ArticleDOI
26 Jun 2020
TL;DR: PlasmidSPAdes is not a suitable plasmid assembly tool for short-read sequence data for ESBL-encoding plasmids of Enterobacteriaceae and no optimal k-mer size could be defined for the number of ESBL genes andplasmid replicons detected.
Abstract: Knowledge of the epidemiology of plasmids is essential for understanding the evolution and spread of antimicrobial resistance PlasmidSPAdes attempts to reconstruct plasmids using short-read sequence data Accurate detection of extended-spectrum beta-lactamase (ESBL) genes and plasmid replicon genes is a prerequisite for the use of plasmid assembly tools to investigate the role of plasmids in the spread and evolution of ESBL production in Enterobacteriaceae This study evaluated the performance of PlasmidSPAdes plasmid assembly for Enterobacteriaceae in terms of detection of ESBL-encoding genes, plasmid replicons and chromosomal wgMLST genes, and assessed the effect of k-mer size Short-read sequence data for 59 ESBL-producing Enterobacteriaceae were assembled with PlasmidSPAdes using different k-mer sizes (21, 33, 55, 77, 99 and 127) For every k-mer size, the presence of ESBL genes, plasmid replicons and a selection of chromosomal wgMLST genes in the plasmid assembly was determined Out of 241 plasmid replicons and 66 ESBL genes detected by whole-genome assembly, 213 plasmid replicons [88 %; 95 % confidence interval (CI): 839-919] and 43 ESBL genes (65 %; 95 % CI: 531-756) were detected in the plasmid assemblies obtained by PlasmidSPAdes For most ESBL genes (833 %) and plasmid replicons (720 %), detection results did not differ between the k-mer sizes used in the plasmid assembly No optimal k-mer size could be defined for the number of ESBL genes and plasmid replicons detected For most isolates, the number of chromosomal wgMLST genes detected in the plasmid assemblies decreased with increasing k-mer size Based on our findings, PlasmidSPAdes is not a suitable plasmid assembly tool for short-read sequence data for ESBL-encoding plasmids of Enterobacteriaceae

4 citations


Additional excerpts

  • ...1 (Center for Genomic Epidemiology, DTU, Denmark) [7, 21]....

    [...]

  • ...1 and PlasmidFinder v.1.3.1 (Center for Genomic Epidemiology, DTU, Denmark) [7, 21]....

    [...]

References
More filters
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....

    [...]

Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....

    [...]

Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmid in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully FIG 1 Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family....

    [...]

  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....

    [...]

  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....

    [...]

  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....

    [...]

  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....

    [...]

Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches LIMS functionality of the software enables linkage to and organisation of laboratory samples The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus The BIGSDB source code and documentation are available at http://pubmlstorg/software/database/bigsdb/ Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....

    [...]