scispace - formally typeset
Search or ask a question
Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match with at least at 80% nucleotide identity all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S . Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.

Content maybe subject to copyright    Report

Citations
More filters
Posted ContentDOI
23 Oct 2020-bioRxiv
TL;DR: The independent emergence of azithromycin resistant typhoid in Northern India reflects an emerging broader problem across South Asia and illustrates the urgent need for the introduction of typhoid conjugate vaccines (TCVs) in the region.
Abstract: Background The emergence and spread of antimicrobial resistance (AMR) pose a major threat to the effective treatment and control of typhoid fever. The ongoing outbreak of extensively drug resistant (XDR) Salmonella Typhi (S. Typhi) in Pakistan has left azithromycin as the only remaining broadly efficacious oral antimicrobial for typhoid in South Asia. Ominously, azithromycin resistant S. Typhi organisms have been subsequently reported in Bangladesh, Pakistan, and Nepal. Methods Here, we aimed to understand the molecular basis of AMR in 66 S. Typhi isolated in a cross-sectional study performed in a suburb of Chandigarh in Northern India using whole genome sequencing (WGS) and phylogenetic analysis. Results We identified seven S. Typhi organisms with the R717Q mutation in the acrB gene that was recently found to confer resistance to azithromycin in Bangladesh. Six out of the azithromycin-resistant S. Typhi isolates also exhibited triple mutations in gyrA (S83F and D87N) and parC (S80I) genes and were resistant to ciprofloxacin. These contemporary ciprofloxacin/azithromycin-resistant isolates were phylogenetically distinct from each other and from those reported from Bangladesh, Pakistan, and Nepal. Conclusions The independent emergence of azithromycin resistant typhoid in Northern India reflects an emerging broader problem across South Asia and illustrates the urgent need for the introduction of typhoid conjugate vaccines (TCVs) in the region. Key points We identified ciprofloxacin/azithromycin-resistant Salmonella Typhi (S. Typhi) in Chandigarh in Northern India. The independent emergence of ciprofloxacin/azithromycin-resistant typhoid in Bangladesh, Pakistan, Nepal, and India and the continued spread of extensively-drug resistant (XDR) typhoid in Pakistan highlight the limitations of licensed oral treatments for typhoid fever in South Asia.

20 citations

Journal ArticleDOI
TL;DR: Genome analysis revealed a high metabolic versatility, the capability to cope with diverse stress agents, and the lack of several virulence factors found in pathogenic Aeromonas, which may explain the ability of this microorganism to live in an extremely polluted environment.
Abstract: Fil: Pavan, Maria Elisa. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Departamento de Quimica Biologica; Argentina

20 citations

Journal ArticleDOI
15 Apr 2020
TL;DR: A novel clone of Salmonella Reading has emerged that coincided with increased abundance in raw turkey products and two outbreaks of human illness in North America, suggesting that emergent strains with higher potential for niche success were likely vertically transferred and rapidly disseminated from a common source.
Abstract: Two separate human outbreaks of Salmonella enterica serotype Reading occurred between 2017 and 2019 in the United States and Canada, and both outbreaks were linked to the consumption of raw turkey products. In this study, a comprehensive genomic investigation was conducted to reconstruct the evolutionary history of S. Reading from turkeys and to determine the genomic context of outbreaks involving this infrequently isolated Salmonella serotype. A total of 988 isolates of U.S. origin were examined using whole-genome-based approaches, including current and historical isolates from humans, meat, and live food animals. Broadly, isolates clustered into three major clades, with one apparently highly adapted turkey clade. Within the turkey clade, isolates clustered into three subclades, including an “emergent” clade that contained only isolates dated 2016 or later, with many of the isolates from these outbreaks. Genomic differences were identified between emergent and other turkey subclades, suggesting that the apparent success of currently circulating subclades is, in part, attributable to plasmid acquisitions conferring antimicrobial resistance, gain of phage-like sequences with cargo virulence factors, and mutations in systems that may be involved in beta-glucuronidase activity and resistance towards colicins. U.S. and Canadian outbreak isolates were found interspersed throughout the emergent subclade and the other circulating subclade. The emergence of a novel S. Reading turkey subclade, coinciding temporally with expansion in commercial turkey production and with U.S. and Canadian human outbreaks, indicates that emergent strains with higher potential for niche success were likely vertically transferred and rapidly disseminated from a common source. IMPORTANCE Increasingly, outbreak investigations involving foodborne pathogens are difficult due to the interconnectedness of food animal production and distribution, and homogeneous nature of industry integration, necessitating high-resolution genomic investigations to determine their basis. Fortunately, surveillance and whole-genome sequencing, combined with the public availability of these data, enable comprehensive queries to determine underlying causes of such outbreaks. Utilizing this pipeline, it was determined that a novel clone of Salmonella Reading has emerged that coincided with increased abundance in raw turkey products and two outbreaks of human illness in North America. The rapid dissemination of this highly adapted and conserved clone indicates that it was likely obtained from a common source and rapidly disseminated across turkey production. Key genomic changes may have contributed to its apparent continued success in commercial turkeys and ability to cause illness in humans.

20 citations


Cites methods from "In Silico Detection and Typing of P..."

  • ...com/tseemann/abricate) with the PlasmidFinder database (75) and a minimum identity of 90% and minimum coverage length of 60%....

    [...]

Journal ArticleDOI
TL;DR: Phenotypic analysis of the fecal strains suggests the presence of traits related to bacterial commensalism in the authors' H22 strains and traits commonly found in uropathogenic E. coli in the H30-R strains.
Abstract: In 2006, we found healthy subjects carrying ST131 Escherichia coli in their intestinal microbiota consisting of two populations: a subdominant population of fluoroquinolone-resistant E. coli belonging to subclone H30 (H30-R or subclade C1), the current worldwide dominant ST131 subclone, and a dominant E. coli population composed of antibiotic-susceptible E. coli belonging to subclone H22 (clade B), the precursor of subclone H30. We sequenced the whole genome of fecal H22 strain S250, compared it to the genomes of ExPEC ST131 H30-Rx strain JJ1886 and commensal ST131 H41 strain SE15, sought the H22-H30 genomic differences in our fecal strains and assessed their phenotypic consequences. We detected 173 genes found in the Virulence Factor Database, of which 148 were shared by the three ST131 genomes, whereas some were genome-specific, notably those allowing determination of virotype (D for S250 and C for JJ1886). We found three sequences of the FimH site involved in adhesion: two in S250 and SE15 close and identical, respectively, to that previously reported to confer strong intestinal adhesion, and one in JJ1886, corresponding to that commonly present in uropathogenic E. coli. Among the genes involved in sugar metabolism, one encoding a gluconate kinase lacked in S250 and JJ1886. Although this gene was also absent in both our fecal H22 and H30-R strains, H22 strains showed a higher capacity to grow in minimal medium with gluconate. Among the genes involved in gluconate metabolism, only the ghrB gene differed between S250/H22 and JJ1886/H30-R strains, resulting in different gluconate reductases. Of the genes involved in biofilm formation, two were absent in the three genomes and one, fimB, in the JJ1886 genome. Our fecal H30-R strains lacking intact fimB displayed delayed biofilm formation relative to our fecal H22 strains. The H22 strains differed by subclade B type and plasmid content, whereas the H30-R strains were identical. Phenotypic analysis of our fecal strains based on observed genomic differences between S250 and JJ1886 strains suggests the presence of traits related to bacterial commensalism in our H22 strains and traits commonly found in uropathogenic E. coli in our H30-R strains.

20 citations


Cites methods from "In Silico Detection and Typing of P..."

  • ...The plasmid content of H22 strain S250 was determined using the PlasmidFinder system [32] and the FII, FIA, and FIB formula of the detected IncF plasmid by PCRbased replicon typing (http://pubmlst....

    [...]

Journal ArticleDOI
01 Jan 2020
TL;DR: Investigation of molecular epidemiology of bla IMP-4 in Queensland, Australia found that persistence of the carbapenemase gene is driven by both presence on a common plasmid and clonal spread of certain E. hormaechei lineages.
Abstract: Carbapenemase-producing Enterobacteriaceae (CPE) are an increasingly common cause of healthcare-associated infections and may occasionally be identified in patients without extensive healthcare exposure. bla IMP-4 is the most frequently detected carbapenemase gene in Enterobacteriaceae within Australia, but little is known about the mechanisms behind its persistence. Here we used whole genome sequencing (WGS) to investigate the molecular epidemiology of bla IMP-4 in Queensland, Australia. In total, 107 CPE were collected between 2014 and 2017 and sent for WGS on an Illumina NextSeq500. Resistance genes and plasmid types were detected using a combination of read mapping and nucleotide comparison of de novo assemblies. Six isolates were additionally sequenced using Oxford Nanopore MinION to generate long-reads and fully characterize the context of the bla IMP-4 gene. Of 107 CPE, 93 carried the bla IMP-4 gene; 74/107 also carried an IncHI2 plasmid, suggesting carriage of the bla IMP-4 gene on an IncHI2 plasmid. Comparison of these isolates to a previously characterized IncHI2 plasmid pMS7884A (isolated from an Enterobacter hormaechei strain in Brisbane) suggested that all isolates carried a similar plasmid. Five of six representative isolates sequenced using Nanopore long-read technology carried IncHI2 plasmids harbouring the bla IMP-4 gene. While the vast majority of isolates represented E. hormaechei , several other species were also found to carry the IncHI2 plasmid, including Klebsiella species, Escherichia coli and Citrobacter species. Several clonal groups of E. hormaechei were also identified, suggesting that persistence of bla IMP-4 is driven by both presence on a common plasmid and clonal spread of certain E. hormaechei lineages.

20 citations

References
More filters
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....

    [...]

Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....

    [...]

Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmid in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully FIG 1 Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family....

    [...]

  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....

    [...]

  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....

    [...]

  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....

    [...]

  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....

    [...]

Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches LIMS functionality of the software enables linkage to and organisation of laboratory samples The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus The BIGSDB source code and documentation are available at http://pubmlstorg/software/database/bigsdb/ Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....

    [...]