Journal ArticleDOI

In Silico Detection and Typing of Plasmids using PlasmidFinder and Plasmid Multilocus Sequence Typing

TL;DR: Two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae are designed and developed.
Abstract: In the work presented here, we designed and developed two easy-to-use Web tools for in silico detection and characterization of whole-genome sequence (WGS) and whole-plasmid sequence data from members of the family Enterobacteriaceae. These tools will facilitate bacterial typing based on draft genomes of multidrug-resistant Enterobacteriaceae species by the rapid detection of known plasmid types. Replicon sequences from 559 fully sequenced plasmids associated with the family Enterobacteriaceae in the NCBI nucleotide database were collected to build a consensus database for integration into a Web tool called PlasmidFinder that can be used for replicon sequence analysis of raw, contig group, or completely assembled and closed plasmid sequencing data. The PlasmidFinder database currently consists of 116 replicon sequences that match, with at least 80% nucleotide identity, all replicon sequences identified in the 559 fully sequenced plasmids. For plasmid multilocus sequence typing (pMLST) analysis, a database that is updated weekly was generated from www.pubmlst.org and integrated into a Web tool called pMLST. Both databases were evaluated using draft genomes from a collection of Salmonella enterica serovar Typhimurium isolates. PlasmidFinder identified a total of 103 replicons and between zero and five different plasmid replicons within each of 49 S. Typhimurium draft genomes tested. The pMLST Web tool was able to subtype genomic sequencing data of plasmids, revealing both known plasmid sequence types (STs) and new alleles and ST variants. In conclusion, testing of the two Web tools using both fully assembled plasmid sequences and WGS-generated draft genomes showed them to be able to detect a broad variety of plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.
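
The 80% nucleotide identity figure in the abstract can be illustrated with a small filtering sketch. This is not the PlasmidFinder implementation: the `Hit` record, the `filter_hits` function, and the use of 80% as a reporting cut-off are all assumptions made for illustration, and the hits are assumed to come from some upstream alignment step that is not shown.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class Hit:
    """One alignment hit against a replicon database (hypothetical record)."""
    replicon: str    # name of the matched replicon, e.g. "IncFII"
    identity: float  # percent nucleotide identity of the alignment


# Assumed cut-off, taken from the 80% figure quoted in the abstract.
MIN_IDENTITY = 80.0


def filter_hits(hits: Iterable[Hit], min_identity: float = MIN_IDENTITY) -> List[Hit]:
    """Keep the best hit per replicon whose identity meets the threshold."""
    best: Dict[str, Hit] = {}
    for hit in hits:
        if hit.identity < min_identity:
            continue  # below the identity threshold: not reported
        current = best.get(hit.replicon)
        if current is None or hit.identity > current.identity:
            best[hit.replicon] = hit
    # Sort by replicon name so the output order is deterministic.
    return sorted(best.values(), key=lambda h: h.replicon)
```

Keeping only the highest-identity hit per replicon is a design choice of this sketch; a real tool would likely also consider alignment coverage.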


Citations
Journal ArticleDOI
TL;DR: It is hypothesized that the S. Infantis population has evolved in three separate lineages, with one more successfully emerging lineage, which probably shaped the population structure.
Abstract: Salmonella Infantis (S. Infantis) is one of the most frequent Salmonella serovars isolated from human cases of salmonellosis and the most detected serovar from animal and food sources in Europe. The serovar is commonly associated with poultry and there is increasing concern over multidrug resistant clones spreading worldwide, as the dominating clones are characterized by presence of large plasmids carrying multiple resistance genes. Increasing the knowledge of the S. Infantis population and evolution is important for understanding and preventing further spread. In this study, we analysed a collection of strains representing different decades, sources and geographic locations. We analysed the population structure and the accessory genome, in particular we identified prophages with a view to understand the role of prophages in relation to the evolution of this serovar. We sequenced a global collection of 100 S. Infantis strains. A core-genome SNP analysis separated five strains in e-Burst Group (eBG) 297 with a long branch. The remaining strains, all in eBG31, were divided into three lineages that were estimated to have separated approximately 150 years ago. One lineage contained the vast majority of strains. In five of six clusters, no obvious correlation with source or geographical locations was seen. However, one cluster contained mostly strains from human and avian sources, indicating a clone with preference for these sources. The majority of strains within this cluster harboured a pESI-like plasmid with multiple resistance genes. Another lineage contained three genetic clusters with more rarely isolated strains of mainly animal origin, possibly less sampled or less infectious clones. Conserved prophages were identified in all strains, likely representing bacteriophages which integrated into the chromosome of a common ancestor to S. Infantis. 
We also saw that some prophages were specific to clusters and were probably introduced when the clusters were formed. This study analysed a global S. Infantis population and described its genetic structure. We hypothesize that the population has evolved in three separate lineages, with one more successfully emerging lineage. We furthermore detected conserved prophages present in the entire population and cluster specific prophages, which probably shaped the population structure.

41 citations

Journal ArticleDOI
TL;DR: In this article, the authors conducted a cross-sectional study among poultry-workers, chickens, and poultry farm/live bird market (LBM) environments in Abuja, Nigeria to determine the prevalence and genetic relatedness among multidrug-resistant Escherichia coli (E. coli) among poultry workers.
Abstract: Inappropriate use of antimicrobial agents in animal production has led to the development of antimicrobial resistance (AMR) in foodborne pathogens. Transmission of AMR foodborne pathogens from reservoirs, particularly chickens to the human population does occur. Recently, we reported that occupational exposure was a risk factor for multidrug-resistant (MDR) Escherichia coli (E. coli) among poultry-workers. Here we determined the prevalence and genetic relatedness among MDR E. coli isolated from poultry-workers, chickens, and poultry environments in Abuja, Nigeria. This study was conducted to address the gaps identified by the Nigerian AMR situation analysis. We conducted a cross-sectional study among poultry-workers, chickens, and poultry farm/live bird market (LBM) environments. The isolates were tested phenotypically for their antimicrobial susceptibility profiles, genotypically characterized using whole-genome sequencing (WGS) and in silico multilocus sequence types (MLST). We conducted a phylogenetic single nucleotide polymorphism (SNPs) analysis to determine relatedness and clonality among the isolates. A total of 115 (26.8%) out of 429 samples were positive for E. coli. Of these, 110 isolates were viable for phenotypic and genotypic characterization. The selection comprised 47 (42.7%) isolates from poultry-workers, 36 (32.7%) from chickens, and 27 (24.5%) from poultry-farm or LBM environments. Overall, 101 (91.8%) of the isolates were MDR conferring resistance to at least three drug classes. High frequency of resistance was observed for tetracycline (n = 102; 92.7%), trimethoprim/sulfamethoxazole (n = 93; 84.5%), streptomycin (n = 87; 79.1%) and ampicillin (n = 88; 80%). Two plasmid-mediated colistin genes—mcr-1.1 harboured on IncX4 plasmids were detected in environmental isolates. The most prevalent sequence types (ST) were ST-155 (n = 8), ST-48 (n = 8) and ST-10 (n = 6). 
Two isolates of human and environmental sources with a SNP difference of 6161 originating from the same farm shared a novel ST. The isolates had similar AMR genes and plasmid replicons. MDR E. coli isolates were prevalent amongst poultry-workers, poultry, and the poultry farm/LBM environment. The emergence of MDR E. coli with novel ST in two isolates may be plasmid-mediated. Competent authorities should enforce AMR regulations to ensure prudent use of antimicrobials to limit the risk of transmission along the food chain.

41 citations

Journal ArticleDOI
TL;DR: This study suggests that it was the dissemination of the type 4 Tn1546-like transposon and plasmid via horizontal transfer to multiple populations of E. faecium, followed by clonal spread of new VREfm clones, that contributed to the increase in and diversity of VREfm in Danish hospitals.
Abstract: Objectives From 2012 to 2014, there has been a huge increase in vancomycin-resistant (vanA) Enterococcus faecium (VREfm) in Copenhagen, Denmark, with 602 patients infected or colonized with VREfm in 2014 compared with just 22 in 2012. The objective of this study was to describe the genetic epidemiology of VREfm to assess the contribution of clonal spread and horizontal transfer of the vanA transposon (Tn1546) and plasmid in the dissemination of VREfm in hospitals. Methods VREfm from Copenhagen, Denmark (2012-14) were whole-genome sequenced. The clonal structure was determined and the structure of Tn1546-like transposons was characterized. One VREfm isolate belonging to the largest clonal group was sequenced using long-read technology to close a 37 kb vanA plasmid. Results Phylogeny revealed a polyclonal structure where 495 VREfm isolates were divided into 13 main groups and 7 small groups. The majority of the isolates were located in three groups (n = 44, 100 and 218) and clonal spread of VREfm between wards and hospitals was identified. Five Tn1546-like transposon types were identified. A dominant truncated transposon (type 4, 92%) was spread across all but one VREfm group. The closed vanA plasmid was highly covered by reads from isolates containing the type 4 transposon. Conclusions This study suggests that it was the dissemination of the type 4 Tn1546-like transposon and plasmid via horizontal transfer to multiple populations of E. faecium, followed by clonal spread of new VREfm clones, that contributed to the increase in and diversity of VREfm in Danish hospitals.

41 citations

Journal ArticleDOI
TL;DR: The population biology of B. ubonensis is examined, and a high degree of heterogeneity among the lipopolysaccharide O-antigen cluster loci is uncovered, with at least 35 different variants identified.
Abstract: The bacterium Burkholderia ubonensis is commonly co-isolated from environmental specimens harbouring the melioidosis pathogen, Burkholderia pseudomallei. B. ubonensis has been reported in northern Australia and Thailand but not North America, suggesting similar geographic distribution to B. pseudomallei. Unlike most other Burkholderia cepacia complex (Bcc) species, B. ubonensis is considered non-pathogenic, although its virulence potential has not been tested. Antibiotic resistance in B. ubonensis, particularly towards drugs used to treat the most severe B. pseudomallei infections, has also been poorly characterised. This study examined the population biology of B. ubonensis, and includes the first reported isolates from the Caribbean. Phylogenomic analysis of 264 B. ubonensis genomes identified distinct clades that corresponded with geographic origin, similar to B. pseudomallei. A small proportion (4%) of strains lacked the 920kb chromosome III replicon, with discordance of presence/absence amongst genetically highly related strains, demonstrating that the third chromosome of B. ubonensis, like other Bcc species, probably encodes for a nonessential pC3 megaplasmid. Multilocus sequence typing using the B. pseudomallei scheme revealed that one-third of strains lack the "housekeeping" narK locus. In comparison, all strains could be genotyped using the Bcc scheme. Several strains possessed high-level meropenem resistance (≥32 μg/mL), a concern due to potential transmission of this phenotype to B. pseudomallei. In silico analysis uncovered a high degree of heterogeneity among the lipopolysaccharide O-antigen cluster loci, with at least 35 different variants identified. Finally, we show that Asian B. ubonensis isolate RF23-BP41 is avirulent in the BALB/c mouse model via a subcutaneous route of infection. Our results provide several new insights into the biology of this understudied species.

41 citations

Journal ArticleDOI
TL;DR: The isolation and comparative genomics of carbapenemase-producing Citrobacter isolates from river sediment in China are reported, finding diverse conjugative blaKPC-2 plasmids in environmental Citrobacter isolates, which raises the possibility of dissemination of antimicrobial resistance to clinical isolates.

41 citations

References
Journal ArticleDOI
TL;DR: A web server providing a convenient way of identifying acquired antimicrobial resistance genes in completely sequenced isolates was created, and the method was evaluated on WGS chromosomes and plasmids of 30 isolates.
Abstract: Objectives Identification of antimicrobial resistance genes is important for understanding the underlying mechanisms and the epidemiology of antimicrobial resistance. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available in routine diagnostic laboratories and is anticipated to substitute traditional methods for resistance gene identification. Thus, the current challenge is to extract the relevant information from the large amount of generated data.

3,956 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...To extract the relevant information from the large amount of data generated, a Web-based tool, ResFinder, for the identification of acquired or intrinsically present antimicrobial resistance genes in whole-genome data was recently developed (15)....

    [...]

Journal ArticleDOI
TL;DR: NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints.
Abstract: NCBI's Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

2,934 citations


"In Silico Detection and Typing of P..." refers background in this paper

  • ...In particular, the replicase proteins showing the pfam02387 or pfam01051 conserved domains were assigned to the FII and FIB groups, respectively (31)....

    [...]

Journal ArticleDOI
TL;DR: Results indicated that the inc/rep PCR method demonstrates high specificity and sensitivity in detecting replicons on reference plasmids and also revealed the presence of recurrent and common plasmids in epidemiologically unrelated Salmonella isolates of different serotypes.

2,163 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...A collection of 24 previously characterized and fully [...] FIG 1. Numbers of fully sequenced plasmids (y axis) classified into incompatibility groups occurring in the different bacterial species of the Enterobacteriaceae family....

    [...]

  • ...Since 2005, a PCR-based replicon typing (PBRT) scheme has been available that targets in multiplex PCRs the replicons of the major plasmid families occurring in members of the family Enterobacteriaceae (2)....

    [...]

  • ...Here, we present two free, easy-to-use Web tools, PlasmidFinder and pMLST, to analyze and classify plasmids from bacterial species of the family Enterobacteriaceae....

    [...]

  • ...Here, we describe the design of two new easy-to-use Web tools useful for the rapid identification of plasmids in Enterobacteriaceae species that are of interest for epidemiological and clinical microbiology investigations of the plasmid-associated spread of antimicrobial resistance....

    [...]

  • ...This method was initially developed to detect the replicons of plasmids belonging to the 18 major incompatibility (Inc) groups of Enterobacteriaceae species (3)....

    [...]

Journal ArticleDOI
TL;DR: The Bacterial Isolate Genome Sequence Database (BIGSDB) represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.
Abstract: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms. These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner. The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens. The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences. These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses. Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches. LIMS functionality of the software enables linkage to and organisation of laboratory samples. The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database. Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus. The BIGSDB source code and documentation are available at http://pubmlst.org/software/database/bigsdb/. Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies. BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.

1,943 citations

Journal ArticleDOI
TL;DR: A Web-based method for MLST of 66 bacterial species based on whole-genome sequencing data that enables investigators to determine the sequence types of their isolates on the basis of WGS data.
Abstract: Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the “gold standard” of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.
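
The final step described in this abstract, deriving a sequence type from the combination of alleles, amounts to a profile lookup. The sketch below illustrates only that step. The locus names, the profile table, and the `sequence_type` function are hypothetical and do not come from any real MLST scheme, and the BLAST-based allele ranking that precedes this step is not shown.

```python
from typing import Dict, Optional, Tuple

# Hypothetical scheme: locus names in a fixed order (not a real MLST scheme).
LOCI: Tuple[str, ...] = ("locusA", "locusB", "locusC")

# Hypothetical profile table: allele numbers in LOCI order -> sequence type.
PROFILES: Dict[Tuple[int, ...], int] = {
    (1, 1, 1): 1,
    (1, 2, 1): 2,
    (3, 2, 4): 3,
}


def sequence_type(alleles: Dict[str, int]) -> Optional[int]:
    """Return the ST for a complete allele profile, or None if it is unknown."""
    key = []
    for locus in LOCI:
        allele = alleles.get(locus)
        if allele is None:
            return None  # a missing locus means no ST can be assigned
        key.append(allele)
    # An unlisted combination of alleles also yields None (a possible new ST).
    return PROFILES.get(tuple(key))
```

Returning `None` for both incomplete and unlisted profiles keeps the sketch short; a fuller version would distinguish the two cases, since an unlisted combination may represent a new ST variant.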

1,620 citations


"In Silico Detection and Typing of P..." refers methods in this paper

  • ...If raw sequence reads are uploaded, they are first assembled (after the sequencing platform is given by the user) as described previously (16)....

    [...]