scispace - formally typeset
Search or ask a question

Showing papers by "Derrick W. Crook published in 2015"


Journal ArticleDOI
TL;DR: A broad catalogue of genetic mutations enable data from whole-genome sequencing to be used clinically to predict drug resistance, drug susceptibility, or to identify drug phenotypes that cannot yet be genetically predicted.
Abstract: Summary Background Diagnosing drug-resistance remains an obstacle to the elimination of tuberculosis. Phenotypic drug-susceptibility testing is slow and expensive, and commercial genotypic assays screen only common resistance-determining mutations. We used whole-genome sequencing to characterise common and rare mutations predicting drug resistance, or consistency with susceptibility, for all first-line and second-line drugs for tuberculosis. Methods Between Sept 1, 2010, and Dec 1, 2013, we sequenced a training set of 2099 Mycobacterium tuberculosis genomes. For 23 candidate genes identified from the drug-resistance scientific literature, we algorithmically characterised genetic mutations as not conferring resistance (benign), resistance determinants, or uncharacterised. We then assessed the ability of these characterisations to predict phenotypic drug-susceptibility testing for an independent validation set of 1552 genomes. We sought mutations under similar selection pressure to those characterised as resistance determinants outside candidate genes to account for residual phenotypic resistance. Findings We characterised 120 training-set mutations as resistance determining, and 772 as benign. With these mutations, we could predict 89·2% of the validation-set phenotypes with a mean 92·3% sensitivity (95% CI 90·7–93·7) and 98·4% specificity (98·1–98·7). 10·8% of validation-set phenotypes could not be predicted because uncharacterised mutations were present. With an in-silico comparison, characterised resistance determinants had higher sensitivity than the mutations from three line-probe assays (85·1% vs 81·6%). No additional resistance determinants were identified among mutations under selection pressure in non-candidate genes. Interpretation A broad catalogue of genetic mutations enable data from whole-genome sequencing to be used clinically to predict drug resistance, drug susceptibility, or to identify drug phenotypes that cannot yet be genetically predicted. This approach could be integrated into routine diagnostic workflows, phasing out phenotypic drug-susceptibility testing while reporting drug resistance early. Funding Wellcome Trust, National Institute of Health Research, Medical Research Council, and the European Union.

511 citations


Journal ArticleDOI
TL;DR: De Bruijn graph representation of bacterial diversity can be used to identify species and resistance profiles of clinical isolates and is implemented in a software package that takes raw sequence data as input, and generates a clinician-friendly report within 3 minutes on a laptop.
Abstract: The rise of antibiotic-resistant bacteria has led to an urgent need for rapid detection of drug resistance in clinical samples, and improvements in global surveillance. Here we show how de Bruijn graph representation of bacterial diversity can be used to identify species and resistance profiles of clinical isolates. We implement this method for Staphylococcus aureus and Mycobacterium tuberculosis in a software package ('Mykrobe predictor') that takes raw sequence data as input, and generates a clinician-friendly report within 3 minutes on a laptop. For S. aureus, the error rates of our method are comparable to gold-standard phenotypic methods, with sensitivity/specificity of 99.1%/99.6% across 12 antibiotics (using an independent validation set, n=470). For M. tuberculosis, our method predicts resistance with sensitivity/specificity of 82.6%/98.5% (independent validation set, n=1,609); sensitivity is lower here, probably because of limited understanding of the underlying genetic mechanisms. We give evidence that minor alleles improve detection of extremely drug-resistant strains, and demonstrate feasibility of the use of emerging single-molecule nanopore sequencing techniques for these purposes.

453 citations


Journal ArticleDOI
TL;DR: This work studied two sequenced collections of the major human pathogen Staphylococcus aureus and found an unexpected inverse correlation between bacterial toxicity and disease severity, and suggests that within-host fitness differences between high- and low-toxicity isolates in human serum is a contributing factor.
Abstract: Bacterial virulence is a multifaceted trait where the interactions between pathogen and host factors affect the severity and outcome of the infection. Toxin secretion is central to the biology of many bacterial pathogens and is widely accepted as playing a crucial role in disease pathology. To understand the relationship between toxicity and bacterial virulence in greater depth, we studied two sequenced collections of the major human pathogen Staphylococcus aureus and found an unexpected inverse correlation between bacterial toxicity and disease severity. By applying a functional genomics approach, we identified several novel toxicity-affecting loci responsible for the wide range in toxic phenotypes observed within these collections. To understand the apparent higher propensity of low toxicity isolates to cause bacteraemia, we performed several functional assays, and our findings suggest that within-host fitness differences between high- and low-toxicity isolates in human serum is a contributing factor. As invasive infections, such as bacteraemia, limit the opportunities for onward transmission, highly toxic strains could gain an additional between-host fitness advantage, potentially contributing to the maintenance of toxicity at the population level. Our results clearly demonstrate how evolutionary trade-offs between toxicity, relative fitness, and transmissibility are critical for understanding the multifaceted nature of bacterial virulence.

160 citations


Posted Content
TL;DR: This paper demonstrates its ability to detect genes and genetic variants underlying resistance to 17 antimicrobials in 3,144 isolates from four taxonomically diverse clonal and recombining bacteria and reveals a candidate association between the outer membrane porin nmpC and cefazolin resistance in E. coli.
Abstract: Bacteria pose unique challenges for genome-wide association studies (GWAS) because of strong structuring into distinct strains and substantial linkage disequilibrium across the genome. While methods developed for human studies can correct for strain structure, this risks considerable loss- of-power because genetic differences between strains often contribute substantial phenotypic variability. Here we propose a new method that captures lineage-level associations even when locus-specific associations cannot be fine-mapped. We demonstrate its ability to detect genes and genetic variants underlying resistance to 17 antimicrobials in 3144 isolates from four taxonomically diverse clonal and recombining bacteria: Mycobacterium tuberculosis, Staphylococcus aureus, Escherichia coli and Klebsiella pneumoniae. Strong selection, recombination and penetrance confer high power to recover known antimicrobial resistance mechanisms, and reveal a candidate association between the outer membrane porin nmpC and cefazolin resistance in E. coli. Hence our method pinpoints locus-specific effects where possible, and boosts power by detecting lineage-level differences when fine-mapping is intractable.

149 citations


Journal ArticleDOI
TL;DR: Using WGS-based analysis of clinical isolates and plasmid transformants, it is demonstrated the unexpected dispersal of blaKPC to many non-ST258 lineages in a hospital through spread of at least two novel blaCPC plasmids.
Abstract: The global emergence of Klebsiella pneumoniae carbapenemase-producing K. pneumoniae (KPC-Kp) multilocus sequence type ST258 is widely recognized. Less is known about the molecular and epidemiological details of non-ST258 K. pneumoniae in the setting of an outbreak mediated by an endemic plasmid. We describe the interplay of blaKPC plasmids and K. pneumoniae strains and their relationship to the location of acquisition in a U.S. health care institution. Whole-genome sequencing (WGS) analysis was applied to KPC-Kp clinical isolates collected from a single institution over 5 years following the introduction of blaKPC in August 2007, as well as two plasmid transformants. KPC-Kp from 37 patients yielded 16 distinct sequence types (STs). Two novel conjugative blaKPC plasmids (pKPC_UVA01 and pKPC_UVA02), carried by the hospital index case, accounted for the presence of blaKPC in 21/37 (57%) subsequent cases. Thirteen (35%) isolates represented an emergent lineage, ST941, which contained pKPC_UVA01 in 5/13 (38%) and pKPC_UVA02 in 6/13 (46%) cases. Seven (19%) isolates were the epidemic KPC-Kp strain, ST258, mostly imported from elsewhere and not carrying pKPC_UVA01 or pKPC_UVA02. Using WGS-based analysis of clinical isolates and plasmid transformants, we demonstrate the unexpected dispersal of blaKPC to many non-ST258 lineages in a hospital through spread of at least two novel blaKPC plasmids. In contrast, ST258 KPC-Kp was imported into the institution on numerous occasions, with other blaKPC plasmid vectors and without sustained transmission. Instead, a newly recognized KPC-Kp strain, ST941, became associated with both novel blaKPC plasmids and spread locally, making it a future candidate for clinical persistence and dissemination.

124 citations



Journal ArticleDOI
TL;DR: The substantial diversity among intestinally carried ESBL-positive E. coli bacteria suggests that fecal surveillance, particularly if based on single-colony subcultures, will likely underestimate transmission events, especially in high-prevalence settings.
Abstract: Studies of the transmission epidemiology of antimicrobial-resistant Escherichia coli, such as strains harboring extended-spectrum beta-lactamase (ESBL) genes, frequently use selective culture of rectal surveillance swabs to identify isolates for molecular epidemiological investigation Typically, only single colonies are evaluated, which risks underestimating species diversity and transmission events We sequenced the genomes of 16 E coli colonies from each of eight fecal samples (n = 127 genomes; one failure), taken from different individuals in Cambodia, a region of high ESBL-producing E coli prevalence Sequence data were used to characterize both the core chromosomal diversity of E coli isolates and their resistance/virulence gene content as a proxy measure of accessory genome diversity The 127 E coli genomes represented 31 distinct sequence types (STs) Seven (88%) of eight subjects carried ESBL-positive isolates, all containing blaCTX-M variants Diversity was substantial, with a median of four STs/individual (range, 1 to 10) and wide genetic divergence at the nucleotide level within some STs In 2/8 (25%) individuals, the same blaCTX-M variant occurred in different clones, and/or different blaCTX-M variants occurred in the same clone Patterns of other resistance genes and common virulence factors, representing differences in the accessory genome, were also diverse within and between clones The substantial diversity among intestinally carried ESBL-positive E coli bacteria suggests that fecal surveillance, particularly if based on single-colony subcultures, will likely underestimate transmission events, especially in high-prevalence settings

78 citations


Journal ArticleDOI
TL;DR: In this paper, a low-cost and reliable method of DNA extraction from as little as 1 ml of early positive mycobacterial growth indicator tube (MGIT) cultures was developed.
Abstract: We developed a low-cost and reliable method of DNA extraction from as little as 1 ml of early positive mycobacterial growth indicator tube (MGIT) cultures that is suitable for whole-genome sequencing to identify mycobacterial species and predict antibiotic resistance in clinical samples. The DNA extraction method is based on ethanol precipitation supplemented by pretreatment steps with a MolYsis kit or saline wash for the removal of human DNA and a final DNA cleanup step with solid-phase reversible immobilization beads. The protocol yielded ≥0.2 ng/μl of DNA for 90% (MolYsis kit) and 83% (saline wash) of positive MGIT cultures. A total of 144 (94%) of the 154 samples sequenced on the MiSeq platform (Illumina) achieved the target of 1 million reads, with 90% coverage achieved. The DNA extraction protocol, therefore, will facilitate fast and accurate identification of mycobacterial species and resistance using a range of bioinformatics tools.

72 citations


Journal ArticleDOI
TL;DR: The high prevalence of paediatric colonization with ESBLE in Laos, one of the highest reported in Asia, is probably the result of inappropriate antibiotic use.
Abstract: Results: Ninety-two children (23%) were colonized with ESBLE, mainly Escherichia coli carrying blaCTX-M and Klebsiella pneumoniae carrying blaSHV or blaCTX-M, which were frequently resistant to multiple antibiotic classes. Although residence in Vientiane Capital, foreign travel, higher maternal level of education, antibiotic use in the preceding 3 months and attending a childcare facility with a ‘good’ level of hygiene were all associated with ESBLE colonization on univariable analysis, a significant association remained only for antibiotic use when a stepwise approach was used with a multivariate random-effects model. WGS analysis suggested transmission in both childcare facilities and community settings. Conclusions: The high prevalence of paediatric colonization with ESBLE in Laos, one of the highest reported in Asia, is probably the result of inappropriate antibiotic use. Paediatric colonization with CPE was not identified in this study, but it is important to continue to monitor the spread of antibiotic-resistant Enterobacteriaceae in Laos.

63 citations


Journal ArticleDOI
TL;DR: Using whole genome sequencing, it is confirmed ribotype 244 is from the same C. difficile clade as the epidemic ribotype 027, which produces toxins A, B, and binary toxin, however it is fluoroquinolone-susceptible and thousands of single nucleotide variants distinct from ribotypes 027.
Abstract: We describe an Australia-wide Clostridium difficile outbreak in 2011 and 2012 involving the previously uncommon ribotype 244. In Western Australia, 14 of 25 cases were community-associated, 11 were detected in patients younger than 65 years, 14 presented to emergency/outpatient departments, and 14 to non-tertiary/community hospitals. Using whole genome sequencing, we confirm ribotype 244 is from the same C. difficile clade as the epidemic ribotype 027. Like ribotype 027, it produces toxins A, B, and binary toxin, however it is fluoroquinolone-susceptible and thousands of single nucleotide variants distinct from ribotype 027. Fifteen outbreak isolates from across Australia were sequenced. Despite their geographic separation, all were genetically highly related without evidence of geographic clustering, consistent with a point source, for example affecting the national food chain. Comparison with reference laboratory strains revealed the outbreak clone shared a common ancestor with isolates from the United States and United Kingdom (UK). A strain obtained in the UK was phylogenetically related to our outbreak. Follow-up of that case revealed the patient had recently returned from Australia. Our data demonstrate new C. difficile strains are an on-going threat, with potential for rapid spread. Active surveillance is needed to identify and control emerging lineages.

61 citations


Journal ArticleDOI
TL;DR: With whole-genome sequencing, the hypothesis that a cluster of 16 E. cloacae bacteraemia cases in a Nepali neonatal unit represented a single clonal outbreak was tested, using a wider set of epidemiologically unrelated clinical E. cloakae isolates for comparison.
Abstract: There are limited data on Enterobacter cloacae outbreaks and fewer describing these in association with NDM-1. With whole-genome sequencing, we tested the hypothesis that a cluster of 16 E. cloacae bacteraemia cases in a Nepali neonatal unit represented a single clonal outbreak, using a wider set of epidemiologically unrelated clinical E. cloacae isolates for comparison.Forty-three isolates were analysed, including 23 E. cloacae and 3 Citrobacter sp. isolates obtained from blood cultures from 16 neonates over a 3 month period. These were compared with two contemporaneous community-associated drug-resistant isolates from adults, a unit soap dispenser isolate and a set of historical invasive isolates (n=14) from the same geographical locality.There were two clear neonatal outbreaks and one isolated case in the unit. One outbreak was associated with an NDM-1 plasmid also identified in a historical community-associated strain. The smaller, second outbreak was likely associated with a contaminated soap dispenser. The two community-acquired adult cases and three sets of historical hospital-associated neonatal isolates represented four additional genetic clusters.E. cloacae infections in this context represent several different transmission networks, operating at the community/hospital and host strain/plasmid levels. Wide sampling frames and high-resolution typing methods are needed to describe the complex molecular epidemiology of E. cloacae outbreaks, which is not appropriately reflected by routine susceptibility phenotypes. Soap dispensers may represent a reservoir for E. cloacae and bacterial strains and plasmids may persist in hospitals and in the community for long periods, sporadically being involved in outbreaks of disease.

Journal ArticleDOI
TL;DR: creation of a MRSA database allows clustering of isolates based on single nucleotide polymorphism (SNP) calling and has improved the understanding of MRSA transmission.
Abstract: Typing of meticillin resistant Staphylococcus aureus (MRSA) by whole genome sequencing (WGS) is performed routinely in Copenhagen since January 2013. We describe the relatedness, based on WGS data and epidemiological data, of 341 MRSA isolates. These comprised all MRSA (n = 300) identified in Copenhagen in the first five months of 2013. Moreover, because MRSA of staphylococcal protein A (spa)-type 304 (t304), sequence type (ST) 6 had been associated with a continuous neonatal ward outbreak in Copenhagen starting in 2011, 41 t304 isolates collected in the city between 2010 and 2012 were also included. Isolates from 2013 found to be of t304, ST6 (n=14) were compared to the 41 earlier isolates. In the study, isolates of clonal complex (CC) 22 were examined in detail, as this CC has been shown to include the hospital-acquired epidemic MRSA (EMRSA-15) clone. Finally, all MRSA ST80 were also further analysed, as representatives of an important community-acquired MRSA in Europe. Overall the analysis identified 85 spa-types and 35 STs from 17 CCs. WGS confirmed the relatedness of epidemiologically linked t304 neonatal outbreak isolates. Several non-outbreak related patients had isolates closely related to the neonatal isolates suggesting unrecognised community chains of transmission and insufficient epidemiological data. Only four CC22 isolates were related to EMRSA-15. No community spread was observed among the 13 ST80 isolates. WGS successfully replaced conventional typing and added information to epidemiological surveillance. Creation of a MRSA database allows clustering of isolates based on single nucleotide polymorphism (SNP) calling and has improved our understanding of MRSA transmission.

Posted ContentDOI
06 Nov 2015-bioRxiv
TL;DR: This study of the largest worldwide collection of sequenced ST131 E. coli isolates to date demonstrates that clonal expansion of two previously recognized antimicrobial-resistant clades started around 25 years ago, consistent with the widespread introduction of fluoroquinolones and extended-spectrum cephalosporins in clinical medicine.
Abstract: Background Escherichia coli sequence type 131 (ST131) has emerged globally as the most predominant lineage within this clinically important species, and its association with fluoroquinolone and extended-spectrum cephalosporin resistance impacts significantly on treatment. The evolutionary histories of this lineage, and of important antimicrobial resistance elements within it, remain unclearly defined. Results This study of the largest worldwide collection (n = 215) of sequenced ST131 E. coli isolates to date demonstrates that clonal expansion of two previously recognized antimicrobial-resistant clades, C1/H30R and C2/H30Rx, started around 25 years ago, consistent with the widespread introduction of fluoroquinolones and extended-spectrum cephalosporins in clinical medicine. These two clades appear to have emerged in the United States, with the expansion of the C2/H30Rx clade driven by the acquisition of a blaCTX-M-15-containing IncFII-like plasmid that has subsequently undergone extensive rearrangement. Several other evolutionary processes influencing the trajectory of this drug-resistant lineage are described, including sporadic acquisitions of CTX-M resistance plasmids, and chromosomal integration of blaCTX-M within sub-clusters followed by vertical evolution. These processes are also occurring for another family of CTX-M gene variants more recently observed amongst ST131, the blaCTX-M-14/14-like group. Conclusions The complexity of the evolutionary history of ST131 has important implications for antimicrobial resistance surveillance, epidemiological analysis, and control of emerging clinical lineages of E. coli. These data also highlight the global imperative to reduce specific antibiotic selection pressures, and demonstrate the important and varied roles played by plasmids and other mobile genetic elements in the perpetuation of antimicrobial resistance within lineages.

Posted ContentDOI
26 Apr 2015-bioRxiv
TL;DR: A de Bruijn graph comparison of clinical isolate and curated knowledge-base is used to identify species and predict resistance profile, including minor populations, and feasibility of an emerging single-molecule sequencing technique is demonstrated.
Abstract: Rapid and accurate detection of antibiotic resistance in pathogens is an urgent need, affecting both patient care and population-scale control. Microbial genome sequencing promises much, but many barriers exist to its routine deployment. Here, we address these challenges, using a de Bruijn graph comparison of clinical isolate and curated knowledge-base to identify species and predict resistance profile, including minor populations. This is implemented in a package, Mykrobe predictor, for S. aureus and M. tuberculosis, running in under three minutes on a laptop from raw data. For S. aureus, we train and validate in 495/471 samples respectively, finding error rates comparable to gold-standard phenotypic methods, with sensitivity/specificity of 99.3%/99.5% across 12 drugs. For M. tuberculosis, we identify species and predict resistance with specificity of 98.5% (training/validating on 1920/1609 samples). Sensitivity of 82.6% is limited by current understanding of genetic mechanisms. We also show that analysis of minor populations increases power to detect phenotypic resistance in second-line drugs without appreciable loss of specificity. Finally, we demonstrate feasibility of an emerging single-molecule sequencing technique.

Journal ArticleDOI
01 Jan 2015-Medicine
TL;DR: The prevalence of carriage is the highest in children, with direct vaccine impact exemplified by low carriage and disease prevalence of PCV7 serotypes in vaccinated children, whereas the indirect effects of herd protection are implied by similar observations in unvaccinated parents and older adults.

Journal ArticleDOI
TL;DR: Development of CDAD does not require discontinuation of anti-acid treatment in patients who have an indication for continuing PPI or H2RA therapy, such as gastro-oesophageal reflux disease and risk of gastrointestinal bleed.
Abstract: Objective It has been established that use of proton pump inhibitors (PPIs) is associated with an increased risk of acquiring Clostridium difficile-associated diarrhoea (CDAD). However, it is not known whether the use of PPIs or histamine-2 receptor antagonists (H2RAs) concurrently with CDAD-targeted antibiotic treatment affects clinical response or recurrence rates. Design In two phase 3 trials, patients with toxin-positive CDAD were randomised to receive fidaxomicin 200 mg twice daily or vancomycin 125 mg four times daily for 10 days. Only inpatients with CDAD (due to complete medication record availability) were included in this post hoc analysis: 701 patients, of whom 446 (64%) used PPIs or H2RAs during study drug treatment or follow-up. Baseline factors that were statistically significant in univariate analyses were analysed in multivariate analyses of effects on clinical response and recurrence. Results Multivariate analysis showed that leukocytosis, elevated creatinine and hypoalbuminemia, but not ...

Journal ArticleDOI
TL;DR: Viral infections are commonly associated with acute newborn illness, even in the early neonatal period, and the estimated incidence was 5-fold greater than reported previously for bacterial infections.
Abstract: BACKGROUND: The etiology of >90% of cases of suspected neonatal infection remains unknown. We conducted community-based surveillance in conjunction with hospital-based surveillance in a rural region in Bangladesh from June 2006 to September 2007 to assess the incidence and etiology of community-acquired viral infections among neonates. METHODS: Community health workers (CHWs) assessed neonates at home on days 0 2 5 and 8 after birth and referred cases of suspected illness to the hospital (CHW surveillance). Among neonates with clinically suspected upper respiratory tract infection (URTI) pneumonia sepsis and/or meningitis virus identification studies were conducted on nasal wash cerebrospinal fluid and/or blood specimens. In the hospital-based surveillance similar screening was conducted among all neonates (referred by CHWs and self-referred) who were admitted to the hospital. RESULTS: CHW surveillance found an incidence rate of 15.6 neonatal viral infections per 1000 live births with 30% of infections identified on the day of birth. Among neonates with suspected sepsis a viral etiology was identified in 36% of cases with enterovirus accounting for two-thirds of those infections. Respiratory syncytial virus was the most common etiologic agent among those with viral pneumonia (91%) and URTI (68%). There was a low incidence (1.2%) of influenza in this rural population. CONCLUSION: Viral infections are commonly associated with acute newborn illness even in the early neonatal period. The estimated incidence was 5-fold greater than reported previously for bacterial infections. Low-cost preventive measures for neonatal viral infections are urgently needed.

Journal ArticleDOI
01 Sep 2015-Chest
TL;DR: “Atypical” pneumonia organisms frequently cause community-acquired pneu monia, but their role in pleural infection is unknown because they are not detected by standard pleural fl uid culture.

Journal ArticleDOI
TL;DR: Clostridium difficile is a significant gastrointestinal pathogen and a leading cause of life-threatening diarrhoea in the developed world, and oncology patients are at high risk of infection.
Abstract: Clostridium difficile is a significant gastrointestinal pathogen and a leading cause of life-threatening diarrhoea in the developed world. Antibiotic therapy and immunodeficiency are key risk factors for C. difficile infection (CDI); consequently, oncology patients are at high risk.

Journal ArticleDOI
TL;DR: A notable proportion of staff time was devoted to the management of patients with suspected infectious diarrhoea in England as mentioned in this paper, indicating that either these were not evidence-based, or the evidence base is weak.

Journal ArticleDOI
17 Apr 2015-PLOS ONE
TL;DR: It is suggested that rare, functional variants in MYD88, IRAK4 or IKBKG do not significantly contribute to IPD susceptibility in adults at the population level.
Abstract: Although rare variants within the Toll-like receptor signalling pathway genes have been found to underlie human primary immunodeficiencies associated with selective predisposition to invasive pneumococcal disease (IPD), the contribution of variants in these genes to IPD susceptibility at the population level remains unknown. Complete re-sequencing of IRAK4, MYD88 and IKBKG genes was undertaken in 164 IPD cases from the UK and 164 geographically-matched population-based controls. 233 single-nucleotide variants (SNVs) were identified, of which ten were in coding regions. Four rare coding variants were predicted to be deleterious, two variants in MYD88 and two in IRAK4. The predicted deleterious variants in MYD88 were observed as two heterozygote cases but not seen in controls. Frequencies of predicted deleterious IRAK4 SNVs were the same in cases and controls. Our findings suggest that rare, functional variants in MYD88, IRAK4 or IKBKG do not significantly contribute to IPD susceptibility in adults at the population level.

01 Jan 2015
TL;DR: There is huge potential for using advances in large-scale machine learning methodologies to exploit the contents of these complex data sets by performing robust, scalable, automated inference to improve healthcare outcomes significantly by using patient-specific probabilistic models.
Abstract: Healthcare systems worldwide are entering a new phase: ever-increasing quantities of complex, massively multivariate data concerning all aspects of patient care are starting to be routinely acquired and stored [1], throughout the life of a patient. This exponential growth in data quantities far outpaces the capability of clinical experts to cope, resulting in a so-called data deluge, in which the data are largely unexploited. There is huge potential for using advances in large-scale machine learning methodologies* to exploit the contents of these complex data sets by performing robust, scalable, automated inference to improve healthcare outcomes significantly by using patient-specific probabilistic models, a field in which there is little existing research [2] and which promises to develop into a new

Posted Content
TL;DR: This paper demonstrates its ability to detect genes and genetic variants underlying resistance to 17 antimicrobials in 3363 isolates from four taxonomically diverse clonal and recombining bacteria and reveals a novel association between the outer membrane porin nmpC and cefazolin resistance in E. coli.
Abstract: Bacteria pose unique challenges for genome-wide association studies (GWAS) because of strong structuring into distinct strains and substantial linkage disequilibrium across the genome. While methods developed for human studies can correct for strain structure, this risks considerable loss- of-power because genetic differences between strains often contribute substantial phenotypic variability. Here we propose a new method that captures lineage-level associations even when locus-specific associations cannot be fine-mapped. We demonstrate its ability to detect genes and genetic variants underlying resistance to 17 antimicrobials in 3363 isolates from four taxonomically diverse clonal and recombining bacteria: Mycobacterium tuberculosis, Staphylococcus aureus, Escherichia coli and Klebsiella pneumoniae. Strong selection, recombination and penetrance confer high power to recover known antimicrobial resistance mechanisms, and reveal a novel association between the outer membrane porin nmpC and cefazolin resistance in E. coli. Hence our method pinpoints locus-specific effects where possible, and boosts power by detecting lineage-level differences when fine-mapping is intractable.

Posted ContentDOI
02 Dec 2015-bioRxiv
TL;DR: High levels of resistance gene mobility to multiple bacterial species, which was facilitated by various different genetic mechanisms were revealed, challenging traditional views that drug-resistance outbreaks are due to transmission of a single pathogenic strain.
Abstract: The recent widespread emergence of carbapenem resistance in Enterobacteriaceae is a major public health concern, as carbapenems are a therapy of last resort in this family of common bacterial pathogens. Resistance genes can mobilize via various mechanisms including conjugation and transposition, however the importance of this mobility in short-term evolution, such as within nosocomial outbreaks, is currently unknown. Using a combination of short- and long-read whole genome sequencing of 281 bla KPC -positive Enterobacteriaceae isolated from a single hospital over five years, we demonstrate rapid dissemination of this carbapenem resistance gene to multiple species, strains, and plasmids. Mobility of bla KPC occurs at multiple nested genetic levels, with transmission of bla KPC strains between individuals, frequent transfer of bla KPC plasmids between strains/species, and frequent transposition of the bla KPC transposon Tn 4401 between plasmids. We also identify a common insertion site for Tn 4401 within various Tn 2 -like elements, suggesting that homologous recombination between Tn 2 -like elements has enhanced the spread of Tn 4401 between different plasmid vectors. Furthermore, while short-read sequencing has known limitations for plasmid assembly, various studies have attempted to overcome this with the use of reference-based methods. We also demonstrate that as a consequence of the genetic mobility observed herein, plasmid structures can be extremely dynamic, and therefore these reference-based methods, as well as traditional partial typing methods, can produce very misleading conclusions. Overall, our findings demonstrate that non-clonal resistance gene dissemination can be extremely rapid, presenting significant challenges for public health surveillance and achieving effective control of antibiotic resistance.