scispace - formally typeset
Search or ask a question

Showing papers by "Todd R. Golub published in 2017"


Journal ArticleDOI
30 Nov 2017-Cell
TL;DR: The expanded CMap is reported, made possible by a new, low-cost, high-throughput reduced representation expression profiling method that is shown to be highly reproducible, comparable to RNA sequencing, and suitable for computational inference of the expression levels of 81% of non-measured transcripts.

1,943 citations


Journal ArticleDOI
27 Jul 2017-Cell
TL;DR: DEMETER, an analytical framework that segregates on- from off-target effects of RNAi, demonstrates the basis behind one such predictive model linking hypermethylation of the UBB ubiquitin gene to a dependency on UBC and provides a foundation for a cancer dependency map that facilitates the prioritization of therapeutic targets.

1,533 citations


Journal ArticleDOI
TL;DR: CERES, a computational method to estimate gene-dependency levels from CRISPR–Cas9 essentiality screens while accounting for the copy number–specific effect, is developed and found that CERES decreased false-positive results and estimated sgRNA activity for both this data set and previously published screens performed with different sg RNA libraries.
Abstract: The CRISPR-Cas9 system has revolutionized gene editing both at single genes and in multiplexed loss-of-function screens, thus enabling precise genome-scale identification of genes essential for proliferation and survival of cancer cells. However, previous studies have reported that a gene-independent antiproliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, thereby leading to false-positive results in copy number-amplified regions. We developed CERES, a computational method to estimate gene-dependency levels from CRISPR-Cas9 essentiality screens while accounting for the copy number-specific effect. In our efforts to define a cancer dependency map, we performed genome-scale CRISPR-Cas9 essentiality screens across 342 cancer cell lines and applied CERES to this data set. We found that CERES decreased false-positive results and estimated sgRNA activity for both this data set and previously published screens performed with different sgRNA libraries. We further demonstrate the utility of this collection of screens, after CERES correction, for identifying cancer-type-specific vulnerabilities.

1,239 citations


Journal ArticleDOI
15 Sep 2017-Science
TL;DR: By studying colon cancer models, it is found that bacteria can metabolize the chemotherapeutic drug gemcitabine into its inactive form, 2′,2′-difluorodeoxyuridine, seen primarily in Gammaproteobacteria.
Abstract: Growing evidence suggests that microbes can influence the efficacy of cancer therapies. By studying colon cancer models, we found that bacteria can metabolize the chemotherapeutic drug gemcitabine (2',2'-difluorodeoxycytidine) into its inactive form, 2',2'-difluorodeoxyuridine. Metabolism was dependent on the expression of a long isoform of the bacterial enzyme cytidine deaminase (CDDL), seen primarily in Gammaproteobacteria. In a colon cancer mouse model, gemcitabine resistance was induced by intratumor Gammaproteobacteria, dependent on bacterial CDDL expression, and abrogated by cotreatment with the antibiotic ciprofloxacin. Gemcitabine is commonly used to treat pancreatic ductal adenocarcinoma (PDAC), and we hypothesized that intratumor bacteria might contribute to drug resistance of these tumors. Consistent with this possibility, we found that of the 113 human PDACs that were tested, 86 (76%) were positive for bacteria, mainly Gammaproteobacteria.

923 citations


Posted ContentDOI
10 May 2017-bioRxiv
TL;DR: A new, low-cost, high throughput reduced representation expression profiling method, L1000, is shown to be highly reproducible, comparable to RNA sequencing, and suitable for computational inference of the expression levels of 81% of non-measured transcripts.
Abstract: We previously piloted the concept of a Connectivity Map (CMap), whereby genes, drugs and disease states are connected by virtue of common gene-expression signatures. Here, we report more than a 1,000-fold scale-up of the CMap as part of the NIH LINCS Consortium, made possible by a new, low-cost, high throughput reduced representation expression profiling method that we term L1000. We show that L1000 is highly reproducible, comparable to RNA sequencing, and suitable for computational inference of the expression levels of 81% of non-measured transcripts. We further show that the expanded CMap can be used to discover mechanism of action of small molecules, functionally annotate genetic variants of disease genes, and inform clinical trials. The 1.3 million L1000 profiles described here, as well as tools for their analysis, are available at https://clue.io.

636 citations


Journal ArticleDOI
TL;DR: This work hand-curated a collection of 4,707 compounds, experimentally confirmed their identities, and annotated them with literature-reported targets, to assemble a comprehensive library of drugs that have reached the clinic and established a blueprint for others to easily assemble such a repurposing library.
Abstract: To the Editor: Drug repurposing, the application of an existing therapeutic to a new disease indication, holds promise of rapid clinical impact at a lower cost than de novo drug development. So far, there has not been a systematic effort to identify such opportunities, limited in part by the lack of a comprehensive library of clinical compounds suitable for testing. To address this challenge, we hand-curated a collection of 4,707 compounds, experimentally confirmed their identities, and annotated them with literature-reported targets. The collection includes 3,422 drugs that are marketed around the world or that have been tested in human clinical trials. Compounds were obtained from more than 50 chemical vendors, and the purity of each sample was established. We have thus established a blueprint for others to easily assemble such a repurposing library, and we have created an online Drug Repurposing Hub (http:// www.broadinstitute.org/repurposing) that contains detailed annotation for each of the compounds. Repurposing is attractive and pragmatic, given the substantial cost and time requirements—on average, a decade or more—for drug development1. In addition, a large number of potential drugs never reach clinical testing. Moreover, fewer than 15% of compounds that enter clinical development ultimately receive approval, despite the majority of them being deemed safe2. For either approved or failed drugs for which safety has already been established, finding new indications can rapidly bring benefits to patients. Prior drug-repurposing successes span disease areas; examples include the cyclooxygenase inhibitor aspirin to treat coronary-artery disease, the phosphodiesterase inhibitor sildenafil to treat erectile dysfunction, and the antibiotic erythromycin for impaired gastric motility (Supplementary Table 1)3. Even drugs associated with troubling side effects merit reconsideration, as evidenced by the successful repurposing of the antiemetic thalidomide to treat multiple myeloma4. Risk-mediating measures for avoiding the potential teratogenicity of thalidomide and its derivatives are reasonable in patients with life-threatening cancer, whereas the use of these drugs to treat nausea remains unacceptable. Although the benefits of repurposing are clear, successes thus far have been mostly serendipitous. Systematic, large-scale repurposing efforts have not been possible owing to the lack of a definitive physical drug collection, the low quality of drug annotations, and insufficient readouts of drug activity from which new indications can be predicted. Recent technological advances have enabled a step change in our ability to assess drug activities comprehensively. For example, perturbational gene expression profiles can now be obtained at high throughput across multiple cell types5. Gene expression profiling has enabled recent repurposing discoveries, including sirolimus for glucocorticoid-resistant acute lymphocytic leukemia, topiramate for inflammatory-bowel disease, and imipramine for small-cell lung cancer. For cancer therapeutics, a recently developed assay known as PRISM, which uses barcoded cell lines, enables rapid testing of many drugs against a large number of cancer cell lines in pools6. Molecular features of the cell lines (for example, gene expression, mutation, or copy-number variation) can then be used to identify predictive biomarkers of drug sensitivity (Supplementary Table 2). Finally, morphologic changes in cells can be assessed using high-throughput microscopy and machine-learning approaches. Such imaging-based screening unexpectedly identified the cholesterol drug lovastatin as a potent inhibitor of leukemia stem cells. To take advantage of these advances in experimental methods, we sought to assemble a comprehensive library of drugs that have reached the clinic. Surprisingly, we found that no such chemical library of approved and clinical trial drugs is available for purchase. In particular, drugs that have been tested in clinical trials but did not reach approval are not readily accessible. Even obtaining a complete list of such drugs and their annotations is challenging. A prior effort led by the US National Institutes of Health (NIH) focused on drugs approved by the US Food and Drug Administration (FDA), but the library has few compounds that have yet to achieve FDA approval7. Some chemical vendors offer a subset of approved drugs, but most of these commercial libraries overlap in their content and include only a small fraction of the approximately 10,000 drugs that have reached the clinic in the United States and Europe. Given that no complete collection exists, we launched a three-step effort to create the Repurposing Library by (i) identifying and purchasing compounds; (ii) comprehensively annotating their known activities and clinical indications; and (iii) experimentally confirming drug identity and purity. We employed two approaches to identify clinical-drug structures for the Repurposing Library. First, we searched existing databases, both publicly accessible and proprietary, for clinically tested drugs and then manually integrated them to ensure sufficient drug coverage and chemical-structure reliability (Supplementary Table 3). Sources included DrugBank, the NCATS NCGC Pharmaceutical Collection (NPC), Thomson Reuters Integrity, Thomson Reuters Cortellis, and Citeline Pharmaprojects7–9. Second, we located marketed or approved ingredient lists from regulatory agencies worldwide, including the FDA. After structure standardization and the removal of duplicates, approximately 10,000 small-molecule drugs with disclosed structures were found to have reached clinical development. Most of these drugs are not widely available in commercial screening libraries. Through structure-matching (as opposed to relying on compound names), chemical suppliers were identified for 5,691 compounds (Fig. 1). Controlled substances, nonpharmaceutical substances, and redundant elemental formulations were not pursued further. To assemble the collection, we ultimately purchased 8,584 samples (representing 5,691 unique compounds) from 75 chemical vendors, at an average cost of $29 per sample. We performed chemical-structure analysis on all clinical-drug structures (whether commercially available or not) to assess the extent of The Drug Repurposing Hub: a next-generation drug library and information resource

619 citations


Journal ArticleDOI
31 Mar 2017-Science
TL;DR: This approach provides a general framework to decipher differences between classes of human tumors by decoupling cancer cell genotypes, phenotypes, and the composition of the TME.
Abstract: Tumor subclasses differ according to the genotypes and phenotypes of malignant cells as well as the composition of the tumor microenvironment (TME). We dissected these influences in isocitrate dehydrogenase (IDH)-mutant gliomas by combining 14,226 single-cell RNA sequencing (RNA-seq) profiles from 16 patient samples with bulk RNA-seq profiles from 165 patient samples. Differences in bulk profiles between IDH-mutant astrocytoma and oligodendroglioma can be primarily explained by distinct TME and signature genetic events, whereas both tumor types share similar developmental hierarchies and lineages of glial differentiation. As tumor grade increases, we find enhanced proliferation of malignant cells, larger pools of undifferentiated glioma cells, and an increase in macrophage over microglia expression programs in TME. Our work provides a unifying model for IDH-mutant gliomas and a general framework for dissecting the differences among human tumor subclasses.

563 citations


01 Feb 2017
TL;DR: In this article, the authors dissected the influences in isocitrate dehydrogenase (IDH)-mutant gliomas by combining 14,226 single-cell RNA sequencing (RNA-seq) profiles from 16 patient samples with bulk RNA-seq profiles from 165 patient samples.
Abstract: Tumor subclasses differ according to the genotypes and phenotypes of malignant cells as well as the composition of the tumor microenvironment (TME). We dissected these influences in isocitrate dehydrogenase (IDH)-mutant gliomas by combining 14,226 single-cell RNA sequencing (RNA-seq) profiles from 16 patient samples with bulk RNA-seq profiles from 165 patient samples. Differences in bulk profiles between IDH-mutant astrocytoma and oligodendroglioma can be primarily explained by distinct TME and signature genetic events, whereas both tumor types share similar developmental hierarchies and lineages of glial differentiation. As tumor grade increases, we find enhanced proliferation of malignant cells, larger pools of undifferentiated glioma cells, and an increase in macrophage over microglia expression programs in TME. Our work provides a unifying model for IDH-mutant gliomas and a general framework for dissecting the differences among human tumor subclasses.

520 citations


Journal ArticleDOI
TL;DR: In this paper, a software called ichorCNA was proposed to quantitatively measure tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations.
Abstract: Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.

519 citations


Journal ArticleDOI
TL;DR: Several CNAs recurrently observed in primary tumors gradually disappeared in PDXs, indicating that events undergoing positive selection in humans can become dispensable during propagation in mice.
Abstract: Patient-derived xenografts (PDXs) have become a prominent cancer model system, as they are presumed to faithfully represent the genomic features of primary tumors. Here we monitored the dynamics of copy number alterations (CNAs) in 1,110 PDX samples across 24 cancer types. We observed rapid accumulation of CNAs during PDX passaging, often due to selection of preexisting minor clones. CNA acquisition in PDXs was correlated with the tissue-specific levels of aneuploidy and genetic heterogeneity observed in primary tumors. However, the particular CNAs acquired during PDX passaging differed from those acquired during tumor evolution in patients. Several CNAs recurrently observed in primary tumors gradually disappeared in PDXs, indicating that events undergoing positive selection in humans can become dispensable during propagation in mice. Notably, the genomic stability of PDXs was associated with their response to chemotherapy and targeted drugs. These findings have major implications for PDX-based modeling of human cancer.

494 citations


Posted ContentDOI
10 Jul 2017-bioRxiv
TL;DR: CERES, a computational method to estimate gene dependency levels from CRISPR-Cas9 essentiality screens while accounting for the copy-number-specific effect, as well as variable sgRNA activity, is developed and applied to sets of screens performed with different sgRNAs and found that it reduces false positive results and provides meaningful estimates of sg RNA activity.
Abstract: The CRISPR-Cas9 system has revolutionized gene editing both on single genes and in multiplexed loss-of-function screens, enabling precise genome-scale identification of genes essential to proliferation and survival of cancer cells. However, previous studies reported that an anti-proliferative effect of Cas9-mediated DNA cleavage confounds such measurement of genetic dependency, particularly in the setting of copy number gain. We performed genome-scale CRISPR-Cas9 essentiality screens on 342 cancer cell lines and found that this effect is common to all lines, leading to false positive results when targeting genes in copy number amplified regions. We developed CERES, a computational method to estimate gene dependency levels from CRISPR-Cas9 essentiality screens while accounting for the copy-number-specific effect, as well as variable sgRNA activity. We applied CERES to sets of screens performed with different sgRNA libraries and found that it reduces false positive results and provides meaningful estimates of sgRNA activity. As a result, the application of CERES improves confidence in the interpretation of genetic dependency data from CRISPR-Cas9 essentiality screens of cancer cell lines.

Journal ArticleDOI
TL;DR: It is established that germline nonsense and frameshift variants in PALB2, but not in ATM or CHEK2, can also give rise to the same signature and epigenetic silencing of RAD51C and BRCA1 by promoter methylation is strongly associated with signature 3.
Abstract: Biallelic inactivation of BRCA1 or BRCA2 is associated with a pattern of genome-wide mutations known as signature 3. By analyzing ∼1,000 breast cancer samples, we confirmed this association and established that germline nonsense and frameshift variants in PALB2, but not in ATM or CHEK2, can also give rise to the same signature. We were able to accurately classify missense BRCA1 or BRCA2 variants known to impair homologous recombination (HR) on the basis of this signature. Finally, we show that epigenetic silencing of RAD51C and BRCA1 by promoter methylation is strongly associated with signature 3 and, in our data set, was highly enriched in basal-like breast cancers in young individuals of African descent.

Journal ArticleDOI
Alexandra B Keenan1, Sherry L. Jenkins1, Kathleen M. Jagodnik1, Simon Koplev1, Edward He1, Denis Torre1, Zichen Wang1, Anders B. Dohlman1, Moshe C. Silverstein1, Alexander Lachmann1, Maxim V. Kuleshov1, Avi Ma'ayan1, Vasileios Stathias2, Raymond Terryn2, Daniel J. Cooper2, Michele Forlin2, Amar Koleti2, Dusica Vidovic2, Caty Chung2, Stephan C. Schürer2, Jouzas Vasiliauskas3, Marcin Pilarczyk3, Behrouz Shamsaei3, Mehdi Fazel3, Yan Ren3, Wen Niu3, Nicholas A. Clark3, Shana White3, Naim Al Mahi3, Lixia Zhang3, Michal Kouril3, John F. Reichard3, Siva Sivaganesan3, Mario Medvedovic3, Jaroslaw Meller3, Rick J. Koch1, Marc R. Birtwistle1, Ravi Iyengar1, Eric A. Sobie1, Evren U. Azeloglu1, Julia A. Kaye4, Jeannette Osterloh4, Kelly Haston4, Jaslin Kalra4, Steve Finkbiener4, Jonathan Z. Li5, Pamela Milani5, Miriam Adam5, Renan Escalante-Chong5, Karen Sachs5, Alexander LeNail5, Divya Ramamoorthy5, Ernest Fraenkel5, Gavin Daigle6, Uzma Hussain6, Alyssa Coye6, Jeffrey D. Rothstein6, Dhruv Sareen7, Loren Ornelas7, Maria G. Banuelos7, Berhan Mandefro7, Ritchie Ho7, Clive N. Svendsen7, Ryan G. Lim8, Jennifer Stocksdale8, Malcolm Casale8, Terri G. Thompson8, Jie Wu8, Leslie M. Thompson8, Victoria Dardov7, Vidya Venkatraman7, Andrea Matlock7, Jennifer E. Van Eyk7, Jacob D. Jaffe9, Malvina Papanastasiou9, Aravind Subramanian9, Todd R. Golub, Sean D. Erickson10, Mohammad Fallahi-Sichani10, Marc Hafner10, Nathanael S. Gray10, Jia-Ren Lin10, Caitlin E. Mills10, Jeremy L. Muhlich10, Mario Niepel10, Caroline E. Shamu10, Elizabeth H. Williams10, David Wrobel10, Peter K. Sorger10, Laura M. Heiser11, Joe W. Gray11, James E. Korkola11, Gordon B. Mills12, Mark A. LaBarge13, Mark A. LaBarge14, Heidi S. Feiler11, Mark A. Dane11, Elmar Bucher11, Michel Nederlof11, Damir Sudar11, Sean M. Gross11, David Kilburn11, Rebecca Smith11, Kaylyn Devlin11, Ron Margolis, Leslie Derr, Albert Lee, Ajay Pillai 
TL;DR: The LINCS program focuses on cellular physiology shared among tissues and cell types relevant to an array of diseases, including cancer, heart disease, and neurodegenerative disorders.
Abstract: The Library of Integrated Network-Based Cellular Signatures (LINCS) is an NIH Common Fund program that catalogs how human cells globally respond to chemical, genetic, and disease perturbations. Resources generated by LINCS include experimental and computational methods, visualization tools, molecular and imaging data, and signatures. By assembling an integrated picture of the range of responses of human cells exposed to many perturbations, the LINCS program aims to better understand human disease and to advance the development of new therapies. Perturbations under study include drugs, genetic perturbations, tissue micro-environments, antibodies, and disease-causing mutations. Responses to perturbations are measured by transcript profiling, mass spectrometry, cell imaging, and biochemical methods, among other assays. The LINCS program focuses on cellular physiology shared among tissues and cell types relevant to an array of diseases, including cancer, heart disease, and neurodegenerative disorders. This Perspective describes LINCS technologies, datasets, tools, and approaches to data accessibility and reusability.

Journal ArticleDOI
28 Jun 2017-Nature
TL;DR: This study shows that promoter regions harbour recurrent mutations in cancer with functional consequences and that the mutations occur at similar frequencies as in coding regions.
Abstract: Genomic analysis of tumours has led to the identification of hundreds of cancer genes on the basis of the presence of mutations in protein-coding regions. By contrast, much less is known about cancer-causing mutations in non-coding regions. Here we perform deep sequencing in 360 primary breast cancers and develop computational methods to identify significantly mutated promoters. Clear signals are found in the promoters of three genes. FOXA1, a known driver of hormone-receptor positive breast cancer, harbours a mutational hotspot in its promoter leading to overexpression through increased E2F binding. RMRP and NEAT1, two non-coding RNA genes, carry mutations that affect protein binding to their promoters and alter expression levels. Our study shows that promoter regions harbour recurrent mutations in cancer with functional consequences and that the mutations occur at similar frequencies as in coding regions. Power analyses indicate that more such regions remain to be discovered through deep sequencing of adequately sized cohorts of patients.


Journal ArticleDOI
TL;DR: It is shown that Val-boroPro stimulates the immune system by triggering a proinflammatory form of cell death in monocytes and macrophages known as pyroptosis and reveals a new checkpoint that controls the activation of the innate immune system.
Abstract: Inhibitors of the post-proline-cleaving serine proteases DPP8 and DPP9 trigger a lytic form of programmed cell death called pyroptosis by activating pro-caspase-1 without autoproteolysis.

Journal ArticleDOI
TL;DR: This work shows that mitigating off-target effects is feasible in these datasets via computational methodologies to produce a consensus gene signature (CGS) and compared RNAi technology to clustered regularly interspaced short palindromic repeat (CRISPR)-based knockout by analysis of 373 single guide RNAs (sgRNAs) to show that the on-target efficacies are comparable, but CRISPR technology is far less susceptible to systematic off- target effects.
Abstract: The application of RNA interference (RNAi) to mammalian cells has provided the means to perform phenotypic screens to determine the functions of genes. Although RNAi has revolutionized loss-of-function genetic experiments, it has been difficult to systematically assess the prevalence and consequences of off-target effects. The Connectivity Map (CMAP) represents an unprecedented resource to study the gene expression consequences of expressing short hairpin RNAs (shRNAs). Analysis of signatures for over 13,000 shRNAs applied in 9 cell lines revealed that microRNA (miRNA)-like off-target effects of RNAi are far stronger and more pervasive than generally appreciated. We show that mitigating off-target effects is feasible in these datasets via computational methodologies to produce a consensus gene signature (CGS). In addition, we compared RNAi technology to clustered regularly interspaced short palindromic repeat (CRISPR)-based knockout by analysis of 373 single guide RNAs (sgRNAs) in 6 cells lines and show that the on-target efficacies are comparable, but CRISPR technology is far less susceptible to systematic off-target effects. These results will help guide the proper use and analysis of loss-of-function reagents for the determination of gene function.

Journal ArticleDOI
30 Mar 2017-Nature
TL;DR: It is shown, using in vitro and in vivo studies in mice and humans, that the mitochondrial protein LACTB potently inhibits the proliferation of breast cancer cells and demonstrates a connection between mitochondrial lipid metabolism and the differentiation program of Breast cancer cells, thereby revealing a previously undescribed mechanism of tumour suppression.
Abstract: Post-mitotic, differentiated cells exhibit a variety of characteristics that contrast with those of actively growing neoplastic cells, such as the expression of cell-cycle inhibitors and differentiation factors. We hypothesized that the gene expression profiles of these differentiated cells could reveal the identities of genes that may function as tumour suppressors. Here we show, using in vitro and in vivo studies in mice and humans, that the mitochondrial protein LACTB potently inhibits the proliferation of breast cancer cells. Its mechanism of action involves alteration of mitochondrial lipid metabolism and differentiation of breast cancer cells. This is achieved, at least in part, through reduction of the levels of mitochondrial phosphatidylserine decarboxylase, which is involved in the synthesis of mitochondrial phosphatidylethanolamine. These observations uncover a novel mitochondrial tumour suppressor and demonstrate a connection between mitochondrial lipid metabolism and the differentiation program of breast cancer cells, thereby revealing a previously undescribed mechanism of tumour suppression.

Journal ArticleDOI
TL;DR: This microscopy dataset includes 919 265 five-channel fields of view, representing 30 616 tested compounds, available at “The Cell Image Library” (CIL) repository, and includes data files containing morphological features derived from each cell in each image, both at the single-cell level and population-averaged level.
Abstract: Background Large-scale image sets acquired by automated microscopy of perturbed samples enable a detailed comparison of cell states induced by each perturbation, such as a small molecule from a diverse library. Highly multiplexed measurements of cellular morphology can be extracted from each image and subsequently mined for a number of applications. Findings This microscopy dataset includes 919 265 five-channel fields of view, representing 30 616 tested compounds, available at "The Cell Image Library" (CIL) repository. It also includes data files containing morphological features derived from each cell in each image, both at the single-cell level and population-averaged (i.e., per-well) level; the image analysis workflows that generated the morphological features are also provided. Quality-control metrics are provided as metadata, indicating fields of view that are out-of-focus or containing highly fluorescent material or debris. Lastly, chemical annotations are supplied for the compound treatments applied. Conclusions Because computational algorithms and methods for handling single-cell morphological measurements are not yet routine, the dataset serves as a useful resource for the wider scientific community applying morphological (image-based) profiling. The dataset can be mined for many purposes, including small-molecule library enrichment and chemical mechanism-of-action studies, such as target identification. Integration with genetically perturbed datasets could enable identification of small-molecule mimetics of particular disease- or gene-related phenotypes that could be useful as probes or potential starting points for development of future therapeutics.

Journal ArticleDOI
TL;DR: It is demonstrated that MYCN upregulates EZH2, leading to inactivation of a tumor suppressor program in neuroblastoma, and support testing EZh2 inhibitors in patients with MYCN-amplified neuroblastomas.
Abstract: Pharmacologically difficult targets, such as MYC transcription factors, represent a major challenge in cancer therapy. For the childhood cancer neuroblastoma, amplification of the oncogene MYCN is associated with high-risk disease and poor prognosis. Here, we deployed genome-scale CRISPR-Cas9 screening of MYCN-amplified neuroblastoma and found a preferential dependency on genes encoding the polycomb repressive complex 2 (PRC2) components EZH2, EED, and SUZ12. Genetic and pharmacological suppression of EZH2 inhibited neuroblastoma growth in vitro and in vivo. Moreover, compared with neuroblastomas without MYCN amplification, MYCN-amplified neuroblastomas expressed higher levels of EZH2. ChIP analysis showed that MYCN binds at the EZH2 promoter, thereby directly driving expression. Transcriptomic and epigenetic analysis, as well as genetic rescue experiments, revealed that EZH2 represses neuronal differentiation in neuroblastoma in a PRC2-dependent manner. Moreover, MYCN-amplified and high-risk primary tumors from patients with neuroblastoma exhibited strong repression of EZH2-regulated genes. Additionally, overexpression of IGFBP3, a direct EZH2 target, suppressed neuroblastoma growth in vitro and in vivo. We further observed strong synergy between histone deacetylase inhibitors and EZH2 inhibitors. Together, these observations demonstrate that MYCN upregulates EZH2, leading to inactivation of a tumor suppressor program in neuroblastoma, and support testing EZH2 inhibitors in patients with MYCN-amplified neuroblastoma.

Journal ArticleDOI
TL;DR: Large-scale transcriptomic profiling of breast cancer cell lines treated with anti-cancer drugs is performed and it is found that certain drug classes induce cell line specific responses.
Abstract: More effective use of targeted anti-cancer drugs depends on elucidating the connection between the molecular states induced by drug treatment and the cellular phenotypes controlled by these states, such as cytostasis and death. This is particularly true when mutation of a single gene is inadequate as a predictor of drug response. The current paper describes a data set of ~600 drug cell line pairs collected as part of the NIH LINCS Program ( http://www.lincsproject.org/ ) in which molecular data (reduced dimensionality transcript L1000 profiles) were recorded across dose and time in parallel with phenotypic data on cellular cytostasis and cytotoxicity. We report that transcriptional and phenotypic responses correlate with each other in general, but whereas inhibitors of chaperones and cell cycle kinases induce similar transcriptional changes across cell lines, changes induced by drugs that inhibit intra-cellular signaling kinases are cell-type specific. In some drug/cell line pairs significant changes in transcription are observed without a change in cell growth or survival; analysis of such pairs identifies drug equivalence classes and, in one case, synergistic drug interactions. In this case, synergy involves cell-type specific suppression of an adaptive drug response.

Journal ArticleDOI
TL;DR: Mechanistically, miR-31 increased the sensitivity of T cells to type I interferons, which interfered with effector T cell function and increased the expression of several proteins related to T cell dysfunction during chronic infection.
Abstract: During infection, antigen-specific T cells undergo tightly regulated developmental transitions controlled by transcriptional and post-transcriptional regulation of gene expression We found that the microRNA miR-31 was strongly induced by activation of the T cell antigen receptor (TCR) in a pathway involving calcium and activation of the transcription factor NFAT During chronic infection with lymphocytic choriomeningitis virus (LCMV) clone 13, miR-31-deficent mice recovered from clinical disease, while wild-type mice continued to show signs of disease This disease phenotype was explained by the presence of larger numbers of cytokine-secreting LCMV-specific CD8+ T cells in miR-31-deficent mice than in wild-type mice Mechanistically, miR-31 increased the sensitivity of T cells to type I interferons, which interfered with effector T cell function and increased the expression of several proteins related to T cell dysfunction during chronic infection These studies identify miR-31 as an important regulator of T cell exhaustion in chronic infection

Journal ArticleDOI
TL;DR: The results suggest that the expression of a significant fraction of cancer genes may be regulated by ceRNA interactions in each of the two tumor contexts, and statistically significant effects for half of the predicted targets.
Abstract: MicroRNAs (miRNAs) play multiple roles in tumor biology. Interestingly, reports from multiple groups suggest that miRNA targets may be coupled through competitive stoichiometric sequestration. Specifically, computational models predicted and experimental assays confirmed that miRNA activity is dependent on miRNA target abundance, and consequently, changes in the abundance of some miRNA targets lead to changes to the regulation and abundance of their other targets. The resulting indirect regulatory influence between miRNA targets resembles competition and has been dubbed competitive endogenous RNA (ceRNA). Recent studies have questioned the physiological relevance of ceRNA interactions, our ability to accurately predict these interactions, and the number of genes that are impacted by ceRNA interactions in specific cellular contexts. To address these concerns, we reverse engineered ceRNA networks (ceRNETs) in breast and prostate adenocarcinomas using context-specific TCGA profiles, and tested whether ceRNA interactions can predict the effects of RNAi-mediated gene silencing perturbations in PC3 and MCF7 cells._ENREF_22 Our results, based on tests of thousands of inferred ceRNA interactions that are predicted to alter hundreds of cancer genes in each of the two tumor contexts, confirmed statistically significant effects for half of the predicted targets. Our results suggest that the expression of a significant fraction of cancer genes may be regulated by ceRNA interactions in each of the two tumor contexts.

Posted ContentDOI
08 Jun 2017-bioRxiv
TL;DR: This work shows that mitigating off-target effects is feasible in these datasets via computational methodologies to produce a Consensus Gene Signature (CGS), and compares RNAi technology to clustered regularly interspaced short palindromic repeat (CRISPR)-based knockout by analysis of 373 sgRNAs in 6 cells lines, and shows that the on-target efficacies are comparable.
Abstract: The application of RNA interference (RNAi) to mammalian cells has provided the means to perform phenotypic screens to determine the functions of genes. Although RNAi has revolutionized loss of function genetic experiments, it has been difficult to systematically assess the prevalence and consequences of off-target effects. The Connectivity Map (CMAP) represents an unprecedented resource to study the gene expression consequences of expressing short hairpin RNAs (shRNAs). Analysis of signatures for over 13,000 shRNAs applied in 9 cell lines revealed that miRNA-like off-target effects of RNAi are far stronger and more pervasive than generally appreciated. We show that mitigating off-target effects is feasible in these datasets via computational methodologies to produce a Consensus Gene Signature (CGS). In addition, we compared RNAi technology to clustered regularly interspaced short palindromic repeat (CRISPR)-based knockout by analysis of 373 sgRNAs in 6 cells lines, and show that the on-target efficacies are comparable, but CRISPR technology is far less susceptible to systematic off-target effects. These results will help guide the proper use and analysis of loss-of-function reagents for the determination of gene function.

Posted Content
TL;DR: In this article, the authors reverse engineered ceRNA networks (ceRNETs) in breast and prostate adenocarcinomas using context-specific TCGA profiles, and tested whether ceRNA interactions can predict the effects of RNAi-mediated gene silencing perturbations in PC3 and MCF7 cells.
Abstract: Background: MicroRNAs (miRNAs) play multiple roles in tumor biology [1]. Interestingly, reports from multiple groups suggest that miRNA targets may be coupled through competitive stoichiometric sequestration [2]. Specifically, computational models predicted [3, 4] and experimental assays confirmed [5, 6] that miRNA activity is dependent on miRNA target abundance, and consequently, changes to the abundance of some miRNA targets lead to changes to the regulation and abundance of their other targets. The resulting indirect regulatory influence between miRNA targets resembles competition and has been dubbed competitive endogenous RNA (ceRNA) [5, 7, 8]. Recent studies have questioned the physiological relevance of ceRNA interactions [9], researchers ability to accurately predict these interactions [10], and the number of genes that are impacted by ceRNA interactions in specific cellular contexts [11]. Results: To address these concerns, we reverse engineered ceRNA networks (ceRNETs) in breast and prostate adenocarcinomas using context-specific TCGA profiles [12-14], and tested whether ceRNA interactions can predict the effects of RNAi-mediated gene silencing perturbations in PC3 and MCF7 cells. Our results, based on tests of thousands of inferred ceRNA interactions that are predicted to alter hundreds of cancer genes in each of the two tumor contexts, confirmed statistically significant effects for half of the predicted targets. Conclusions: Our results suggest that the expression of a significant fraction of cancer genes may be regulated by ceRNA interactions in each of the two tumor contexts.


Journal ArticleDOI
TL;DR: Clinical and genomic data are used to generate genomic landscapes in pt subgroups and to identify mechanisms of response and resistance to therapies to accelerate research.
Abstract: 1076Background: The Metastatic Breast Cancer Project is a nationwide research study, launched in Oct 2015 in collaboration with patients (pts) and advocacy groups, that directly engages pts through social media and seeks to empower them to share their experiences, clinical information, and samples to accelerate research. Methods: MBC pts enroll by providing their information at mbcproject.org. Pts are sent a saliva kit and asked to mail back a sample which is used to extract germline DNA. We contact pts medical providers and obtain medical records (MRs) and stored tumor samples. Pts may also submit a blood sample, used to extract cell free DNA (cfDNA). Whole exome sequencing (WES) is performed on tumor, germline, and cfDNA; transcriptome sequencing is performed on tumor. Clinical and genomic data are used to generate genomic landscapes in pt subgroups and to identify mechanisms of response and resistance to therapies. Data are shared widely through public databases. Pts receive regular study updates. Resu...

Posted ContentDOI
07 Sep 2017-bioRxiv
TL;DR: First-of-its-kind public resource of proteomic responses to systematically administered perturbagens demonstrated, which could be leveraged against public domain external datasets to recognize therapeutic hypotheses that are consistent with ongoing clinical trials for the treatment of multiple myeloma and acute lymphocytic leukemia.
Abstract: Though the added value of proteomic measurements to gene expression profiling has been demonstrated, profiling of gene expression on its own remains the dominant means of understanding cellular responses to perturbation. Direct protein measurements are typically limited due to issues of cost and scale; however, the recent development of high-throughput, targeted sentinel mass spectrometry assays provides an opportunity for proteomics to contribute at a meaningful scale in high-value areas for drug development. To demonstrate the feasibility of a systematic and comprehensive library of perturbational proteomic signatures, we profiled 90 drugs (in triplicate) in six cell lines using two different proteomic assays -- one measuring global changes of epigenetic marks on histone proteins and another measuring a set of peptides reporting on the phosphoproteome -- for a total of more than 3,400 samples. This effort represents a first-of-its-kind resource for proteomics. The majority of tested drugs generated reproducible responses in both phosphosignaling and chromatin states, but we observed differences in the responses that were cell line- and assay-specific. We formalized the process of comparing response signatures within the data using a concept called connectivity, which enabled us to integrate data across cell types and assays. Furthermore, it facilitated incorporation of transcriptional signatures. Consistent connectivity among cell types revealed cellular responses that transcended cell-specific effects, while consistent connectivity among assays revealed unexpected associations between drugs that were confirmed by experimental follow-up. We further demonstrated how the resource could be leveraged against public domain external datasets to recognize therapeutic hypotheses that are consistent with ongoing clinical trials for the treatment of multiple myeloma and acute lymphocytic leukemia (ALL). These data are available for download via the Gene Expression Omnibus (accession GSE101406), and web apps for interacting with this resource are available at https://clue.io/proteomics.

Posted ContentDOI
24 Jul 2017-bioRxiv
TL;DR: Several recurrent CNAs observed in primary tumors gradually disappeared in PDXs, indicating that events undergoing positive selection in humans can become dispensable during propagation in mice.
Abstract: Patient-derived xenografts (PDXs) have become a prominent model for studying human cancer in vivo. The underlying assumption is that PDXs faithfully represent the genomic features of primary tumors, retaining their molecular characteristics throughout propagation. However, the genomic stability of PDXs during passaging has not yet been evaluated systematically. Here we monitored the dynamics of copy number alterations (CNAs) in 1,110 PDX samples across 24 cancer types. We found that new CNAs accumulated quickly, such that within four passages an average of 12% of the genome was affected by newly acquired CNAs. Selection for pre-existing minor clones was a major contributor to these changes, leading to both gains and losses of CNAs. The rate of CNA acquisition in PDX models was correlated with the extent of both aneuploidy and genetic heterogeneity observed in primary tumors of the same tissue. However, the specific CNAs acquired during PDX passaging differed from those acquired during tumor evolution in patients, suggesting that PDX tumors are subjected to distinct selection pressures compared to those that exist in human hosts. Specifically, several recurrent CNAs observed in primary tumors gradually disappeared in PDXs, indicating that events undergoing positive selection in humans can become dispensable during propagation in mice. Finally, we found that the genomic stability of PDX models also affected their responses to chemotherapy and targeted drugs. Our findings thus highlight the need to couple the timing of PDX molecular characterization to that of drug testing experiments. These results suggest that while PDX models are powerful tools, they should be used with caution.

Journal ArticleDOI
TL;DR: This nationwide study seeks to empower patients to accelerate research by sharing their samples and clinical information remotely, and to identify genomic drivers and mechanisms of response and resistance to therapies.
Abstract: 1519Background: Angiosarcoma (AS) is a rare soft tissue sarcoma, with an incidence of 300/yr and a 5-year DSS of 30%. The low incidence has impeded large-scale research efforts that may lead to improved clinical outcomes. To address this, we launched a nationwide study, which seeks to empower patients (pts) to accelerate research by sharing their samples and clinical information remotely. Methods: With pts and advocacy groups we developed a website to allow AS pts to participate across the US. Pts are mailed a saliva and blood draw kit for germline and cell free (cf) DNA analysis. We then obtain medical records and stored tumor samples. Whole exome sequencing will be performed on tumor, cfDNA and saliva samples. Transcriptome analysis will be performed on tumor samples. A clinically annotated genomic database will be generated and shared widely to identify genomic drivers and mechanisms of response and resistance to therapies. Study updates will be shared with pts regularly. Results: We conducted a 3-week...