scispace - formally typeset
Search or ask a question
Author

Ryan O’Rourke

Other affiliations: Dana Corporation, Broad Institute
Bio: Ryan O’Rourke is an academic researcher from Harvard University. The author has contributed to research in topics: Sequence assembly & Cyclin-dependent kinase 6. The author has an hindex of 7, co-authored 9 publications receiving 829 citations. Previous affiliations of Ryan O’Rourke include Dana Corporation & Broad Institute.

Papers
More filters
Journal ArticleDOI
08 Aug 2018-Nature
TL;DR: The extent, origins and consequences of genetic variation within human cell lines are studied, providing a framework for researchers to measure such variation in efforts to support maximally reproducible cancer research.
Abstract: Human cancer cell lines are the workhorse of cancer research. Although cell lines are known to evolve in culture, the extent of the resultant genetic and transcriptional heterogeneity and its functional consequences remain understudied. Here we use genomic analyses of 106 human cell lines grown in two laboratories to show extensive clonal diversity. Further comprehensive genomic characterization of 27 strains of the common breast cancer cell line MCF7 uncovered rapid genetic diversification. Similar results were obtained with multiple strains of 13 additional cell lines. Notably, genetic changes were associated with differential activation of gene expression programs and marked differences in cell morphology and proliferation. Barcoding experiments showed that cell line evolution occurs as a result of positive clonal selection that is highly sensitive to culture conditions. Analyses of single-cell-derived clones demonstrated that continuous instability quickly translates into heterogeneity of the cell line. When the 27 MCF7 strains were tested against 321 anti-cancer compounds, we uncovered considerably different drug responses: at least 75% of compounds that strongly inhibited some strains were completely inactive in others. This study documents the extent, origins and consequences of genetic variation within cell lines, and provides a framework for researchers to measure such variation in efforts to support maximally reproducible cancer research.

601 citations

Journal ArticleDOI
TL;DR: SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs and substantially improves detection performance for variants in the 20-300 bp range, compared with existing methods.
Abstract: Structural variants (SVs), including small insertion and deletion variants (indels), are challenging to detect through standard alignment-based variant calling methods. Sequence assembly offers a powerful approach to identifying SVs, but is difficult to apply at scale genome-wide for SV detection due to its computational complexity and the difficulty of extracting SVs from assembly contigs. We describe SvABA, an efficient and accurate method for detecting SVs from short-read sequencing data using genome-wide local assembly with low memory and computing requirements. We evaluated SvABA's performance on the NA12878 human genome and in simulated and real cancer genomes. SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs and substantially improves detection performance for variants in the 20-300 bp range, compared with existing methods. SvABA also identifies complex somatic rearrangements with chains of short (<1000 bp) templated-sequence insertions copied from distant genomic regions. We applied SvABA to 344 cancer genomes from 11 cancer types and found that short templated-sequence insertions occur in ∼4% of all somatic rearrangements. Finally, we demonstrate that SvABA can identify sites of viral integration and cancer driver alterations containing medium-sized (50-300 bp) SVs.

251 citations

Journal ArticleDOI
TL;DR: This work has identified MYB-QKI fusions as a specific and single candidate driver event in angiocentric gliomas and represents the first example of a single driver rearrangement simultaneously transforming cells via three genetic and epigenetic mechanisms in a tumor.
Abstract: Angiocentric gliomas are pediatric low-grade gliomas (PLGGs) without known recurrent genetic drivers. We performed genomic analysis of new and published data from 249 PLGGs, including 19 angiocentric gliomas. We identified MYB-QKI fusions as a specific and single candidate driver event in angiocentric gliomas. In vitro and in vivo functional studies show that MYB-QKI rearrangements promote tumorigenesis through three mechanisms: MYB activation by truncation, enhancer translocation driving aberrant MYB-QKI expression and hemizygous loss of the tumor suppressor QKI. To our knowledge, this represents the first example of a single driver rearrangement simultaneously transforming cells via three genetic and epigenetic mechanisms in a tumor.

194 citations

Posted ContentDOI
01 Feb 2017-bioRxiv
TL;DR: SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs, and substantially improved detection performance for variants in the 20-300 bp range, compared with existing methods.
Abstract: Structural variants (SVs), including small insertion and deletion variants (indels), are challenging to detect through standard alignment-based variant calling methods. Sequence assembly offers a powerful approach to identifying SVs, but is difficult to apply at-scale genome-wide for SV detection due to its computational complexity and the difficulty of extracting SVs from assembly contigs. We describe SvABA, an efficient and accurate method for detecting SVs from short-read sequencing data using genome-wide local assembly with low memory and computing requirements. We evaluated SvABA's performance on the NA12878 human genome and in simulated and real cancer genomes. SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs, and substantially improved detection performance for variants in the 20-300 bp range, compared with existing methods. SvABA also identifies complex somatic rearrangements with chains of short (< 1,000 bp) templated-sequence insertions copied from distant genomic regions. We applied SvABA to 344 cancer genomes from 11 cancer types, and found that templated-sequence insertions occur in ~4% of all somatic rearrangements. Finally, we demonstrate that SvABA can identify sites of viral integration and cancer driver alterations containing medium-sized SVs.

88 citations

Journal ArticleDOI
TL;DR: The combination of OncoPanel and OncoCopy multiplex genomic assays can identify critical diagnostic, prognostic, and treatment-relevant alterations and represents an effective precision medicine approach for clinical evaluation of pediatric brain tumors.
Abstract: Background Clinical genomics platforms are needed to identify targetable alterations, but implementation of these technologies and best practices in routine clinical pediatric oncology practice are not yet well established. Methods Profile is an institution-wide prospective clinical research initiative that uses targeted sequencing to identify targetable alterations in tumors. OncoPanel, a multiplexed targeted exome-sequencing platform that includes 300 cancer-causing genes, was used to assess single nucleotide variants and rearrangements/indels. Alterations were annotated (Tiers 1-4) based on clinical significance, with Tier 1 alterations having well-established clinical utility. OncoCopy, a clinical genome-wide array comparative genomic hybridization (aCGH) assay, was also performed to evaluate copy number alterations and better define rearrangement breakpoints. Results Cancer genomes of 203 pediatric brain tumors were profiled across histological subtypes, including 117 samples analyzed by OncoPanel, 146 by OncoCopy, and 60 tumors subjected to both methodologies. OncoPanel revealed clinically relevant alterations in 56% of patients (44 cancer mutations and 20 rearrangements), including BRAF alterations that directed the use of targeted inhibitors. Rearrangements in MYB-QKI, MYBL1, BRAF, and FGFR1 were also detected. Furthermore, while copy number profiles differed across histologies, the combined use of OncoPanel and OncoCopy identified subgroup-specific alterations in 89% (17/19) of medulloblastomas. Conclusion The combination of OncoPanel and OncoCopy multiplex genomic assays can identify critical diagnostic, prognostic, and treatment-relevant alterations and represents an effective precision medicine approach for clinical evaluation of pediatric brain tumors.

66 citations


Cited by
More filters
01 Jan 2011
TL;DR: The sheer volume and scope of data posed by this flood of data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.
Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

2,187 citations

Journal ArticleDOI
08 May 2019-Nature
TL;DR: The original Cancer Cell Line Encyclopedia is expanded with deeper characterization of over 1,000 cell lines, including genomic, transcriptomic, and proteomic data, and integration with drug-sensitivity and gene-dependency data, which reveals potential targets for cancer drugs and associated biomarkers.
Abstract: Large panels of comprehensively characterized human cancer models, including the Cancer Cell Line Encyclopedia (CCLE), have provided a rigorous framework with which to study genetic variants, candidate targets, and small-molecule and biological therapeutics and to identify new marker-driven cancer dependencies. To improve our understanding of the molecular features that contribute to cancer phenotypes, including drug responses, here we have expanded the characterizations of cancer cell lines to include genetic, RNA splicing, DNA methylation, histone H3 modification, microRNA expression and reverse-phase protein array data for 1,072 cell lines from individuals of various lineages and ethnicities. Integration of these data with functional characterizations such as drug-sensitivity, short hairpin RNA knockdown and CRISPR-Cas9 knockout data reveals potential targets for cancer drugs and associated biomarkers. Together, this dataset and an accompanying public data portal provide a resource for the acceleration of cancer research using model cancer cell lines.

1,801 citations

Journal ArticleDOI
Peter J. Campbell1, Gad Getz2, Jan O. Korbel3, Joshua M. Stuart4  +1329 moreInstitutions (238)
06 Feb 2020-Nature
TL;DR: The flagship paper of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium describes the generation of the integrative analyses of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types, the structures for international data sharing and standardized analyses, and the main scientific findings from across the consortium studies.
Abstract: Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale1,2,3. Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4–5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter4; identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation5,6; analyses timings and patterns of tumour evolution7; describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity8,9; and evaluates a range of more-specialized features of cancer genomes8,10,11,12,13,14,15,16,17,18.

1,600 citations

Journal ArticleDOI
TL;DR: A comprehensive genetic analysis of 304 primary DLBCLs identified low-frequency alterations, captured recurrent mutations, somatic copy number alterations, and structural variants, and defined coordinate signatures in patients with available outcome data to provide a roadmap for an actionableDLBCL classification.
Abstract: Diffuse large B cell lymphoma (DLBCL), the most common lymphoid malignancy in adults, is a clinically and genetically heterogeneous disease that is further classified into transcriptionally defined activated B cell (ABC) and germinal center B cell (GCB) subtypes. We carried out a comprehensive genetic analysis of 304 primary DLBCLs and identified low-frequency alterations, captured recurrent mutations, somatic copy number alterations, and structural variants, and defined coordinate signatures in patients with available outcome data. We integrated these genetic drivers using consensus clustering and identified five robust DLBCL subsets, including a previously unrecognized group of low-risk ABC-DLBCLs of extrafollicular/marginal zone origin; two distinct subsets of GCB-DLBCLs with different outcomes and targetable alterations; and an ABC/GCB-independent group with biallelic inactivation of TP53, CDKN2A loss, and associated genomic instability. The genetic features of the newly characterized subsets, their mutational signatures, and the temporal ordering of identified alterations provide new insights into DLBCL pathogenesis. The coordinate genetic signatures also predict outcome independent of the clinical International Prognostic Index and suggest new combination treatment strategies. More broadly, our results provide a roadmap for an actionable DLBCL classification.

1,081 citations

Journal ArticleDOI
TL;DR: The 2018 Catalogue of Somatic Mutations in Cancer (COSMIC) Cancer Gene Census (CGC) is discussed, an expert-curated description of human cancer genes, which has recently been expanded to include functional descriptions of how each gene contributes to cancer.
Abstract: The Catalogue of Somatic Mutations in Cancer (COSMIC) Cancer Gene Census (CGC) is an expert-curated description of the genes driving human cancer that is used as a standard in cancer genetics across basic research, medical reporting and pharmaceutical development. After a major expansion and complete re-evaluation, the 2018 CGC describes in detail the effect of 719 cancer-driving genes. The recent expansion includes functional and mechanistic descriptions of how each gene contributes to disease generation in terms of the key cancer hallmarks and the impact of mutations on gene and protein function. These functional characteristics depict the extraordinary complexity of cancer biology and suggest multiple cancer-related functions for many genes, which are often highly tissue-dependent or tumour stage-dependent. The 2018 CGC encompasses a second tier, describing an expanding list of genes (currently 145) from more recent cancer studies that show supportive but less detailed indications of a role in cancer.

895 citations