scispace - formally typeset
Search or ask a question
Author

Caty Chung

Bio: Caty Chung is an academic researcher from University of Miami. The author has contributed to research in topics: Ontology (information science) & Metadata. The author has an hindex of 14, co-authored 19 publications receiving 813 citations. Previous affiliations of Caty Chung include Scripps Research Institute & Icahn School of Medicine at Mount Sinai.

Papers
More filters
Journal ArticleDOI
Alexandra B Keenan1, Sherry L. Jenkins1, Kathleen M. Jagodnik1, Simon Koplev1, Edward He1, Denis Torre1, Zichen Wang1, Anders B. Dohlman1, Moshe C. Silverstein1, Alexander Lachmann1, Maxim V. Kuleshov1, Avi Ma'ayan1, Vasileios Stathias2, Raymond Terryn2, Daniel J. Cooper2, Michele Forlin2, Amar Koleti2, Dusica Vidovic2, Caty Chung2, Stephan C. Schürer2, Jouzas Vasiliauskas3, Marcin Pilarczyk3, Behrouz Shamsaei3, Mehdi Fazel3, Yan Ren3, Wen Niu3, Nicholas A. Clark3, Shana White3, Naim Al Mahi3, Lixia Zhang3, Michal Kouril3, John F. Reichard3, Siva Sivaganesan3, Mario Medvedovic3, Jaroslaw Meller3, Rick J. Koch1, Marc R. Birtwistle1, Ravi Iyengar1, Eric A. Sobie1, Evren U. Azeloglu1, Julia A. Kaye4, Jeannette Osterloh4, Kelly Haston4, Jaslin Kalra4, Steve Finkbiener4, Jonathan Z. Li5, Pamela Milani5, Miriam Adam5, Renan Escalante-Chong5, Karen Sachs5, Alexander LeNail5, Divya Ramamoorthy5, Ernest Fraenkel5, Gavin Daigle6, Uzma Hussain6, Alyssa Coye6, Jeffrey D. Rothstein6, Dhruv Sareen7, Loren Ornelas7, Maria G. Banuelos7, Berhan Mandefro7, Ritchie Ho7, Clive N. Svendsen7, Ryan G. Lim8, Jennifer Stocksdale8, Malcolm Casale8, Terri G. Thompson8, Jie Wu8, Leslie M. Thompson8, Victoria Dardov7, Vidya Venkatraman7, Andrea Matlock7, Jennifer E. Van Eyk7, Jacob D. Jaffe9, Malvina Papanastasiou9, Aravind Subramanian9, Todd R. Golub, Sean D. Erickson10, Mohammad Fallahi-Sichani10, Marc Hafner10, Nathanael S. Gray10, Jia-Ren Lin10, Caitlin E. Mills10, Jeremy L. Muhlich10, Mario Niepel10, Caroline E. Shamu10, Elizabeth H. Williams10, David Wrobel10, Peter K. Sorger10, Laura M. Heiser11, Joe W. Gray11, James E. Korkola11, Gordon B. Mills12, Mark A. LaBarge13, Mark A. LaBarge14, Heidi S. Feiler11, Mark A. Dane11, Elmar Bucher11, Michel Nederlof11, Damir Sudar11, Sean M. Gross11, David Kilburn11, Rebecca Smith11, Kaylyn Devlin11, Ron Margolis, Leslie Derr, Albert Lee, Ajay Pillai 
TL;DR: The LINCS program focuses on cellular physiology shared among tissues and cell types relevant to an array of diseases, including cancer, heart disease, and neurodegenerative disorders.
Abstract: The Library of Integrated Network-Based Cellular Signatures (LINCS) is an NIH Common Fund program that catalogs how human cells globally respond to chemical, genetic, and disease perturbations. Resources generated by LINCS include experimental and computational methods, visualization tools, molecular and imaging data, and signatures. By assembling an integrated picture of the range of responses of human cells exposed to many perturbations, the LINCS program aims to better understand human disease and to advance the development of new therapies. Perturbations under study include drugs, genetic perturbations, tissue micro-environments, antibodies, and disease-causing mutations. Responses to perturbations are measured by transcript profiling, mass spectrometry, cell imaging, and biochemical methods, among other assays. The LINCS program focuses on cellular physiology shared among tissues and cell types relevant to an array of diseases, including cancer, heart disease, and neurodegenerative disorders. This Perspective describes LINCS technologies, datasets, tools, and approaches to data accessibility and reusability.

300 citations

Journal ArticleDOI
TL;DR: TheLINCS Data Portal (LDP) is described, a unified web interface to access datasets generated by the LINCS DSGCs, and its underlying database, LINCS Data Registry (LDR).
Abstract: The Library of Integrated Network-based Cellular Signatures (LINCS) program is a national consortium funded by the NIH to generate a diverse and extensive reference library of cell-based perturbation-response signatures, along with novel data analytics tools to improve our understanding of human diseases at the systems level. In contrast to other large-scale data generation efforts, LINCS Data and Signature Generation Centers (DSGCs) employ a wide range of assay technologies cataloging diverse cellular responses. Integration of, and unified access to LINCS data has therefore been particularly challenging. The Big Data to Knowledge (BD2K) LINCS Data Coordination and Integration Center (DCIC) has developed data standards specifications, data processing pipelines, and a suite of end-user software tools to integrate and annotate LINCS-generated data, to make LINCS signatures searchable and usable for different types of users. Here, we describe the LINCS Data Portal (LDP) (http://lincsportal.ccs.miami.edu/), a unified web interface to access datasets generated by the LINCS DSGCs, and its underlying database, LINCS Data Registry (LDR). LINCS data served on the LDP contains extensive metadata and curated annotations. We highlight the features of the LDP user interface that is designed to enable search, browsing, exploration, download and analysis of LINCS data and related curated content.

125 citations

Journal ArticleDOI
TL;DR: The cornerstone of this update has been the decision to reprocess all high-level LINCS datasets and make them accessible at the data point level enabling users to directly access and download any subset of signatures across the entire library independent from the originating source, project or assay.
Abstract: The Library of Integrated Network-Based Cellular Signatures (LINCS) is an NIH Common Fund program with the goal of generating a large-scale and comprehensive catalogue of perturbation-response signatures by utilizing a diverse collection of perturbations across many model systems and assay types. The LINCS Data Portal (LDP) has been the primary access point for the compendium of LINCS data and has been widely utilized. Here, we report the first major update of LDP (http://lincsportal.ccs.miami.edu/signatures) with substantial changes in the data architecture and APIs, a completely redesigned user interface, and enhanced curated metadata annotations to support more advanced, intuitive and deeper querying, exploration and analysis capabilities. The cornerstone of this update has been the decision to reprocess all high-level LINCS datasets and make them accessible at the data point level enabling users to directly access and download any subset of signatures across the entire library independent from the originating source, project or assay. Access to the individual signatures also enables the newly implemented signature search functionality, which utilizes the iLINCS platform to identify conditions that mimic or reverse gene set queries. A newly designed query interface enables global metadata search with autosuggest across all annotations associated with perturbations, model systems, and signatures.

97 citations

Journal ArticleDOI
TL;DR: The evolution of BAO related to its formal structures, engineering approaches, and content is described to enable modeling of complex assays and integration with other ontologies and datasets to enable effective integration, aggregation, retrieval, and analyses of drug screening data.
Abstract: The lack of established standards to describe and annotate biological assays and screening outcomes in the domain of drug and chemical probe discovery is a severe limitation to utilize public and proprietary drug screening data to their maximum potential. We have created the BioAssay Ontology (BAO) project (http://bioassayontology.org) to develop common reference metadata terms and definitions required for describing relevant information of low-and high-throughput drug and probe screening assays and results. The main objectives of BAO are to enable effective integration, aggregation, retrieval, and analyses of drug screening data. Since we first released BAO on the BioPortal in 2010 we have considerably expanded and enhanced BAO and we have applied the ontology in several internal and external collaborative projects, for example the BioAssay Research Database (BARD). We describe the evolution of BAO with a design that enables modeling complex assays including profile and panel assays such as those in the Library of Integrated Network-based Cellular Signatures (LINCS). One of the critical questions in evolving BAO is the following: how can we provide a way to efficiently reuse and share among various research projects specific parts of our ontologies without violating the integrity of the ontology and without creating redundancies. This paper provides a comprehensive answer to this question with a description of a methodology for ontology modularization using a layered architecture. Our modularization approach defines several distinct BAO components and separates internal from external modules and domain-level from structural components. This approach facilitates the generation/extraction of derived ontologies (or perspectives) that can suit particular use cases or software applications. We describe the evolution of BAO related to its formal structures, engineering approaches, and content to enable modeling of complex assays and integration with other ontologies and datasets.

88 citations

Journal ArticleDOI
TL;DR: In this paper, the authors present metadata specifications for the most important molecular and cellular components and recommend them for adoption beyond the National Institutes of Health Library of Integrated Network-based Cellular Signatures (LINCS) program.
Abstract: The National Institutes of Health Library of Integrated Network-based Cellular Signatures (LINCS) program is generating extensive multidimensional data sets, including biochemical, genome-wide transcriptional, and phenotypic cellular response signatures to a variety of small-molecule and genetic perturbations with the goal of creating a sustainable, widely applicable, and readily accessible systems biology knowledge resource. Integration and analysis of diverse LINCS data sets depend on the availability of sufficient metadata to describe the assays and screening results and on their syntactic, structural, and semantic consistency. Here we report metadata specifications for the most important molecular and cellular components and recommend them for adoption beyond the LINCS project. We focus on the minimum required information to model LINCS assays and results based on a number of use cases, and we recommend controlled terminologies and ontologies to annotate assays with syntactic consistency and semantic integrity. We also report specifications for a simple annotation format (SAF) to describe assays and screening results based on our metadata specifications with explicit controlled vocabularies. SAF specifically serves to programmatically access and exchange LINCS data as a prerequisite for a distributed information management infrastructure. We applied the metadata specifications to annotate large numbers of LINCS cell lines, proteins, and small molecules. The resources generated and presented here are freely available.

76 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: ChEMBL is an open large-scale bioactivity database that includes the annotation of assays and targets using ontologies, the inclusion of targets and indications for clinical candidates, addition of metabolic pathways for drugs and calculation of structural alerts.
Abstract: ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 and 2014 Nucleic Acids Research Database Issues. Since then, alongside the continued extraction of data from the medicinal chemistry literature, new sources of bioactivity data have also been added to the database. These include: deposited data sets from neglected disease screening; crop protection data; drug metabolism and disposition data and bioactivity data from patents. A number of improvements and new features have also been incorporated. These include the annotation of assays and targets using ontologies, the inclusion of targets and indications for clinical candidates, addition of metabolic pathways for drugs and calculation of structural alerts. The ChEMBL data can be accessed via a web-interface, RDF distribution, data downloads and RESTful web-services.

1,601 citations

Journal ArticleDOI
TL;DR: This update describes content expansion, new features and interoperability improvements introduced in the 10 releases since August 2015, and introduces the newly funded project for the Guide to IMMUNOPHARMACOLOGY (GtoImmuPdb, www.guidetoimmunopharmacology.org).
Abstract: The IUPHAR/BPS Guide to PHARMACOLOGY (GtoPdb, www.guidetopharmacology.org) and its precursor IUPHAR-DB, have captured expert-curated interactions between targets and ligands from selected papers in pharmacology and drug discovery since 2003. This resource continues to be developed in conjunction with the International Union of Basic and Clinical Pharmacology (IUPHAR) and the British Pharmacological Society (BPS). As previously described, our unique model of content selection and quality control is based on 96 target-class subcommittees comprising 512 scientists collaborating with in-house curators. This update describes content expansion, new features and interoperability improvements introduced in the 10 releases since August 2015. Our relationship matrix now describes ∼9000 ligands, ∼15 000 binding constants, ∼6000 papers and ∼1700 human proteins. As an important addition, we also introduce our newly funded project for the Guide to IMMUNOPHARMACOLOGY (GtoImmuPdb, www.guidetoimmunopharmacology.org). This has been 'forked' from the well-established GtoPdb data model and expanded into new types of data related to the immune system and inflammatory processes. This includes new ligands, targets, pathways, cell types and diseases for which we are recruiting new IUPHAR expert committees. Designed as an immunopharmacological gateway, it also has an emphasis on potential therapeutic interventions.

1,466 citations

Journal ArticleDOI
TL;DR: A new dedicated aspect of BioGRID annotates genome-wide CRISPR/Cas9-based screens that report gene–phenotype and gene–gene relationships, and captures chemical interaction data, including chemical–protein interactions for human drug targets drawn from the DrugBank database and manually curated bioactive compounds reported in the literature.
Abstract: The Biological General Repository for Interaction Datasets (BioGRID: https://thebiogrid.org) is an open access database dedicated to the curation and archival storage of protein, genetic and chemical interactions for all major model organism species and humans. As of September 2018 (build 3.4.164), BioGRID contains records for 1 598 688 biological interactions manually annotated from 55 809 publications for 71 species, as classified by an updated set of controlled vocabularies for experimental detection methods. BioGRID also houses records for >700 000 post-translational modification sites. BioGRID now captures chemical interaction data, including chemical-protein interactions for human drug targets drawn from the DrugBank database and manually curated bioactive compounds reported in the literature. A new dedicated aspect of BioGRID annotates genome-wide CRISPR/Cas9-based screens that report gene-phenotype and gene-gene relationships. An extension of the BioGRID resource called the Open Repository for CRISPR Screens (ORCS) database (https://orcs.thebiogrid.org) currently contains over 500 genome-wide screens carried out in human or mouse cell lines. All data in BioGRID is made freely available without restriction, is directly downloadable in standard formats and can be readily incorporated into existing applications via our web service platforms. BioGRID data are also freely distributed through partner model organism databases and meta-databases.

1,046 citations

Journal ArticleDOI
TL;DR: Several important improvements have been made to ChEMBL in the last two years, including more robust capture and representation of assay details; a new data deposition system, allowing updating of data sets and deposition of supplementary data; and a completely redesigned web interface, with enhanced search and filtering capabilities.
Abstract: ChEMBL is a large, open-access bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012, 2014 and 2017 Nucleic Acids Research Database Issues. In the last two years, several important improvements have been made to the database and are described here. These include more robust capture and representation of assay details; a new data deposition system, allowing updating of data sets and deposition of supplementary data; and a completely redesigned web interface, with enhanced search and filtering capabilities.

993 citations

Journal ArticleDOI
TL;DR: Results strongly suggest solution-processed small molecular materials are excellent candidates for organic solar cells, using single junction and double junction tandem solar cells.
Abstract: A two-dimensional conjugated small molecule (SMPV1) was designed and synthesized for high performance solution-processed organic solar cells. This study explores the photovoltaic properties of this molecule as a donor, with a fullerene derivative as an acceptor, using solution processing in single junction and double junction tandem solar cells. The single junction solar cells based on SMPV1 exhibited a certified power conversion efficiency of 8.02% under AM 1.5 G irradiation (100 mW cm−2). A homo-tandem solar cell based on SMPV1 was constructed with a novel interlayer (or tunnel junction) consisting of bilayer conjugated polyelectrolyte, demonstrating an unprecedented PCE of 10.1%. These results strongly suggest solution-processed small molecular materials are excellent candidates for organic solar cells.

566 citations