scispace - formally typeset
Search or ask a question
Author

Annika L. Gable

Bio: Annika L. Gable is an academic researcher from Swiss Institute of Bioinformatics. The author has contributed to research in topics: String (computer science) & Metastasis. The author has an hindex of 6, co-authored 7 publications receiving 6167 citations. Previous affiliations of Annika L. Gable include Heidelberg Institute for Theoretical Studies & Watson School of Biological Sciences.

Papers
More filters
Journal ArticleDOI
TL;DR: The latest version of STRING more than doubles the number of organisms it covers, and offers an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input.
Abstract: Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein-protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein-protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.

10,584 citations

Journal ArticleDOI
TL;DR: Changes to the text-mining system, a new scoring-mode for physical interactions, as well as extensive user interface features for customizing, extending and sharing protein networks are described.
Abstract: Cellular life depends on a complex web of functional associations between biomolecules. Among these associations, protein-protein interactions are particularly important due to their versatility, specificity and adaptability. The STRING database aims to integrate all known and predicted associations between proteins, including both physical interactions as well as functional associations. To achieve this, STRING collects and scores evidence from a number of sources: (i) automated text mining of the scientific literature, (ii) databases of interaction experiments and annotated complexes/pathways, (iii) computational interaction predictions from co-expression and from conserved genomic context and (iv) systematic transfers of interaction evidence from one organism to another. STRING aims for wide coverage; the upcoming version 11.5 of the resource will contain more than 14 000 organisms. In this update paper, we describe changes to the text-mining system, a new scoring-mode for physical interactions, as well as extensive user interface features for customizing, extending and sharing protein networks. In addition, we describe how to query STRING with genome-wide, experimental data, including the automated detection of enriched functionalities and potential biases in the user's query data. The STRING resource is available online, at https://string-db.org/.

3,253 citations

Journal ArticleDOI
15 Feb 2018-Nature
TL;DR: It is shown that asparagine bioavailability strongly influences metastatic potential, providing at least one potential mechanism for how the bioavailability of a single amino acid could regulate metastatic progression.
Abstract: Using a functional model of breast cancer heterogeneity, we previously showed that clonal sub-populations proficient at generating circulating tumour cells were not all equally capable of forming metastases at secondary sites. A combination of differential expression and focused in vitro and in vivo RNA interference screens revealed candidate drivers of metastasis that discriminated metastatic clones. Among these, asparagine synthetase expression in a patient's primary tumour was most strongly correlated with later metastatic relapse. Here we show that asparagine bioavailability strongly influences metastatic potential. Limiting asparagine by knockdown of asparagine synthetase, treatment with l-asparaginase, or dietary asparagine restriction reduces metastasis without affecting growth of the primary tumour, whereas increased dietary asparagine or enforced asparagine synthetase expression promotes metastatic progression. Altering asparagine availability in vitro strongly influences invasive potential, which is correlated with an effect on proteins that promote the epithelial-to-mesenchymal transition. This provides at least one potential mechanism for how the bioavailability of a single amino acid could regulate metastatic progression.

326 citations

Journal ArticleDOI
16 Apr 2015-Nature
TL;DR: A polyclonal mouse model of breast tumour heterogeneity is developed, and it is proposed that vascular mimicry drives the ability of some breast tumours to contribute to distant metastases while simultaneously satisfying a critical need of the primary tumour to be fed by the vasculature.
Abstract: Cancer metastasis requires that primary tumour cells evolve the capacity to intravasate into the lymphatic system or vasculature, and extravasate into and colonize secondary sites. Others have demonstrated that individual cells within complex populations show heterogeneity in their capacity to form secondary lesions. Here we develop a polyclonal mouse model of breast tumour heterogeneity, and show that distinct clones within a mixed population display specialization, for example, dominating the primary tumour, contributing to metastatic populations, or showing tropism for entering the lymphatic or vasculature systems. We correlate these stable properties to distinct gene expression profiles. Those clones that efficiently enter the vasculature express two secreted proteins, Serpine2 and Slpi, which were necessary and sufficient to program these cells for vascular mimicry. Our data indicate that these proteins not only drive the formation of extravascular networks but also ensure their perfusion by acting as anticoagulants. We propose that vascular mimicry drives the ability of some breast tumour cells to contribute to distant metastases while simultaneously satisfying a critical need of the primary tumour to be fed by the vasculature. Enforced expression of SERPINE2 and SLPI in human breast cancer cell lines also programmed them for vascular mimicry, and SERPINE2 and SLPI were overexpressed preferentially in human patients that had lung-metastatic relapse. Thus, these two secreted proteins, and the phenotype they promote, may be broadly relevant as drivers of metastatic progression in human cancer.

325 citations

Journal ArticleDOI
TL;DR: STRING as mentioned in this paper collects and integrates protein-protein interactions, both physical interactions as well as functional associations, from a number of sources: automated text mining of the scientific literature, computational interaction predictions from co-expression, conserved genomic context, databases of interaction experiments and known complexes/pathways from curated sources.
Abstract: Abstract Much of the complexity within cells arises from functional and regulatory interactions among proteins. The core of these interactions is increasingly known, but novel interactions continue to be discovered, and the information remains scattered across different database resources, experimental modalities and levels of mechanistic detail. The STRING database (https://string-db.org/) systematically collects and integrates protein–protein interactions—both physical interactions as well as functional associations. The data originate from a number of sources: automated text mining of the scientific literature, computational interaction predictions from co-expression, conserved genomic context, databases of interaction experiments and known complexes/pathways from curated sources. All of these interactions are critically assessed, scored, and subsequently automatically transferred to less well-studied organisms using hierarchical orthology information. The data can be accessed via the website, but also programmatically and via bulk downloads. The most recent developments in STRING (version 12.0) are: (i) it is now possible to create, browse and analyze a full interaction network for any novel genome of interest, by submitting its complement of encoded proteins, (ii) the co-expression channel now uses variational auto-encoders to predict interactions, and it covers two new sources, single-cell RNA-seq and experimental proteomics data and (iii) the confidence in each experimentally derived interaction is now estimated based on the detection method used, and communicated to the user in the web-interface. Furthermore, STRING continues to enhance its facilities for functional enrichment analysis, which are now fully available also for user-submitted genomes.

127 citations


Cited by
More filters
Journal ArticleDOI
28 May 2020-Cell
TL;DR: It is proposed that reduced innate antiviral defenses coupled with exuberant inflammatory cytokine production are the defining and driving features of COVID-19.

3,286 citations

Journal ArticleDOI
TL;DR: In this article, a tool called CellChat is developed to quantitatively infer and analyze intercellular communication networks from single-cell RNA-sequencing (scRNA-seq) data.
Abstract: Understanding global communications among cells requires accurate representation of cell-cell signaling links and effective systems-level analyses of those links. We construct a database of interactions among ligands, receptors and their cofactors that accurately represent known heteromeric molecular complexes. We then develop CellChat, a tool that is able to quantitatively infer and analyze intercellular communication networks from single-cell RNA-sequencing (scRNA-seq) data. CellChat predicts major signaling inputs and outputs for cells and how those cells and signals coordinate for functions using network analysis and pattern recognition approaches. Through manifold learning and quantitative contrasts, CellChat classifies signaling pathways and delineates conserved and context-specific pathways across different datasets. Applying CellChat to mouse and human skin datasets shows its ability to extract complex signaling patterns. Our versatile and easy-to-use toolkit CellChat and a web-based Explorer ( http://www.cellchat.org/ ) will help discover novel intercellular communications and build cell-cell communication atlases in diverse tissues.

1,141 citations

Posted Content
TL;DR: The OGB datasets are large-scale, encompass multiple important graph ML tasks, and cover a diverse range of domains, ranging from social and information networks to biological networks, molecular graphs, source code ASTs, and knowledge graphs, indicating fruitful opportunities for future research.
Abstract: We present the Open Graph Benchmark (OGB), a diverse set of challenging and realistic benchmark datasets to facilitate scalable, robust, and reproducible graph machine learning (ML) research. OGB datasets are large-scale, encompass multiple important graph ML tasks, and cover a diverse range of domains, ranging from social and information networks to biological networks, molecular graphs, source code ASTs, and knowledge graphs. For each dataset, we provide a unified evaluation protocol using meaningful application-specific data splits and evaluation metrics. In addition to building the datasets, we also perform extensive benchmark experiments for each dataset. Our experiments suggest that OGB datasets present significant challenges of scalability to large-scale graphs and out-of-distribution generalization under realistic data splits, indicating fruitful opportunities for future research. Finally, OGB provides an automated end-to-end graph ML pipeline that simplifies and standardizes the process of graph data loading, experimental setup, and model evaluation. OGB will be regularly updated and welcomes inputs from the community. OGB datasets as well as data loaders, evaluation scripts, baseline code, and leaderboards are publicly available at this https URL .

1,097 citations

Posted ContentDOI
22 Jul 2020-bioRxiv
TL;DR: A database of interactions among ligands, receptors and their cofactors that accurately represents known heteromeric molecular complexes is constructed and a tool that is able to quantitively infer and analyze intercellular communication networks from single-cell RNA-sequencing (scRNA-seq) data is developed.
Abstract: Understanding global communications among cells requires accurate representation of cell-cell signaling links and effective systems-level analyses of those links. We constructed a database of interactions among ligands, receptors and their cofactors that accurately represents known heteromeric molecular complexes. Based on mass action models, we then developed CellChat, a tool that is able to quantitively infer and analyze intercellular communication networks from single-cell RNA-sequencing (scRNA-seq) data. CellChat predicts major signaling inputs and outputs for cells and how those cells and signals coordinate for functions using network analysis and pattern recognition approaches. Through manifold learning and quantitative contrasts, CellChat classifies signaling pathways and delineates conserved and context-specific pathways across different datasets. Applications of CellChat to several mouse skin scRNA-seq datasets for embryonic development and adult wound healing shows its ability to extract complex signaling patterns, both previously known as well as novel. Our versatile and easy-to-use toolkit CellChat and a web-based Explorer (http://www.cellchat.org/) will help discover novel intercellular communications and build a cell-cell communication atlas in diverse tissues.

975 citations

Journal ArticleDOI
TL;DR: SMART version 9 contains manually curated models for more than 1300 protein domains, with a topical set of 68 new models added since the last update article, greatly increasing the total number of annotated domains and other protein features available in architecture analysis mode.
Abstract: SMART (Simple Modular Architecture Research Tool) is a web resource (https://smart.embl.de) for the identification and annotation of protein domains and the analysis of protein domain architectures. SMART version 9 contains manually curated models for more than 1300 protein domains, with a topical set of 68 new models added since our last update article (1). All the new models are for diverse recombinase families and subfamilies and as a set they provide a comprehensive overview of mobile element recombinases namely transposase, integrase, relaxase, resolvase, cas1 casposase and Xer like cellular recombinase. Further updates include the synchronization of the underlying protein databases with UniProt (2), Ensembl (3) and STRING (4), greatly increasing the total number of annotated domains and other protein features available in architecture analysis mode. Furthermore, SMART's vector-based protein display engine has been extended and updated to use the latest web technologies and the domain architecture analysis components have been optimized to handle the increased number of protein features available.

898 citations