Author

Denis Torre

Bio: Denis Torre is an academic researcher at the Icahn School of Medicine at Mount Sinai whose research covers topics including Medicine and Biology. The author has an h-index of 13 and has co-authored 26 publications that have received 1066 citations. Previous affiliations of Denis Torre include the University of Miami.

Topics: Medicine, Biology, Downregulation and upregulation, Executable, Population ...

Papers

Journal Article•DOI•

Massive mining of publicly available RNA-seq data from human and mouse.

Alexander Lachmann¹, Denis Torre¹, Alexandra B Keenan¹, Kathleen M. Jagodnik¹, Hoyjin J. Lee¹, Lily Wang¹, Moshe C. Silverstein¹, Avi Ma'ayan¹

Icahn School of Medicine at Mount Sinai¹

10 Apr 2018-Nature Communications

TL;DR: The authors develop ARCHS4, a high-throughput processing infrastructure and search database that provides processed RNA-seq data for 187,946 publicly available mouse and human samples to support exploration and reuse.

Abstract: RNA sequencing (RNA-seq) is the leading technology for genome-wide transcript quantification. However, publicly available RNA-seq data is currently provided mostly in raw form, a significant barrier for global and integrative retrospective analyses. ARCHS4 is a web resource that makes the majority of published RNA-seq data from human and mouse available at the gene and transcript levels. For developing ARCHS4, available FASTQ files from RNA-seq experiments from the Gene Expression Omnibus (GEO) were aligned using a cloud-based infrastructure. In total 187,946 samples are accessible through ARCHS4 with 103,083 mouse and 84,863 human. Additionally, the ARCHS4 web interface provides intuitive exploration of the processed data through querying tools, interactive visualization, and gene pages that provide average expression across cell lines and tissues, top co-expressed genes for each gene, and predicted biological functions and protein–protein interactions for each gene based on prior knowledge combined with co-expression.

428 citations

Journal Article•DOI•

ChEA3: transcription factor enrichment analysis by orthogonal omics integration.

Alexandra B Keenan¹, Denis Torre¹, Alexander Lachmann¹, Ariel K Leong¹, Megan L. Wojciechowicz¹, Vivian Utti¹, Kathleen M. Jagodnik¹, Eryk Kropiwnicki¹, Zichen Wang¹, Avi Ma'ayan¹

Icahn School of Medicine at Mount Sinai¹

02 Jul 2019-Nucleic Acids Research

TL;DR: The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF–gene co-expression from RNA-seq studies, TF–target associations from ChIP-seq experiments, and TF–gene co-occurrence computed from crowd-submitted gene lists; integrating these libraries illuminates general transcription factor properties such as whether the TF behaves as an activator or a repressor.

Abstract: Identifying the transcription factors (TFs) responsible for observed changes in gene expression is an important step in understanding gene regulatory networks. ChIP-X Enrichment Analysis 3 (ChEA3) is a transcription factor enrichment analysis tool that ranks TFs associated with user-submitted gene sets. The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF-gene co-expression from RNA-seq studies, TF-target associations from ChIP-seq experiments, and TF-gene co-occurrence computed from crowd-submitted gene lists. Enrichment results from these distinct sources are integrated to generate a composite rank that improves the prediction of the correct upstream TF compared to ranks produced by individual libraries. We compare ChEA3 with existing TF prediction tools and show that ChEA3 performs better. By integrating the ChEA3 libraries, we illuminate general transcription factor properties such as whether the TF behaves as an activator or a repressor. The ChEA3 web-server is available from https://amp.pharm.mssm.edu/ChEA3.

379 citations

Journal Article•DOI•

The Library of Integrated Network-Based Cellular Signatures NIH Program: System-Level Cataloging of Human Cells Response to Perturbations

Alexandra B Keenan¹, Sherry L. Jenkins¹, Kathleen M. Jagodnik¹, Simon Koplev¹, Edward He¹, Denis Torre¹, Zichen Wang¹, Anders B. Dohlman¹, Moshe C. Silverstein¹, Alexander Lachmann¹, Maxim V. Kuleshov¹, Avi Ma'ayan¹, Vasileios Stathias², Raymond Terryn², Daniel J. Cooper², Michele Forlin², Amar Koleti², Dusica Vidovic², Caty Chung², Stephan C. Schürer², Jouzas Vasiliauskas³, Marcin Pilarczyk³, Behrouz Shamsaei³, Mehdi Fazel³, Yan Ren³, Wen Niu³, Nicholas A. Clark³, Shana White³, Naim Al Mahi³, Lixia Zhang³, Michal Kouril³, John F. Reichard³, Siva Sivaganesan³, Mario Medvedovic³, Jaroslaw Meller³, Rick J. Koch¹, Marc R. Birtwistle¹, Ravi Iyengar¹, Eric A. Sobie¹, Evren U. Azeloglu¹, Julia A. Kaye⁴, Jeannette Osterloh⁴, Kelly Haston⁴, Jaslin Kalra⁴, Steve Finkbiener⁴, Jonathan Z. Li⁵, Pamela Milani⁵, Miriam Adam⁵, Renan Escalante-Chong⁵, Karen Sachs⁵, Alexander LeNail⁵, Divya Ramamoorthy⁵, Ernest Fraenkel⁵, Gavin Daigle⁶, Uzma Hussain⁶, Alyssa Coye⁶, Jeffrey D. Rothstein⁶, Dhruv Sareen⁷, Loren Ornelas⁷, Maria G. Banuelos⁷, Berhan Mandefro⁷, Ritchie Ho⁷, Clive N. Svendsen⁷, Ryan G. Lim⁸, Jennifer Stocksdale⁸, Malcolm Casale⁸, Terri G. Thompson⁸, Jie Wu⁸, Leslie M. Thompson⁸, Victoria Dardov⁷, Vidya Venkatraman⁷, Andrea Matlock⁷, Jennifer E. Van Eyk⁷, Jacob D. Jaffe⁹, Malvina Papanastasiou⁹, Aravind Subramanian⁹, Todd R. Golub, Sean D. Erickson¹⁰, Mohammad Fallahi-Sichani¹⁰, Marc Hafner¹⁰, Nathanael S. Gray¹⁰, Jia-Ren Lin¹⁰, Caitlin E. Mills¹⁰, Jeremy L. Muhlich¹⁰, Mario Niepel¹⁰, Caroline E. Shamu¹⁰, Elizabeth H. Williams¹⁰, David Wrobel¹⁰, Peter K. Sorger¹⁰, Laura M. Heiser¹¹, Joe W. Gray¹¹, James E. Korkola¹¹, Gordon B. Mills¹², Mark A. LaBarge¹³,¹⁴, Heidi S. Feiler¹¹, Mark A. Dane¹¹, Elmar Bucher¹¹, Michel Nederlof¹¹, Damir Sudar¹¹, Sean M. Gross¹¹, David Kilburn¹¹, Rebecca Smith¹¹, Kaylyn Devlin¹¹, Ron Margolis, Leslie Derr, Albert Lee, Ajay Pillai

Icahn School of Medicine at Mount Sinai¹, University of Miami², University of Cincinnati³, University of California, San Francisco⁴, Massachusetts Institute of Technology⁵, Johns Hopkins University⁶, Cedars-Sinai Medical Center⁷, University of California, Irvine⁸, Broad Institute⁹, Harvard University¹⁰, Oregon Health & Science University¹¹, University of Texas MD Anderson Cancer Center¹², City of Hope National Medical Center¹³, University of Bergen¹⁴

29 Nov 2017-Cell Systems

TL;DR: The LINCS program focuses on cellular physiology shared among tissues and cell types relevant to an array of diseases, including cancer, heart disease, and neurodegenerative disorders.

Abstract: The Library of Integrated Network-Based Cellular Signatures (LINCS) is an NIH Common Fund program that catalogs how human cells globally respond to chemical, genetic, and disease perturbations. Resources generated by LINCS include experimental and computational methods, visualization tools, molecular and imaging data, and signatures. By assembling an integrated picture of the range of responses of human cells exposed to many perturbations, the LINCS program aims to better understand human disease and to advance the development of new therapies. Perturbations under study include drugs, genetic perturbations, tissue micro-environments, antibodies, and disease-causing mutations. Responses to perturbations are measured by transcript profiling, mass spectrometry, cell imaging, and biochemical methods, among other assays. The LINCS program focuses on cellular physiology shared among tissues and cell types relevant to an array of diseases, including cancer, heart disease, and neurodegenerative disorders. This Perspective describes LINCS technologies, datasets, tools, and approaches to data accessibility and reusability.

300 citations

Journal Article•DOI•

BioJupies: Automated Generation of Interactive Notebooks for RNA-Seq Data Analysis in the Cloud.

Denis Torre¹, Alexander Lachmann¹, Avi Ma'ayan¹

Icahn School of Medicine at Mount Sinai¹

28 Nov 2018-Cell Systems

TL;DR: By providing an intuitive user interface for notebook generation for RNA-seq data analysis, starting from the raw reads all the way to a complete interactive and reproducible report, BioJupies is a useful resource for experimental and computational biologists.

Abstract: BioJupies is a web application that enables the automated creation, storage, and deployment of Jupyter Notebooks containing RNA-seq data analyses. Through an intuitive interface, novice users can rapidly generate tailored reports to analyze and visualize their own raw sequencing files, gene expression tables, or fetch data from >9,000 published studies containing >300,000 preprocessed RNA-seq samples. Generated notebooks have the executable code of the entire pipeline, rich narrative text, interactive data visualizations, differential expression, and enrichment analyses. The notebooks are permanently stored in the cloud and made available online through a persistent URL. The notebooks are downloadable, customizable, and can run within a Docker container. By providing an intuitive user interface for notebook generation for RNA-seq data analysis, starting from the raw reads all the way to a complete interactive and reproducible report, BioJupies is a useful resource for experimental and computational biologists. BioJupies is freely available as a web-based application from http://biojupies.cloud.

183 citations

Posted Content•DOI•

Massive Mining of Publicly Available RNA-seq Data from Human and Mouse

Alexander Lachmann¹, Denis Torre¹, Alexandra B Keenan¹, Kathleen M. Jagodnik¹, Hoyjin J. Lee¹, Moshe C. Silverstein¹, Lily Wang¹, Avi Ma'ayan¹

Icahn School of Medicine at Mount Sinai¹

14 Sep 2017-bioRxiv

TL;DR: ARCHS4 is a web resource that makes the majority of previously published RNA-seq data from human and mouse freely available at the gene count level; co-expression correlation data created from ARCHS4 outperforms co-expression data created from other major gene expression data repositories such as GTEx and CCLE.

Abstract: RNA-sequencing (RNA-seq) is currently the leading technology for genome-wide transcript quantification. While the volume of RNA-seq data is rapidly increasing, the currently publicly available RNA-seq data is provided mostly in raw form, with small portions processed non-uniformly. This is mainly because the computational demand, particularly for the alignment step, is a significant barrier for global and integrative retrospective analyses. To address this challenge, we developed all RNA-seq and ChIP-seq sample and signature search (ARCHS4), a web resource that makes the majority of previously published RNA-seq data from human and mouse freely available at the gene count level. Such uniformly processed data enables easy integration for downstream analyses. For developing the ARCHS4 resource, all available FASTQ files from RNA-seq experiments were retrieved from the Gene Expression Omnibus (GEO) and aligned using a cloud-based infrastructure. In total 137,792 samples are accessible through ARCHS4 with 72,363 mouse and 65,429 human samples. Through efficient use of cloud resources and dockerized deployment of the sequencing pipeline, the alignment cost per sample is reduced to less than one cent. ARCHS4 is updated automatically by adding newly published samples to the database as they become available. Additionally, the ARCHS4 web interface provides intuitive exploration of the processed data through querying tools, interactive visualization, and gene landing pages that provide average expression across cell lines and tissues, top co-expressed genes, and predicted biological functions and protein-protein interactions for each gene based on prior knowledge combined with co-expression. Benchmarking the quality of these predictions, co-expression correlation data created from ARCHS4 outperforms co-expression data created from other major gene expression data repositories such as GTEx and CCLE. ARCHS4 is freely accessible from: http://amp.pharm.mssm.edu/archs4.

146 citations

Cited by

Journal Article•DOI•

STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.

Damian Szklarczyk¹, Annika L. Gable¹, David Lyon¹, Alexander Junge², Stefan Wyder¹, Jaime Huerta-Cepas³, Milan Simonovic¹, Nadezhda Tsankova Doncheva², John H. Morris⁴, Peer Bork, Lars Juhl Jensen², Christian von Mering¹

Swiss Institute of Bioinformatics¹, University of Copenhagen², Technical University of Madrid³, University of California, San Francisco⁴

08 Jan 2019-Nucleic Acids Research

TL;DR: The latest version of STRING more than doubles the number of organisms it covers, and offers an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input.

Abstract: Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein-protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein-protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.

10,584 citations

Journal Article•DOI•

Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage

Dvir Aran¹, Agnieszka P. Looney¹, Leqian Liu¹, Esther Wu¹, Valerie Fong¹, Austin Hsu, Suzanna Chak¹, Ram P. Naikawadi¹, Paul J. Wolters¹, Adam R. Abate¹,², Atul J. Butte¹, Mallar Bhattacharya¹

University of California, San Francisco¹, California Institute for Quantitative Biosciences²

14 Jan 2019-Nature Immunology

TL;DR: Using scRNA-seq analysis, Bhattacharya and colleagues identify a subset of profibrotic lung macrophages that have a gene expression signature intermediate between those of monocytes and alveolar macrophages.

Abstract: Tissue fibrosis is a major cause of mortality that results from the deposition of matrix proteins by an activated mesenchyme. Macrophages accumulate in fibrosis, but the role of specific subgroups in supporting fibrogenesis has not been investigated in vivo. Here, we used single-cell RNA sequencing (scRNA-seq) to characterize the heterogeneity of macrophages in bleomycin-induced lung fibrosis in mice. A novel computational framework for the annotation of scRNA-seq by reference to bulk transcriptomes (SingleR) enabled the subclustering of macrophages and revealed a disease-associated subgroup with a transitional gene expression profile intermediate between monocyte-derived and alveolar macrophages. These CX3CR1+SiglecF+ transitional macrophages localized to the fibrotic niche and had a profibrotic effect in vivo. Human orthologs of genes expressed by the transitional macrophages were upregulated in samples from patients with idiopathic pulmonary fibrosis. Thus, we have identified a pathological subgroup of transitional macrophages that are required for the fibrotic response to injury.

1,790 citations

Journal Article•DOI•

Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing

Cohorts for Heart¹, Genetic², Genetic³, Genetic⁴, Environmental Risk in Ad, Polygenic Defining Genetic, Perades⁵

Cardiff University¹, Pasteur Institute², French Institute of Health and Medical Research³, University of Lille⁴, Erasmus University Rotterdam⁵

01 Mar 2019-Nature Genetics

TL;DR: Pathway analysis implicates immunity, lipid metabolism, tau binding proteins, and amyloid precursor protein (APP) metabolism, showing that genetic variants affecting APP and Aβ processing are associated not only with early-onset autosomal dominant Alzheimer’s disease but also with LOAD.

Abstract: Risk for late-onset Alzheimer’s disease (LOAD), the most prevalent dementia, is partially driven by genetics. To identify LOAD risk loci, we performed a large genome-wide association meta-analysis of clinically diagnosed LOAD (94,437 individuals). We confirm 20 previous LOAD risk loci and identify five new genome-wide loci (IQCK, ACE, ADAM10, ADAMTS1, and WWOX), two of which (ADAM10, ACE) were identified in a recent genome-wide association (GWAS)-by-familial-proxy of Alzheimer’s or dementia. Fine-mapping of the human leukocyte antigen (HLA) region confirms the neurological and immune-mediated disease haplotype HLA-DR15 as a risk factor for LOAD. Pathway analysis implicates immunity, lipid metabolism, tau binding proteins, and amyloid precursor protein (APP) metabolism, showing that genetic variants affecting APP and Aβ processing are associated not only with early-onset autosomal dominant Alzheimer’s disease but also with LOAD. Analyses of risk genes and pathways show enrichment for rare variants (P = 1.32 × 10⁻⁷), indicating that additional rare variants remain to be identified. We also identify important genetic correlations between LOAD and traits such as family history of dementia and education.

1,641 citations

Journal Article•DOI•

The BioGRID interaction database: 2019 update

Rose Oughtred¹, Chris Stark², Bobby-Joe Breitkreutz², Jennifer M. Rust¹, Lorrie Boucher², Christie S. Chang¹, Nadine Kolas², Lara O'Donnell², Genie Leung², Rochelle McAdam, Frederick Zhang, Sonam Dolma, Andrew Willems², Jasmin Coulombe-Huntington³, Andrew Chatr-aryamontri³, Kara Dolinski¹, Mike Tyers²,³

Princeton University¹, Lunenfeld-Tanenbaum Research Institute², Université de Montréal³

08 Jan 2019-Nucleic Acids Research

TL;DR: A new dedicated aspect of BioGRID annotates genome-wide CRISPR/Cas9-based screens that report gene–phenotype and gene–gene relationships, and captures chemical interaction data, including chemical–protein interactions for human drug targets drawn from the DrugBank database and manually curated bioactive compounds reported in the literature.

Abstract: The Biological General Repository for Interaction Datasets (BioGRID: https://thebiogrid.org) is an open access database dedicated to the curation and archival storage of protein, genetic and chemical interactions for all major model organism species and humans. As of September 2018 (build 3.4.164), BioGRID contains records for 1 598 688 biological interactions manually annotated from 55 809 publications for 71 species, as classified by an updated set of controlled vocabularies for experimental detection methods. BioGRID also houses records for >700 000 post-translational modification sites. BioGRID now captures chemical interaction data, including chemical-protein interactions for human drug targets drawn from the DrugBank database and manually curated bioactive compounds reported in the literature. A new dedicated aspect of BioGRID annotates genome-wide CRISPR/Cas9-based screens that report gene-phenotype and gene-gene relationships. An extension of the BioGRID resource called the Open Repository for CRISPR Screens (ORCS) database (https://orcs.thebiogrid.org) currently contains over 500 genome-wide screens carried out in human or mouse cell lines. All data in BioGRID is made freely available without restriction, is directly downloadable in standard formats and can be readily incorporated into existing applications via our web service platforms. BioGRID data are also freely distributed through partner model organism databases and meta-databases.

1,046 citations

Journal Article•DOI•

Gene Set Knowledge Discovery with Enrichr.

Zhuorui Xie¹, Allison Bailey¹, Maxim V. Kuleshov¹, Daniel J.B. Clarke¹, John Erol Evangelista¹, Sherry L. Jenkins¹, Alexander Lachmann¹, Megan L. Wojciechowicz¹, Eryk Kropiwnicki¹, Kathleen M. Jagodnik¹, Minji Jeon¹, Avi Ma'ayan¹

Icahn School of Medicine at Mount Sinai¹

01 Mar 2021

TL;DR: Enrichr is a gene set search engine that enables the querying of hundreds of thousands of annotated gene sets. It uniquely integrates knowledge from many high-profile projects to provide synthesized information about mammalian genes and gene sets.

Abstract: Profiling samples from patients, tissues, and cells with genomics, transcriptomics, epigenomics, proteomics, and metabolomics ultimately produces lists of genes and proteins that need to be further analyzed and integrated in the context of known biology. Enrichr (Chen et al., 2013; Kuleshov et al., 2016) is a gene set search engine that enables the querying of hundreds of thousands of annotated gene sets. Enrichr uniquely integrates knowledge from many high-profile projects to provide synthesized information about mammalian genes and gene sets. The platform provides various methods to compute gene set enrichment, and the results are visualized in several interactive ways. This protocol provides a summary of the key features of Enrichr, which include using Enrichr programmatically and embedding an Enrichr button on any website. © 2021 Wiley Periodicals LLC. Basic Protocol 1: Analyzing lists of differentially expressed genes from transcriptomics, proteomics and phosphoproteomics, GWAS studies, or other experimental studies. Basic Protocol 2: Searching Enrichr by a single gene or key search term. Basic Protocol 3: Preparing raw or processed RNA-seq data through BioJupies in preparation for Enrichr analysis. Basic Protocol 4: Analyzing gene sets for model organisms using modEnrichr. Basic Protocol 5: Using Enrichr in Geneshot. Basic Protocol 6: Using Enrichr in ARCHS4. Basic Protocol 7: Using the enrichment analysis visualization Appyter to visualize Enrichr results. Basic Protocol 8: Using the Enrichr API. Basic Protocol 9: Adding an Enrichr button to a website.

884 citations
