Home
/
Authors
/
Ariel Rabinovic

Author

Ariel Rabinovic

Bio: Ariel Rabinovic is an academic researcher from Harvard University. The author has contributed to research in topics: Sample size determination & AU-rich element. The author has an hindex of 2, co-authored 2 publications receiving 4983 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Adjusting batch effects in microarray expression data using empirical Bayes methods

[...]

W. Evan Johnson¹, Cheng Li¹, Ariel Rabinovic¹•Institutions (1)

Harvard University¹

01 Jan 2007-Biostatistics

TL;DR: This paper proposed parametric and non-parametric empirical Bayes frameworks for adjusting data for batch effects that is robust to outliers in small sample sizes and performs comparable to existing methods for large samples.

...read moreread less

Abstract: SUMMARY Non-biological experimental variation or “batch effects” are commonly observed across multiple batches of microarray experiments, often rendering the task of combining data from these batches difficult. The ability to combine microarray data sets is advantageous to researchers to increase statistical power to detect biological phenomena from studies where logistical considerations restrict sample size or in studies that require the sequential hybridization of arrays. In general, it is inappropriate to combine data sets without adjusting for batch effects. Methods have been proposed to filter batch effects from data, but these are often complicated and require large batch sizes (>25) to implement. Because the majority of microarray studies are conducted using much smaller sample sizes, existing methods are not sufficient. We propose parametric and non-parametric empirical Bayes frameworks for adjusting data for batch effects that is robust to outliers in small sample sizes and performs comparable to existing methods for large samples. We illustrate our methods using two example data sets and show that our methods are justifiable, easy to apply, and useful in practice. Software for our method is freely available at: http://biosun1.harvard.edu/complab/batch/.

...read moreread less

6,319 citations

Journal Article•DOI•

Analysis of nitric oxide-stabilized mRNAs in human fibroblasts reveals HuR-dependent heme oxygenase 1 upregulation.

[...]

Yuki Kuwano¹, Ariel Rabinovic², Subramanya Srikantan¹, Myriam Gorospe¹, Bruce Demple² - Show less +1 more•Institutions (2)

National Institutes of Health¹, Harvard University²

15 May 2009-Molecular and Cellular Biology

TL;DR: The results indicate that NO stabilizes mRNA subsets in fibroblasts, identify HuR as an RBP implicated in the NO response, reveal that HuR alone is insufficient for stabilizing several mRNAs by NO, and show that HO-1 induction by NO is regulated by HuR.

...read moreread less

Abstract: We previously observed that nitric oxide (NO) exposure increases the stability of mRNAs encoding heme oxygenase 1 (HO-1) and TIEG-1 in human and mouse fibroblasts. Here, we have used microarrays to look broadly for changes in mRNA stability in response to NO treatment. Using human IMR-90 and mouse NIH 3T3 fibroblasts treated with actinomycin D to block de novo transcription, microarray analysis suggested that the stability of the majority of mRNAs was unaffected. Among the mRNAs that were stabilized by NO treatment, seven transcripts were found in both IMR-90 and NIH 3T3 cells (CHIC2, GADD45B, HO-1, PTGS2, RGS2, TIEG, and ID3) and were chosen for further analysis. All seven mRNAs showed at least one hit of a signature motif for the stabilizing RNA-binding protein (RBP) HuR; accordingly, ribonucleoprotein immunoprecipitation analysis revealed that all seven mRNAs associated with HuR. In keeping with a functional role of HuR in the response to NO, a measurable fraction of HuR increased in the cytoplasm following NO treatment. However, among the seven transcripts, only HO-1 mRNA showed a robust increase in the level of its association with HuR following NO treatment. In turn, HO-1 mRNA and protein levels were significantly reduced when HuR levels were silenced in IMR-90 cells, and they were elevated when HuR was overexpressed. In sum, our results indicate that NO stabilizes mRNA subsets in fibroblasts, identify HuR as an RBP implicated in the NO response, reveal that HuR alone is insufficient for stabilizing several mRNAs by NO, and show that HO-1 induction by NO is regulated by HuR.

...read moreread less

37 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Integrating single-cell transcriptomic data across different conditions, technologies, and species.

[...]

Andrew Butler, Paul J. Hoffman, Peter Smibert, Efthymia Papalexi¹, Rahul Satija¹ - Show less +1 more•Institutions (1)

New York University¹

02 Apr 2018-Nature Biotechnology

TL;DR: An analytical strategy for integrating scRNA-seq data sets based on common sources of variation is introduced, enabling the identification of shared populations across data sets and downstream comparative analysis.

...read moreread less

Abstract: Computational single-cell RNA-seq (scRNA-seq) methods have been successfully applied to experiments representing a single condition, technology, or species to discover and define cellular phenotypes. However, identifying subpopulations of cells that are present across multiple data sets remains challenging. Here, we introduce an analytical strategy for integrating scRNA-seq data sets based on common sources of variation, enabling the identification of shared populations across data sets and downstream comparative analysis. We apply this approach, implemented in our R toolkit Seurat (http://satijalab.org/seurat/), to align scRNA-seq data sets of peripheral blood mononuclear cells under resting and stimulated conditions, hematopoietic progenitors sequenced using two profiling technologies, and pancreatic cell 'atlases' generated from human and mouse islets. In each case, we learn distinct or transitional cell states jointly across data sets, while boosting statistical power through integrated analysis. Our approach facilitates general comparisons of scRNA-seq data sets, potentially deepening our understanding of how distinct cell states respond to perturbation, disease, and evolution.

...read moreread less

7,741 citations

Journal Article•DOI•

GSVA: gene set variation analysis for microarray and RNA-seq data.

[...]

Sonja Hänzelmann, Robert Castelo¹, Justin Guinney²•Institutions (2)

Pompeu Fabra University¹, Sage Bionetworks²

16 Jan 2013-BMC Bioinformatics

TL;DR: This work introduces Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner and constitutes a starting point to build pathway-centric models of biology.

...read moreread less

Abstract: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org .

...read moreread less

6,125 citations

Journal Article•DOI•

The consensus molecular subtypes of colorectal cancer

[...]

Justin Guinney¹, Rodrigo Dienstmann², Rodrigo Dienstmann¹, Xingwu Wang³, Xingwu Wang⁴, Aurélien de Reyniès, Andreas Schlicker⁵, Charlotte Soneson⁶, Laetitia Marisa, Paul Roepman, Gift Nyamundanda, Paolo Angelino⁶, Brian M. Bot¹, Jeffrey S. Morris⁷, Iris Simon, Sarah Gerster⁶, Evelyn Fessler³, Felipe De Sousa E Melo³, Edoardo Missiaglia⁶, Hena R. Ramay⁶, David Barras⁶, Krisztian Homicsko⁸, Dipen M. Maru⁷, Ganiraju C. Manyam⁷, Bradley M. Broom⁷, Valérie Boige⁹, Beatriz Perez-Villamil¹⁰, Ted Laderas¹, Ramon Salazar, Joe W. Gray¹¹, Douglas Hanahan⁸, Josep Tabernero², René Bernards⁵, Stephen H. Friend¹, Pierre Laurent-Puig¹², Jan Paul Medema³, Anguraj Sadanandam, Lodewyk F. A. Wessels⁵, Mauro Delorenzi¹³, Mauro Delorenzi⁶, Scott Kopetz⁷, Louis Vermeulen³, Sabine Tejpar¹⁴ - Show less +39 more•Institutions (14)

Sage Bionetworks¹, Autonomous University of Barcelona², University of Amsterdam³, City University of Hong Kong⁴, Netherlands Cancer Institute⁵, Swiss Institute of Bioinformatics⁶, University of Texas MD Anderson Cancer Center⁷, École Polytechnique Fédérale de Lausanne⁸, Institut Gustave Roussy⁹, Hospital Clínico San Carlos¹⁰, Oregon Health & Science University¹¹, Paris Descartes University¹², University of Lausanne¹³, Katholieke Universiteit Leuven¹⁴

01 Nov 2015-Nature Medicine

TL;DR: An international consortium dedicated to large-scale data sharing and analytics across expert groups is formed, showing marked interconnectivity between six independent classification systems coalescing into four consensus molecular subtypes (CMSs) with distinguishing features.

...read moreread less

Abstract: Colorectal cancer (CRC) is a frequently lethal disease with heterogeneous outcomes and drug responses. To resolve inconsistencies among the reported gene expression-based CRC classifications and facilitate clinical translation, we formed an international consortium dedicated to large-scale data sharing and analytics across expert groups. We show marked interconnectivity between six independent classification systems coalescing into four consensus molecular subtypes (CMSs) with distinguishing features: CMS1 (microsatellite instability immune, 14%), hypermutated, microsatellite unstable and strong immune activation; CMS2 (canonical, 37%), epithelial, marked WNT and MYC signaling activation; CMS3 (metabolic, 13%), epithelial and evident metabolic dysregulation; and CMS4 (mesenchymal, 23%), prominent transforming growth factor-β activation, stromal invasion and angiogenesis. Samples with mixed features (13%) possibly represent a transition phenotype or intratumoral heterogeneity. We consider the CMS groups the most robust classification system currently available for CRC-with clear biological interpretability-and the basis for future clinical stratification and subtype-based targeted interventions.

...read moreread less

3,351 citations

Journal Article•DOI•

The sva package for removing batch effects and other unwanted variation in high-throughput experiments

[...]

Jeffrey T. Leek¹, W. Evan Johnson², Hilary S. Parker¹, Andrew E. Jaffe¹, John D. Storey¹ - Show less +1 more•Institutions (2)

Johns Hopkins University¹, Boston University²

01 Mar 2012-Bioinformatics

TL;DR: The sva package is described, which supports surrogate variable estimation with the sva function, direct adjustment for known batch effects with the ComBat function and adjustment for batch and latent variables in prediction problems with the fsva function.

...read moreread less

Abstract: Summary: Heterogeneity and latent variables are now widely recognized as major sources of bias and variability in high-throughput experiments. The most well-known source of latent variation in genomic experiments are batch effects—when samples are processed on different days, in different groups or by different people. However, there are also a large number of other variables that may have a major impact on high-throughput measurements. Here we describe the sva package for identifying, estimating and removing unwanted sources of variation in high-throughput experiments. The sva package supports surrogate variable estimation with the sva function, direct adjustment for known batch effects with the ComBat function and adjustment for batch and latent variables in prediction problems with the fsva function. Availability: The R package sva is freely available from http://www.bioconductor.org. Contact: jleek@jhsph.edu Supplementary information:Supplementary data are available at Bioinformatics online.

...read moreread less

3,343 citations