A comparison of normalization methods for high density oligonucleotide array data based on variance and bias

doi:10.1093/BIOINFORMATICS/19.2.185

Open Access Journal Article

Benjamin M. Bolstad, +4 more

- 22 Jan 2003 -

Bioinformatics

- Vol. 19, Iss: 2, pp 185-193

TLDR

Three complete data methods of performing normalization at the probe intensity level are presented and compared, in terms of the variability and bias of an expression measure, with two baseline-array methods: a one-number scaling algorithm and a method that uses a non-linear normalizing relation. The simplest and quickest complete data method is found to perform favorably.

Abstract:

Motivation: When running experiments that involve multiple high density oligonucleotide arrays, it is important to remove sources of variation between arrays of non-biological origin. Normalization is a process for reducing this variation. It is common to see non-linear relations between arrays, and the standard normalization provided by Affymetrix does not perform well in these situations.

Results: We present three methods of performing normalization at the probe intensity level. These methods are called complete data methods because they make use of data from all arrays in an experiment to form the normalizing relation. These algorithms are compared, in terms of the variability and bias of an expression measure, with two methods that make use of a baseline array: a one-number scaling algorithm and a method that uses a non-linear normalizing relation. Two publicly available datasets are used to carry out the comparisons. The simplest and quickest complete data method is found to perform favorably.

Availability: Software implementing all three of the complete data normalization methods is available as part of the R package Affy, which is a part of the Bioconductor project, http://www.bioconductor.org.

Contact: bolstad@stat.berkeley.edu

Supplementary information: Additional figures may be found at http://www.stat.berkeley.edu/~bolstad/normalize/index.html
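To make the distinction between a baseline-array method and a complete data method concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation. The abstract does not name the methods, so the sketch assumes that the "simplest and quickest" complete data method is quantile normalization, and that one-number scaling matches each array's mean to that of a baseline array (the abstract does not say which summary statistic is used). The function names `scale_to_baseline` and `quantile_normalize` are hypothetical.

```python
from statistics import mean


def scale_to_baseline(arrays, baseline_index=0):
    """One-number scaling (assumed form): multiply each array by a single
    factor so that its mean intensity equals the baseline array's mean."""
    target = mean(arrays[baseline_index])
    return [[x * target / mean(a) for x in a] for a in arrays]


def quantile_normalize(arrays):
    """Quantile normalization (assumed to be the 'simplest' complete data
    method): give every array the same empirical distribution, built from
    all arrays at once rather than from a single baseline.

    `arrays` is a list of equal-length lists, one list of probe
    intensities per array. Ties are broken by probe order, which is a
    simplification.
    """
    n = len(arrays[0])
    if any(len(a) != n for a in arrays):
        raise ValueError("all arrays must have the same number of probes")

    # Sort each array, then average across arrays at each rank.
    sorted_arrays = [sorted(a) for a in arrays]
    rank_means = [mean(s[i] for s in sorted_arrays) for i in range(n)]

    normalized = []
    for a in arrays:
        # order[k] is the index of the probe holding the k-th smallest value.
        order = sorted(range(n), key=lambda i: a[i])
        out = [0.0] * n
        for rank, idx in enumerate(order):
            out[idx] = rank_means[rank]
        normalized.append(out)
    return normalized


chips = [[5.0, 2.0, 3.0, 4.0], [4.0, 1.0, 6.0, 3.0]]
print(quantile_normalize(chips))
# [[5.5, 1.5, 3.0, 4.0], [4.0, 1.5, 5.5, 3.0]]
```

After quantile normalization every array contains the same set of values and only their assignment to probes differs, so non-linear differences between arrays are removed. A single scaling factor can only remove a linear difference.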

Citations

limma powers differential expression analyses for RNA-sequencing and microarray studies

Matthew E. Ritchie, +7 more

- 20 Apr 2015 -

Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

Exploration, normalization, and summaries of high density oligonucleotide array probe level data

Rafael A. Irizarry, +6 more

- 01 Apr 2003 -

Biostatistics

TL;DR: Exploratory analyses of probe level data motivate a new summary measure, the robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values. A standard error (SE) is attached to this quantity using a linear model which removes probe-specific affinities, and no obvious downside to using RMA is found.

A scaling normalization method for differential expression analysis of RNA-seq data

Mark D. Robinson, +2 more

- 02 Mar 2010 -

Genome Biology

TL;DR: A simple and effective method for performing normalization is outlined and dramatically improved results for inferring differential expression in simulated and publicly available data sets are shown.

Normalization of Real-Time Quantitative Reverse Transcription-PCR Data: A Model-Based Variance Estimation Approach to Identify Genes Suited for Normalization, Applied to Bladder and Colon Cancer Data Sets

Claus L. Andersen, +2 more

- 01 Aug 2004 -

Cancer Research

TL;DR: A novel, innovative, and robust strategy to identify stably expressed genes among a set of candidate normalization genes, rooted in a mathematical model of gene expression, that provides a direct measure for the estimated expression variation, enabling the user to evaluate the systematic error introduced when using the gene.

Summaries of Affymetrix GeneChip probe level data

Rafael A. Irizarry, +6 more

- 15 Feb 2003 -

Nucleic Acids Research

TL;DR: It is found that the performance of the current version of the default expression measure provided by Affymetrix Microarray Suite can be significantly improved by the use of probe level summaries derived from empirically motivated statistical models.

References

Exploration, normalization, and summaries of high density oligonucleotide array probe level data

Rafael A. Irizarry, +6 more

- 01 Apr 2003 -

Biostatistics

TL;DR: Exploratory analyses of probe level data motivate a new summary measure, the robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values. A standard error (SE) is attached to this quantity using a linear model which removes probe-specific affinities, and no obvious downside to using RMA is found.

R: A Language for Data Analysis and Graphics

Ross Ihaka, +1 more

- 01 Sep 1996 -

Journal of Computational and Graphical S...

TL;DR: In this article, the authors discuss their experience designing and implementing a statistical computing language, which combines what they felt were useful features from two existing computer languages, and they feel that the new language provides advantages in the areas of portability, computational efficiency, memory management, and scope.

Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting

William S. Cleveland, +1 more

- 01 Sep 1988 -

Journal of the American Statistical Asso...

TL;DR: Locally weighted regression is a way of estimating a regression surface through a multivariate smoothing procedure, fitting a function of the independent variables locally and in a moving fashion analogous to how a moving average is computed for a time series.

Modern Applied Statistics with S-Plus.

W. N. Venables, +1 more

- 01 Dec 1996 -

Biometrics

High density synthetic oligonucleotide arrays

Robert J. Lipshutz, +3 more

- 01 Jan 1999 -

Nature Genetics

TL;DR: An approach is developed in which sequence information is used directly to design high-density, two-dimensional arrays of synthetic oligonucleotides. Such arrays have been designed and used for quantitative and highly parallel measurements of gene expression, to discover polymorphic loci, and to detect the presence of thousands of alternative alleles.
