scispace - formally typeset
Open AccessJournal ArticleDOI

A data-driven approach to preprocessing Illumina 450K methylation array data

Reads0
Chats0
TLDR
It is demonstrated that quantile normalization methods produce marked improvement, even in highly consistent data, by all three metrics, and that careful selection of preprocessing steps can minimize variance and thus improve statistical power, especially for the detection of the small absolute DNA methylation changes likely associated with complex disease phenotypes.
Abstract
As the most stable and experimentally accessible epigenetic mark, DNA methylation is of great interest to the research community. The landscape of DNA methylation across tissues, through development and in disease pathogenesis is not yet well characterized. Thus there is a need for rapid and cost effective methods for assessing genome-wide levels of DNA methylation. The Illumina Infinium HumanMethylation450 (450K) BeadChip is a very useful addition to the available methods for DNA methylation analysis but its complex design, incorporating two different assay methods, requires careful consideration. Accordingly, several normalization schemes have been published. We have taken advantage of known DNA methylation patterns associated with genomic imprinting and X-chromosome inactivation (XCI), in addition to the performance of SNP genotyping assays present on the array, to derive three independent metrics which we use to test alternative schemes of correction and normalization. These metrics also have potential utility as quality scores for datasets. The standard index of DNA methylation at any specific CpG site is β = M/(M + U + 100) where M and U are methylated and unmethylated signal intensities, respectively. Betas (βs) calculated from raw signal intensities (the default GenomeStudio behavior) perform well, but using 11 methylomic datasets we demonstrate that quantile normalization methods produce marked improvement, even in highly consistent data, by all three metrics. The commonly used procedure of normalizing betas is inferior to the separate normalization of M and U, and it is also advantageous to normalize Type I and Type II assays separately. More elaborate manipulation of quantiles proves to be counterproductive. Careful selection of preprocessing steps can minimize variance and thus improve statistical power, especially for the detection of the small absolute DNA methylation changes likely associated with complex disease phenotypes. For the convenience of the research community we have created a user-friendly R software package called wateRmelon, downloadable from bioConductor, compatible with the existing methylumi, minfi and IMA packages, that allows others to utilize the same normalization methods and data quality tests on 450K data.

read more

Content maybe subject to copyright    Report

Citations
More filters
Posted ContentDOI

Genome-wide study of DNA methylation in Amyotrophic Lateral Sclerosis identifies differentially methylated loci and implicates metabolic, inflammatory and cholesterol pathways

Paul J. Hop, +96 more
- 24 Mar 2021 - 
TL;DR: In this paper, the authors presented a case-control study of DNA methylation for amyotrophic lateral sclerosis (ALS) using 10,462 samples (7,344 ALS patients and 3,118 controls) and identified a total of 45 differentially methylated positions annotated to 42 genes.
Journal ArticleDOI

Nasal DNA methylation at three CpG sites predicts childhood allergic disease

TL;DR: In this article , the authors show that DNA methylation has the strongest predictive power to diagnose childhood allergy, surpassing genetic risk scores, and environmental factors, and suggest the potential of methylation-based allergy diagnosis.
Posted ContentDOI

interpolatedXY: a two-step strategy to normalise DNA methylation microarray data avoiding sex bias

TL;DR: In this paper, a two-step approach is proposed to normalize sex chromosome data without introducing bias, which is applicable to all quantile-based normalization methods, such as funnorm or dasen.
Posted ContentDOI

Senotherapeutic peptide reduces skin biological age and improves skin health markers

TL;DR: It is determined that peptide 14 is safe for long-term applications and also significantly extends both the lifespan and healthspan of C. elegans worms tested in two independent testings, highlighting the potential for geroprotective applications of the senotherapeutic compounds identified using the screening platform beyond the skin.
Journal ArticleDOI

Profiling placental DNA methylation associated with maternal SSRI treatment during pregnancy

TL;DR: In this paper , the authors found no evidence for altered DNA methylation in maternal depression-exposed versus depression nonexposed placentas, in a prospectively-recruited cohort with clinician-ascertained SSRI exposure and mood assessments.
References
More filters
Journal ArticleDOI

Exploration, normalization, and summaries of high density oligonucleotide array probe level data

TL;DR: There is no obvious downside to using RMA and attaching a standard error (SE) to this quantity using a linear model which removes probe-specific affinities, and the exploratory data analyses of the probe level data motivate a new summary measure that is a robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values.
Book ChapterDOI

limma: Linear Models for Microarray Data

TL;DR: This chapter starts with the simplest replicated designs and progresses through experiments with two or more groups, direct designs, factorial designs and time course experiments with technical as well as biological replication.
Book

Bioinformatics and Computational Biology Solutions Using R and Bioconductor

TL;DR: In this article, the authors present a detailed case study of R algorithms with publicly available data, and a major section of the book is devoted to fully worked case studies, with a companion website where readers can reproduce every number, figure and table on their own computers.
Journal ArticleDOI

Bioinformatics and Computational Biology Solutions Using R and Bioconductor

TL;DR: In this article, the authors present a Bioinformatics and Computational Biology Solutions Using R and Bioconductor (BIBOS) using R and BIBOS, which is a combination of R and CRF.
Related Papers (5)