False discovery rates: a new deal.
Abstract
We introduce a new Empirical Bayes approach for large-scale hypothesis testing, including estimating false discovery rates (FDRs) and effect sizes. This approach has two key differences from existing approaches to FDR analysis. First, it assumes that the distribution of the actual (unobserved) effects is unimodal, with a mode at 0. This "unimodal assumption" (UA), although natural in many contexts, is not usually incorporated into standard FDR analysis, and we demonstrate how incorporating it brings many benefits. Specifically, the UA facilitates efficient and robust computation (estimating the unimodal distribution involves solving a simple convex optimization problem) and enables more accurate inferences provided that it holds. Second, the method takes as its input two numbers for each test (an effect size estimate and corresponding standard error), rather than the one number usually used ($p$ value or $z$ score). When available, using two numbers instead of one helps account for variation in measurement precision across tests. It also facilitates estimation of effects, and unlike standard FDR methods, our approach provides interval estimates (credible regions) for each effect in addition to measures of significance. To provide a bridge between interval estimates and significance measures, we introduce the term "local false sign rate" to refer to the probability of getting the sign of an effect wrong, and argue that it is superior to the local FDR as a measure of significance because it is both more generally applicable and can be more robustly estimated. Our methods are implemented in an R package, ashr, available from http://github.com/stephens999/ashr.
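The core idea in the abstract (shrink each two-number input toward zero under a unimodal prior, then report the probability of getting the sign wrong) can be illustrated with a minimal sketch. This is not the ashr implementation: the fixed grid of prior scales, the use of a normal mixture without a point mass at zero, and the EM fit of the mixture weights are all simplifying assumptions made here for brevity (the paper instead solves a convex problem for the weights), and the function name `ash_sketch` is hypothetical.

```python
import numpy as np
from math import erf

# Standard normal CDF, vectorized (avoids a scipy dependency).
_phi = np.vectorize(lambda z: 0.5 * (1.0 + erf(z / np.sqrt(2.0))))

def ash_sketch(betahat, se, scales=(0.1, 0.5, 1.0, 2.0), n_iter=200):
    """Toy empirical-Bayes shrinkage in the spirit of the abstract.

    Inputs are the two numbers per test: an effect estimate and its
    standard error. The prior is a mixture of zero-mean normals, a
    unimodal family with mode at 0. Returns posterior means, local
    false sign rates (lfsr), and the fitted mixture weights.
    """
    betahat = np.asarray(betahat, float)
    se = np.asarray(se, float)
    sig2 = np.asarray(scales, float) ** 2            # prior variances

    # Marginal likelihood of each observation under each component:
    # betahat_j | component k  ~  N(0, se_j^2 + sig2_k)
    var_k = se[:, None] ** 2 + sig2[None, :]
    lik = np.exp(-0.5 * betahat[:, None] ** 2 / var_k) / np.sqrt(2 * np.pi * var_k)

    # Fit the mixture weights pi by EM (maximum likelihood in the weights).
    pi = np.full(len(sig2), 1.0 / len(sig2))
    for _ in range(n_iter):
        resp = pi * lik
        resp /= resp.sum(axis=1, keepdims=True)
        pi = resp.mean(axis=0)

    # Per-component posteriors are normal, with the usual shrinkage factor.
    resp = pi * lik
    resp /= resp.sum(axis=1, keepdims=True)
    shrink = sig2[None, :] / var_k                   # in (0, 1): pulls toward 0
    m_k = betahat[:, None] * shrink                  # component posterior means
    sd_k = np.sqrt(se[:, None] ** 2 * shrink)        # component posterior sds

    post_mean = (resp * m_k).sum(axis=1)
    p_neg = (resp * _phi(-m_k / sd_k)).sum(axis=1)   # P(effect <= 0 | data)
    lfsr = np.minimum(p_neg, 1.0 - p_neg)            # prob. the sign is wrong
    return post_mean, lfsr, pi
```

On toy inputs the behavior matches the abstract's intent: a strong effect (say betahat = 5, se = 1) gets an lfsr near 0, while a weak one (betahat = 0.1, se = 1) gets an lfsr close to 0.5, i.e. its sign is essentially a coin flip, and every estimate is shrunk toward zero.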
Citations
Journal Article
An Expanded View of Complex Traits: From Polygenic to Omnigenic
TL;DR: It is proposed that gene regulatory networks are sufficiently interconnected such that all genes expressed in disease-relevant cells are liable to affect the functions of core disease-related genes and that most heritability can be explained by effects on genes outside core pathways.
Journal Article
Heavy-tailed prior distributions for sequence count data: removing the noise and preserving large differences.
TL;DR: The proposed method, Approximate Posterior Estimation for generalized linear models (apeglm), has lower bias than previously proposed shrinkage estimators, while still reducing variance for those genes with little information for statistical inference.
Journal Article
Data-driven hypothesis weighting increases detection power in genome-scale multiple testing
TL;DR: Independent hypothesis weighting (IHW) is described, a method that assigns weights using covariates independent of the P-values under the null hypothesis but informative of each test's power or prior probability of the null hypothesis.
References
Journal Article
R: A language and environment for statistical computing.
TL;DR: Copyright © 1999–2012 R Foundation for Statistical Computing; permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and permission notice are preserved on all copies.
Journal Article
Controlling the false discovery rate: a practical and powerful approach to multiple testing
Yoav Benjamini, Yosef Hochberg
TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses (the false discovery rate); this criterion is equivalent to the FWER when all hypotheses are true but is smaller otherwise.
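The procedure this reference describes is the classic Benjamini–Hochberg step-up rule, which can be sketched in a few lines. A minimal Python version for independent tests (the function name and test values are illustrative, not from the paper):

```python
import numpy as np

def benjamini_hochberg(pvals, alpha=0.05):
    """Benjamini-Hochberg step-up procedure.

    Sort the m p-values, find the largest i with p_(i) <= (i / m) * alpha,
    and reject the i smallest p-values. For independent tests this controls
    the expected proportion of false rejections (the FDR) at level alpha.
    """
    p = np.asarray(pvals, float)
    m = len(p)
    order = np.argsort(p)
    below = p[order] <= alpha * np.arange(1, m + 1) / m
    # Step-up: take the LARGEST index whose sorted p-value is under its
    # threshold, and reject everything up to that index.
    k = int(np.max(np.nonzero(below)[0]) + 1) if below.any() else 0
    reject = np.zeros(m, dtype=bool)
    reject[order[:k]] = True
    return reject
```

The step-up character matters: with p-values [0.001, 0.049, 0.05] at alpha = 0.05, the middle value exceeds its own threshold (2/3 × 0.05 ≈ 0.033) yet all three are rejected, because the largest value sits exactly at its threshold 0.05.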
Book
Convex Optimization
Stephen Boyd, Lieven Vandenberghe
TL;DR: This book gives a comprehensive introduction to convex optimization, with a focus on recognizing convex optimization problems and then finding the most appropriate technique for solving them.
Book
ggplot2: Elegant Graphics for Data Analysis
TL;DR: This book describes ggplot2, a data visualization package for R that uses the insights from Leland Wilkinson's Grammar of Graphics to create a powerful and flexible system for creating data graphics.
Journal Article
Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
TL;DR: The hierarchical model of Lönnstedt and Speed (2002) is developed into a practical approach for general microarray experiments with arbitrary numbers of treatments and RNA samples, and the moderated t-statistic is shown to follow a t-distribution with augmented degrees of freedom.
Related Papers (5)
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2