Open Access Journal Article

Using Bayes to get the most out of non-significant results

Zoltan Dienes
29 Jul 2014, Vol. 5, p. 781
TLDR
It is argued that Bayes factors allow theory to be linked to data in a way that overcomes the weaknesses of the other approaches (power and intervals), and that they provide a coherent approach to determining whether non-significant results support a null hypothesis over a theory, or whether the data are just insensitive.
Abstract
No scientific conclusion follows automatically from a statistically non-significant result, yet people routinely use non-significant results to guide conclusions about the status of theories (or the effectiveness of practices). To know whether a non-significant result counts against a theory, or if it just indicates data insensitivity, researchers must use one of: power, intervals (such as confidence or credibility intervals), or else an indicator of the relative evidence for one theory over another, such as a Bayes factor. I argue Bayes factors allow theory to be linked to data in a way that overcomes the weaknesses of the other approaches. Specifically, Bayes factors use the data themselves to determine their sensitivity in distinguishing theories (unlike power), and they make use of those aspects of a theory’s predictions that are often easiest to specify (unlike power and intervals, which require specifying the minimal interesting value in order to address theory). Bayes factors provide a coherent approach to determining whether non-significant results support a null hypothesis over a theory, or whether the data are just insensitive. They allow accepting and rejecting the null hypothesis to be put on an equal footing. Concrete examples are provided to indicate the range of application of a simple online Bayes calculator, which reveal both the strengths and weaknesses of Bayes factors.
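
To make the abstract's central idea concrete, here is a minimal sketch of a Bayes factor comparing a theory (H1) against the null hypothesis (H0) for a single observed effect. It is not the online calculator's code. The assumptions are mine: the data are summarised by a mean and standard error with a normal likelihood, H1 predicts non-negative effects described by a half-normal distribution whose scale `prior_sd` is a rough expected effect size, and the cutoffs of 3 and 1/3 in the hypothetical helper `classify` are a conventional choice rather than something stated in the abstract.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution with the given mean and sd at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def bayes_factor_half_normal(obs_mean, obs_se, prior_sd, steps=20000):
    """Bayes factor for H1 (effect ~ half-normal, scale prior_sd) over H0 (effect = 0).

    Assumption: the observed effect is normally distributed around the
    true effect with standard deviation obs_se.
    """
    # Likelihood of the data if the true effect is exactly zero (H0).
    like_h0 = normal_pdf(obs_mean, 0.0, obs_se)

    # Likelihood under H1: average the likelihood over all effects theta >= 0,
    # weighted by the half-normal prior. Integrated with the midpoint rule
    # over a range wide enough to cover both the prior and the likelihood.
    upper = max(8.0 * prior_sd, obs_mean + 8.0 * obs_se)
    width = upper / steps
    like_h1 = 0.0
    for i in range(steps):
        theta = (i + 0.5) * width
        prior = 2.0 * normal_pdf(theta, 0.0, prior_sd)  # half-normal density
        like_h1 += prior * normal_pdf(obs_mean, theta, obs_se) * width

    return like_h1 / like_h0


def classify(bf, cutoff=3.0):
    """Label a Bayes factor using an assumed conventional cutoff (3 and 1/3)."""
    if bf >= cutoff:
        return "supports H1"
    if bf <= 1.0 / cutoff:
        return "supports H0"
    return "insensitive"


if __name__ == "__main__":
    # Illustrative numbers only, not taken from the paper.
    for mean, se, sd in [(0.0, 3.0, 4.0), (0.0, 1.0, 5.0), (5.0, 2.0, 5.0)]:
        bf = bayes_factor_half_normal(mean, se, sd)
        print(mean, se, sd, round(bf, 3), classify(bf))
```

With an observed mean of exactly zero, the result can be checked by hand: the Bayes factor reduces to `obs_se / sqrt(obs_se**2 + prior_sd**2)`, which is 0.6 for a standard error of 3 and a prior scale of 4. A non-significant result with a large standard error therefore lands in the "insensitive" band, while the same null mean measured more precisely moves towards support for H0.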


Citations
Journal Article

Busting a myth with the Bayes Factor: Effects of letter bigram frequency in visual lexical decision do not reflect reading processes

TL;DR: This article found no effect of bigram frequency in lexical decision in the British Lexicon Project, and some evidence for an inhibitory effect in the English Lexicon Project.
Journal Article

Evaluation of behavioral problems after prenatal dexamethasone treatment in Swedish adolescents at risk of CAH

TL;DR: The entire data set was unreliable and needed to be re-analysed; the results have been published here: https://www.sciencedirect.com/science/article/pii/S0018506X17300752
Journal Article

Assessment of the Choice Behavior Under Cued Conditions (CBUCC) paradigm as a measure of motivation to smoke under laboratory conditions

TL;DR: Craving and 'money spent' in the Choice Behavior Under Cued Conditions (CBUCC) task appear to be responsive to cigarette versus water cues, and 'money spent' appears to show a greater difference in responsiveness to cigarette than to water cues after abstinence.
Posted Content

Test-retest reliability and convergent validity of (R)-[11C]PK11195 outcome measures without arterial input function

TL;DR: Evaluated techniques for estimating (R)-[11C]PK11195 binding without arterial measurements, such as standardized uptake values (SUVs), supervised-cluster analysis (SVCA), or the use of a pseudo-reference region showed poor reliability and little to no convergent validity with outcomes derived using an AIF.
Proceedings Article

Convergence Across Behavioral and Self-report Measures Evaluating Individuals' Trust in an Autonomous Golf Cart

TL;DR: In this paper, the authors used an autonomous golf cart to drive participants to different locations around the campus of James Madison University while a camera recorded them; participants were given the AICP-R and TOAST to evaluate their complacency potential and trust.
References
Book

Statistical Power Analysis for the Behavioral Sciences

TL;DR: The concepts of power analysis are discussed in this paper, where Chi-square Tests for Goodness of Fit and Contingency Tables, t-Test for Means, and Sign Test are used.
Book

Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach

TL;DR: The second edition of this book is unique in that it focuses on methods for making formal statistical inference from all the models in an a priori set (Multi-Model Inference).
Journal Article

Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.

TL;DR: In the new version, procedures to analyze the power of tests based on single-sample tetrachoric correlations, comparisons of dependent correlations, bivariate linear regression, multiple linear regression based on the random predictor model, logistic regression, and Poisson regression are added.
Journal Article

Bayesian data analysis.

TL;DR: A fatal flaw of NHST is reviewed and some benefits of Bayesian data analysis are introduced and illustrative examples of multiple comparisons in Bayesian analysis of variance and Bayesian approaches to statistical power are presented.
Journal Article

Power failure: why small sample size undermines the reliability of neuroscience

TL;DR: It is shown that the average statistical power of studies in the neurosciences is very low, and the consequences include overestimates of effect size and low reproducibility of results.