Using Bayes to get the most out of non-significant results

doi:10.3389/FPSYG.2014.00781

Zoltan Dienes

- 29 Jul 2014 -

Frontiers in Psychology

- Vol. 5, Article 781

TLDR

It is argued that Bayes factors allow theory to be linked to data in a way that overcomes the weaknesses of the other approaches (power and intervals), and that they provide a coherent approach to determining whether non-significant results support a null hypothesis over a theory, or whether the data are just insensitive.

Abstract:

No scientific conclusion follows automatically from a statistically non-significant result, yet people routinely use non-significant results to guide conclusions about the status of theories (or the effectiveness of practices). To know whether a non-significant result counts against a theory, or if it just indicates data insensitivity, researchers must use one of: power, intervals (such as confidence or credibility intervals), or else an indicator of the relative evidence for one theory over another, such as a Bayes factor. I argue Bayes factors allow theory to be linked to data in a way that overcomes the weaknesses of the other approaches. Specifically, Bayes factors use the data themselves to determine their sensitivity in distinguishing theories (unlike power), and they make use of those aspects of a theory’s predictions that are often easiest to specify (unlike power and intervals, which require specifying the minimal interesting value in order to address theory). Bayes factors provide a coherent approach to determining whether non-significant results support a null hypothesis over a theory, or whether the data are just insensitive. They allow accepting and rejecting the null hypothesis to be put on an equal footing. Concrete examples are provided to indicate the range of application of a simple online Bayes calculator, which reveal both the strengths and weaknesses of Bayes factors.
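To make the abstract's contrast concrete, here is a minimal sketch of a Bayes factor that compares a theory (H1) against a point null (H0). It is not the online calculator the abstract mentions. It assumes the observed effect is summarised by a mean and standard error with a normal likelihood, and that the theory's predictions are modelled as a half-normal prior whose scale (`prior_sd`) is a rough expected effect size. The function names, the numerical integration, and the cutoffs of 3 and 1/3 (a common convention) are all illustrative assumptions.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution with the given mean and sd at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def bayes_factor_half_normal(obs_effect, obs_se, prior_sd, steps=20000):
    """Return B = P(data | H1) / P(data | H0).

    H0: the true effect is exactly 0.
    H1: the true effect follows a half-normal prior (positive values only)
        with scale prior_sd. This prior is an assumption of the sketch.
    """
    if obs_se <= 0 or prior_sd <= 0:
        raise ValueError("obs_se and prior_sd must be positive")

    # Likelihood of the observed effect if the true effect is 0.
    like_h0 = normal_pdf(obs_effect, 0.0, obs_se)

    # Marginal likelihood under H1: average the likelihood over the prior,
    # using a midpoint rule on [0, 6 * prior_sd] (prior mass beyond is negligible).
    upper = 6.0 * prior_sd
    width = upper / steps
    like_h1 = 0.0
    for i in range(steps):
        theta = (i + 0.5) * width
        prior = 2.0 * normal_pdf(theta, 0.0, prior_sd)  # half-normal density
        like_h1 += normal_pdf(obs_effect, theta, obs_se) * prior * width

    return like_h1 / like_h0


def interpret(b, cutoff=3.0):
    """Label a Bayes factor using conventional (assumed) cutoffs of 3 and 1/3."""
    if b >= cutoff:
        return "evidence for the theory (H1)"
    if b <= 1.0 / cutoff:
        return "evidence for the null (H0)"
    return "data insensitive"


if __name__ == "__main__":
    # A non-significant result: observed effect 0, standard error 3, prior scale 4.
    b = bayes_factor_half_normal(0.0, 3.0, 4.0)
    print(round(b, 3), interpret(b))
```

For an observed effect of exactly 0 the integral has a closed form, B = obs_se / sqrt(obs_se^2 + prior_sd^2), so the example above gives B of about 0.6. Under the assumed cutoffs that counts as insensitive data rather than support for the null, which is the distinction a non-significant p-value alone cannot draw.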

Citations

Journal Article

Busting a myth with the Bayes Factor: Effects of letter bigram frequency in visual lexical decision do not reflect reading processes

Xenia Schmalz, +1 more

- 16 Mar 2017 -

The Mental Lexicon

TL;DR: This article found no effect of bigram frequency in lexical decision in the British Lexicon Project, and some evidence for an inhibitory effect in the English Lexicon Project.

Journal Article

Evaluation of behavioral problems after prenatal dexamethasone treatment in Swedish adolescents at risk of CAH

Lena Wallensteen, +6 more

- 01 Sep 2016 -

Hormones and Behavior

TL;DR: The entire data set was unreliable and needed to be re-analysed; the results of the re-analysis have been published here: https://www.sciencedirect.com/science/article/pii/S0018506X17300752

Journal Article

Assessment of the Choice Behavior Under Cued Conditions (CBUCC) paradigm as a measure of motivation to smoke under laboratory conditions

Julie C. Gass, +1 more

- 01 Feb 2020 -

Addiction

TL;DR: Craving and 'money spent' in the Choice Behavior Under Cued Conditions (CBUCC) task appear to be responsive to cigarette versus water cues, and 'money spent' appears to show a greater difference in responsiveness to cigarette than to water cues after abstinence.

Posted Content

Test-retest reliability and convergent validity of (R)-[11C]PK11195 outcome measures without arterial input function

Pontus Plavén-Sigray, +5 more

- 11 Apr 2018 -

bioRxiv

TL;DR: The evaluated techniques for estimating (R)-[11C]PK11195 binding without arterial measurements, such as standardized uptake values (SUVs), supervised-cluster analysis (SVCA), or the use of a pseudo-reference region, showed poor reliability and little to no convergent validity with outcomes derived using an arterial input function (AIF).

Proceedings Article

Convergence Across Behavioral and Self-report Measures Evaluating Individuals' Trust in an Autonomous Golf Cart

Jenna E. Cotter, +7 more

TL;DR: In this paper, the authors used an autonomous golf cart to drive participants to different locations around the campus of James Madison University while a camera recorded them, and participants were given the AICP-R and TOAST to evaluate their complacency potential and trust.

References

Book

Statistical Power Analysis for the Behavioral Sciences

Jacob Cohen

TL;DR: This book discusses the concepts of power analysis, covering chi-square tests for goodness of fit and contingency tables, the t-test for means, and the sign test.

Book

Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach

Kenneth P. Burnham, +1 more

TL;DR: The second edition of this book is unique in that it focuses on methods for making formal statistical inference from all the models in an a priori set (Multi-Model Inference).

Journal Article

Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.

Franz Faul, +3 more

- 01 Nov 2009 -

Behavior Research Methods

TL;DR: G*Power 3.1 adds procedures to analyze the power of tests based on single-sample tetrachoric correlations, comparisons of dependent correlations, bivariate linear regression, multiple linear regression based on the random predictor model, logistic regression, and Poisson regression.

Journal Article

Bayesian data analysis.

John K. Kruschke

- 01 Sep 2010 -

Wiley Interdisciplinary Reviews: Cogniti...

TL;DR: A fatal flaw of null hypothesis significance testing (NHST) is reviewed, some benefits of Bayesian data analysis are introduced, and illustrative examples of multiple comparisons in Bayesian analysis of variance and of Bayesian approaches to statistical power are presented.

Journal Article

Power failure: why small sample size undermines the reliability of neuroscience

Katherine S. Button, +6 more

- 01 May 2013 -

Nature Reviews Neuroscience

TL;DR: It is shown that the average statistical power of studies in the neurosciences is very low, and the consequences include overestimates of effect size and low reproducibility of results.
