Using Bayes to get the most out of non-significant results

doi:10.3389/FPSYG.2014.00781

Open Access Journal Article

Zoltan Dienes

- 29 Jul 2014 -

Frontiers in Psychology

- Vol. 5, pp 781-781

TLDR

It is argued that Bayes factors allow theory to be linked to data in a way that overcomes the weaknesses of the other approaches, and that they provide a coherent approach to determining whether non-significant results support a null hypothesis over a theory, or whether the data are just insensitive.

Abstract:

No scientific conclusion follows automatically from a statistically non-significant result, yet people routinely use non-significant results to guide conclusions about the status of theories (or the effectiveness of practices). To know whether a non-significant result counts against a theory, or if it just indicates data insensitivity, researchers must use one of: power, intervals (such as confidence or credibility intervals), or else an indicator of the relative evidence for one theory over another, such as a Bayes factor. I argue Bayes factors allow theory to be linked to data in a way that overcomes the weaknesses of the other approaches. Specifically, Bayes factors use the data themselves to determine their sensitivity in distinguishing theories (unlike power), and they make use of those aspects of a theory’s predictions that are often easiest to specify (unlike power and intervals, which require specifying the minimal interesting value in order to address theory). Bayes factors provide a coherent approach to determining whether non-significant results support a null hypothesis over a theory, or whether the data are just insensitive. They allow accepting and rejecting the null hypothesis to be put on an equal footing. Concrete examples are provided to indicate the range of application of a simple online Bayes calculator, which reveal both the strengths and weaknesses of Bayes factors.
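
The three-way distinction drawn in the abstract (support for the theory, support for the null, or insensitive data) can be illustrated with a minimal Python sketch. This is not the online calculator mentioned in the abstract. It assumes, purely for illustration, that the data are summarized by a sample mean and its standard error, that the null hypothesis predicts an effect of exactly zero, and that the theory predicts a positive effect modeled as a half-normal distribution whose scale is the roughly expected effect size. The function names, the parameter `h1_scale`, and the cut-offs of 3 and 1/3 (a common rule of thumb) are assumptions of this sketch, not details taken from the abstract.

```python
import math


def normal_pdf(x, mean, sd):
    """Density at x of a normal distribution with the given mean and SD."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def bayes_factor_half_normal(obs_mean, obs_se, h1_scale):
    """Bayes factor for H1 (effect ~ half-normal, scale h1_scale) over H0 (effect = 0).

    Hypothetical helper: the data are assumed to give a normal likelihood
    centered on obs_mean with standard deviation obs_se.
    """
    # Likelihood of the observed mean if the true effect is exactly zero.
    like_h0 = normal_pdf(obs_mean, 0.0, obs_se)

    # Marginal likelihood under H1: the normal likelihood averaged over the
    # half-normal prior. The product of the two normal curves is itself
    # normal, so the integral over positive effects has a closed form.
    total_var = h1_scale ** 2 + obs_se ** 2
    post_mean = obs_mean * h1_scale ** 2 / total_var
    post_sd = math.sqrt(h1_scale ** 2 * obs_se ** 2 / total_var)
    like_h1 = (
        2.0
        * normal_pdf(obs_mean, 0.0, math.sqrt(total_var))
        * normal_cdf(post_mean / post_sd)
    )
    return like_h1 / like_h0


def interpret(bf, threshold=3.0):
    """Label a Bayes factor using an assumed rule-of-thumb threshold."""
    if bf >= threshold:
        return "supports theory"
    if bf <= 1.0 / threshold:
        return "supports null"
    return "insensitive"


if __name__ == "__main__":
    # Two non-significant results with the same standard error.
    for mean in (0.0, 1.0):
        bf = bayes_factor_half_normal(obs_mean=mean, obs_se=1.0, h1_scale=5.0)
        print(f"mean={mean}: BF={bf:.2f} -> {interpret(bf)}")
```

Under these assumptions, two results that would both be non-significant can lead to different conclusions: one counts as evidence for the null, while the other leaves the question open. Note that the sketch only asks for a rough expected effect size and does not require a minimal interesting value.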

Citations

Posted Content

Retrieval of a well-established skill is resistant to distraction: evidence from an implicit probabilistic sequence learning task

Teodóra Vékony, +11 more

- 21 Apr 2020 -

bioRxiv

TL;DR: It is shown that although the dual-task condition significantly prolonged the overall reaction times in the primary task, the access to the previously learned probabilistic representations remained intact and an inverse relationship between the ability to successfully retrieve sequence knowledge and the accuracy of the secondary task was found.

Proceedings Article

Null hypothesis significance testing in simulation

Marko Hofmann

TL;DR: A critical reflection of the arguments contra NHST shows that although NHST is indeed ill-suited for many simulation applications and objectives it is by no means superfluous, neither in general, nor in particular for simulation.

Dissertation

Reliability, replicability and reproducibility in PET imaging

Granville J. Matheson

TL;DR: This thesis explores themes of reliability, replicability and reproducibility for PET research, which allows researchers to more effectively gauge the feasibility of new between-individual studies before collection of any data, and to focus their efforts on research questions which can be expected to yield more interpretable outcomes.

Journal Article

A novel team Familiarity Score for operating teams is a predictor of length of a procedure—A retrospective Bayesian analysis

Katarzyna Powezka, +3 more

- 01 Mar 2020 -

Journal of Vascular Surgery

TL;DR: The Familiarity Score (FS) in vascular teams was shown to be strongly associated with length of procedure (LOP), suggesting that more familiar teams might collaborate more efficiently; Bayesian statistics were used to analyze the data.

Journal Article

Situational factors shape moral judgements in the trolley dilemma in Eastern, Southern and Western countries in a culturally diverse sample

Bence Bago, +269 more

- 14 Apr 2022 -

Nature Human Behaviour

TL;DR: The authors empirically tested the universality of the effects of intent and personal force on moral dilemma judgements by replicating the experiments of Greene et al. in 45 countries from all inhabited continents and found that personal force and its interaction with intention exert influence on moral judgements in the US and Western cultural clusters, replicating and expanding the original findings.

References

Book

Statistical Power Analysis for the Behavioral Sciences

Jacob Cohen

TL;DR: The concepts of power analysis are discussed in this paper, where Chi-square Tests for Goodness of Fit and Contingency Tables, t-Test for Means, and Sign Test are used.

Book

Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach

Kenneth P. Burnham, +1 more

TL;DR: The second edition of this book is unique in that it focuses on methods for making formal statistical inference from all the models in an a priori set (Multi-Model Inference).

Journal Article

Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.

Franz Faul, +3 more

- 01 Nov 2009 -

Behavior Research Methods

TL;DR: In the new version, procedures to analyze the power of tests based on single-sample tetrachoric correlations, comparisons of dependent correlations, bivariate linear regression, multiple linear regression based on the random predictor model, logistic regression, and Poisson regression are added.

Journal Article

Bayesian data analysis.

John K. Kruschke

- 01 Sep 2010 -

Wiley Interdisciplinary Reviews: Cogniti...

TL;DR: A fatal flaw of NHST is reviewed and some benefits of Bayesian data analysis are introduced and illustrative examples of multiple comparisons in Bayesian analysis of variance and Bayesian approaches to statistical power are presented.

Journal Article

Power failure: why small sample size undermines the reliability of neuroscience

Katherine S. Button, +6 more

- 01 May 2013 -

Nature Reviews Neuroscience

TL;DR: It is shown that the average statistical power of studies in the neurosciences is very low, and the consequences include overestimates of effect size and low reproducibility of results.