Author
Štěpán Bahník
Other affiliations: Prague College, Academy of Sciences of the Czech Republic, University of Würzburg
Bio: Štěpán Bahník is an academic researcher at the University of Economics, Prague. He has contributed to research on replication (statistics) and anchoring, has an h-index of 13, and has co-authored 42 publications receiving 6,511 citations. His previous affiliations include Prague College and the Academy of Sciences of the Czech Republic.
Papers
Estimating the Reproducibility of Psychological Science
Alexander A. Aarts, Joanna E. Anderson, Christopher J. Anderson, Peter Raymond Attridge, +287 more (116 institutions)
TL;DR: A large-scale assessment suggests that experimental reproducibility in psychology leaves a lot to be desired, and correlational tests suggest that replication success was better predicted by the strength of original evidence than by characteristics of the original and replication teams.
Abstract: Reproducibility is a defining feature of science, but the extent to which it characterizes current research is unknown. We conducted replications of 100 experimental and correlational studies published in three psychology journals using high-powered designs and original materials when available. Replication effects were half the magnitude of original effects, representing a substantial decline. Ninety-seven percent of original studies had statistically significant results. Thirty-six percent of replications had statistically significant results; 47% of original effect sizes were in the 95% confidence interval of the replication effect size; 39% of effects were subjectively rated to have replicated the original result; and if no bias in original results is assumed, combining original and replication results left 68% with statistically significant effects. Correlational tests suggest that replication success was better predicted by the strength of original evidence than by characteristics of the original and replication teams.
5,532 citations
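To make one of the abstract's replication criteria concrete ("47% of original effect sizes were in the 95% confidence interval of the replication effect size"), here is a minimal Python sketch, not the project's actual analysis code, assuming effects are expressed as correlation coefficients and compared on the Fisher-z scale; the numbers at the end are invented for illustration.

```python
import math

def fisher_z(r: float) -> float:
    """Fisher z-transform of a correlation coefficient."""
    return 0.5 * math.log((1 + r) / (1 - r))

def original_in_replication_ci(r_orig: float, r_rep: float, n_rep: int) -> bool:
    """True if the original effect lies in the replication's 95% CI."""
    z_rep = fisher_z(r_rep)
    se = 1 / math.sqrt(n_rep - 3)            # standard error of Fisher z
    lo, hi = z_rep - 1.96 * se, z_rep + 1.96 * se
    return lo <= fisher_z(r_orig) <= hi

# Hypothetical numbers: original r = .40, replication r = .20 with n = 120.
print(original_in_replication_ci(0.40, 0.20, 120))  # False -> fails this criterion
```

By this criterion the hypothetical replication fails, even though its effect is in the same direction, which is why the abstract reports this measure separately from statistical significance.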
Investigating Variation in Replicability: A "Many Labs" Replication Project
Institutions: University of Florida, University of Padua, University of Würzburg, Pennsylvania State University, University of Social Sciences and Humanities, Tilburg University, City University of New York, Koç University, University of Michigan, University of Kuala Lumpur, Texas A&M University, San Diego State University, Mount Saint Vincent University, Radboud University Nijmegen, Virginia Commonwealth University, Texas A&M University–Commerce, Loyola University Chicago, Worcester Polytechnic Institute, London School of Economics and Political Science, James Madison University, Occidental College, McDaniel College, Connecticut College, Wilfrid Laurier University, University of Brasília, California State University, Northridge, University of Virginia, Ohio State University, University of Wisconsin-Madison, Ithaca College, Charles University in Prague, Western Kentucky University, Washington and Lee University
TL;DR: The authors tested variation in the replicability of 13 classic and contemporary effects across 36 independent samples totaling 6,344 participants and found that replicability depends more on the effect itself than on the sample and setting used to investigate it.
Abstract: Although replication is a central tenet of science, direct replications are rare in psychology. This research tested variation in the replicability of 13 classic and contemporary effects across 36 independent samples totaling 6,344 participants. In the aggregate, 10 effects replicated consistently. One effect – imagined contact reducing prejudice – showed weak support for replicability. And two effects – flag priming influencing conservatism and currency priming influencing system justification – did not replicate. We compared whether the conditions such as lab versus online or US versus international sample predicted effect magnitudes. By and large they did not. The results of this small sample of effects suggest that replicability is more dependent on the effect itself than on the sample and setting used to investigate the effect.
767 citations
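The abstract aggregates each effect across 36 samples; a standard way to do this is an inverse-variance-weighted (fixed-effect) mean. The sketch below illustrates that aggregation with made-up per-sample numbers; it is not the paper's analysis pipeline.

```python
import math

def fixed_effect_mean(effects, ses):
    """Inverse-variance-weighted mean effect and its standard error."""
    weights = [1 / se**2 for se in ses]
    mean = sum(w * d for w, d in zip(weights, effects)) / sum(weights)
    se_mean = math.sqrt(1 / sum(weights))
    return mean, se_mean

effects = [0.45, 0.30, 0.52, 0.38]   # hypothetical per-sample Cohen's d values
ses     = [0.10, 0.12, 0.15, 0.11]   # their standard errors
mean, se = fixed_effect_mean(effects, ses)
print(f"aggregate d = {mean:.2f} ± {1.96 * se:.2f}")
```

Precise samples get more weight, which is why a large aggregate sample can give a clear verdict on an effect even when individual labs' estimates vary.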
Many Labs 2: Investigating Variation in Replicability Across Samples and Settings
Richard A. Klein, Michelangelo Vianello, Fred Hasselman, Byron G. Adams, +187 more (118 institutions)
TL;DR: The authors conducted preregistered replications of 28 classic and contemporary published findings, with protocols peer reviewed in advance, to examine variation in effect magnitudes across samples and settings; very little heterogeneity was attributable to the order in which the tasks were performed or to whether the tasks were administered in a lab or online.
Abstract: We conducted preregistered replications of 28 classic and contemporary published findings, with protocols that were peer reviewed in advance, to examine variation in effect magnitudes across samples and settings. Each protocol was administered to approximately half of 125 samples that comprised 15,305 participants from 36 countries and territories. Using the conventional criterion of statistical significance (p < .05), we found that 15 (54%) of the replications provided evidence of a statistically significant effect in the same direction as the original finding. With a strict significance criterion (p < .0001), 14 (50%) of the replications still provided such evidence, a reflection of the extremely high-powered design. Seven (25%) of the replications yielded effect sizes larger than the original ones, and 21 (75%) yielded effect sizes smaller than the original ones. The median comparable Cohen’s ds were 0.60 for the original findings and 0.15 for the replications. The effect sizes were small (< 0.20) in 16 of the replications (57%), and 9 effects (32%) were in the direction opposite the direction of the original effect. Across settings, the Q statistic indicated significant heterogeneity in 11 (39%) of the replication effects, and most of those were among the findings with the largest overall effect sizes; only 1 effect that was near zero in the aggregate showed significant heterogeneity according to this measure. Only 1 effect had a tau value greater than .20, an indication of moderate heterogeneity. Eight others had tau values near or slightly above .10, an indication of slight heterogeneity. Moderation tests indicated that very little heterogeneity was attributable to the order in which the tasks were performed or whether the tasks were administered in lab versus online. Exploratory comparisons revealed little heterogeneity between Western, educated, industrialized, rich, and democratic (WEIRD) cultures and less WEIRD cultures (i.e., cultures with relatively high and low WEIRDness scores, respectively). Cumulatively, variability in the observed effect sizes was attributable more to the effect being studied than to the sample or setting in which it was studied.
495 citations
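The heterogeneity measures the abstract reports, Cochran's Q and tau (the between-sample standard deviation of true effects), can be computed from per-sample effects and standard errors. This is a minimal sketch using the DerSimonian-Laird estimator with invented numbers, not the paper's actual code.

```python
import math

def q_and_tau(effects, ses):
    """Cochran's Q and DerSimonian-Laird tau for a set of per-sample effects."""
    weights = [1 / se**2 for se in ses]
    mean = sum(w * d for w, d in zip(weights, effects)) / sum(weights)
    q = sum(w * (d - mean) ** 2 for w, d in zip(weights, effects))
    df = len(effects) - 1
    c = sum(weights) - sum(w**2 for w in weights) / sum(weights)
    tau2 = max(0.0, (q - df) / c)        # DerSimonian-Laird estimate
    return q, math.sqrt(tau2)

effects = [0.10, 0.35, 0.05, 0.42, 0.18]  # hypothetical per-sample effects
ses     = [0.08, 0.09, 0.07, 0.10, 0.08]
q, tau = q_and_tau(effects, ses)
print(f"Q = {q:.1f} on {len(effects)-1} df, tau = {tau:.2f}")  # tau ≈ 0.13 here
```

A tau near .10, as in this invented example, corresponds to what the abstract calls slight heterogeneity; tau above .20 would indicate moderate heterogeneity.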
Many Analysts, One Data Set: Making Transparent How Variations in Analytic Choices Affect Results
Institutions: University of Sussex, INSEAD, University of Virginia, University of Padua, University of Cologne, University of Cincinnati, University of Economics, Prague, Hong Kong Polytechnic University, University of Liverpool, Stockholm School of Economics, Linnaeus University, University of Hong Kong, University of California, Berkeley, City University of New York, New York University, University of Manchester, Westat, Temple University, Northwestern University, University of Zurich, University of Sheffield, Stockholm University, Ludwig Maximilian University of Munich, University of Minnesota, Xiamen University, Oregon State University, Universidade Federal de Santa Catarina, University of Washington, Queen Mary University of London, University of Nottingham, Cardiff University, University of Maryland, College Park, Brigham Young University, Loyola University Maryland, University of Toronto, University of Giessen, United States Military Academy, State University of New York at Oswego, Concordia University, University of Bamberg, University of Amsterdam, Center for Open Science
TL;DR: Twenty-nine teams involving 61 analysts used the same data set to address the same research question: whether soccer referees are more likely to give red cards to dark-skin-toned players than to light-skin-toned players.
Abstract: Twenty-nine teams involving 61 analysts used the same data set to address the same research question: whether soccer referees are more likely to give red cards to dark-skin-toned players than to light-skin-toned players. Analytic approaches varied widely across the teams, and the estimated effect sizes ranged from 0.89 to 2.93 (Mdn = 1.31) in odds-ratio units. Twenty teams (69%) found a statistically significant positive effect, and 9 teams (31%) did not observe a significant relationship. Overall, the 29 different analyses used 21 unique combinations of covariates. Neither analysts’ prior beliefs about the effect of interest nor their level of expertise readily explained the variation in the outcomes of the analyses. Peer ratings of the quality of the analyses also did not account for the variability. These findings suggest that significant variation in the results of analyses of complex data may be difficult to avoid, even by experts with honest intentions. Crowdsourcing data analysis, a strategy in which numerous research teams are recruited to simultaneously investigate the same research question, makes transparent how defensible, yet subjective, analytic choices influence research results.
396 citations
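To make the abstract's "odds-ratio units" concrete, here is a minimal sketch computing an odds ratio and a Wald 95% CI from a 2x2 table of red cards by skin tone. The counts are invented for illustration and are not the study's data.

```python
import math

def odds_ratio(a, b, c, d):
    """OR and 95% CI for a 2x2 table: rows = group, cols = event yes/no."""
    or_ = (a * d) / (b * c)
    se_log = math.sqrt(1/a + 1/b + 1/c + 1/d)  # SE of log(OR)
    lo = math.exp(math.log(or_) - 1.96 * se_log)
    hi = math.exp(math.log(or_) + 1.96 * se_log)
    return or_, lo, hi

# Hypothetical counts: 40 red cards in 10,000 dyads with dark-skin-toned
# players vs. 30 red cards in 10,000 dyads with light-skin-toned players.
print(odds_ratio(40, 9_960, 30, 9_970))  # OR ≈ 1.33
```

An odds ratio of 1 means no difference between groups; the study's team estimates ranged from 0.89 (slightly reversed) to 2.93 (nearly tripled odds) from the same underlying data, depending on analytic choices such as which covariates to include.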
Registered Replication Report: Schooler and Engstler-Schooler (1990)
Institutions: University of Otago, Stonehill College, Mount Saint Vincent University, University of Würzburg, State University of New York at Brockport, University of Nebraska–Lincoln, Erasmus University Rotterdam, Università degli Studi Suor Orsola Benincasa, University of Leeds, George Fox University, Texas A&M University–Commerce, University of Social Sciences and Humanities, Lehigh Carbon Community College, University of Warwick, Flinders University, University of Münster, Rochester Institute of Technology, University of Virginia, Victoria University of Wellington, Goldsmiths, University of London, University of North Dakota, College of Charleston, University of Stirling, Kent State University, University of Tasmania, University of Oxford, University of Düsseldorf, Ohio State University, University of Central Lancashire, University of Maine, Iowa State University, Nebraska Wesleyan University, University of Navarra, University of Wyoming, Masaryk University, University of Portsmouth, University of Texas at El Paso, Niagara University, Charles University in Prague, Arkansas State University
TL;DR: Participants who verbally described a robber after watching a video of a simulated bank robbery were 25% worse at identifying the robber in a lineup than participants who instead listed U.S. states and capitals, a finding termed the "verbal overshadowing" effect.
Abstract: Trying to remember something now typically improves your ability to remember it later. However, after watching a video of a simulated bank robbery, participants who verbally described the robber were 25% worse at identifying the robber in a lineup than were participants who instead listed U.S. states and capitals—this has been termed the “verbal overshadowing” effect (Schooler & Engstler-Schooler, 1990). More recent studies suggested that this effect might be substantially smaller than first reported. Given uncertainty about the effect size, the influence of this finding in the memory literature, and its practical importance for police procedures, we conducted two collections of preregistered direct replications (RRR1 and RRR2) that differed only in the order of the description task and a filler task. In RRR1, when the description task immediately followed the robbery, participants who provided a description were 4% less likely to select the robber than were those in the control condition. In RRR2, when the description was delayed by 20 min, they were 16% less likely to select the robber. These findings reveal a robust verbal overshadowing effect that is strongly influenced by the relative timing of the tasks. The discussion considers further implications of these replications for our understanding of verbal overshadowing.
180 citations
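The abstract's "4% less likely" and "16% less likely" are differences in lineup identification rates between conditions. A minimal sketch of the kind of two-proportion comparison involved, with invented counts rather than the RRR's actual data or its exact analysis:

```python
from math import sqrt, erfc

def two_prop_z(x1, n1, x2, n2):
    """z statistic and two-sided p for a difference of two proportions."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    return z, erfc(abs(z) / sqrt(2))   # two-sided normal p-value

# Hypothetical counts: 54% correct IDs in the control condition vs. 38%
# after a delayed description (not the replication reports' data).
print(two_prop_z(540, 1000, 380, 1000))
```

With large multi-lab samples, even the smaller 4% difference found in RRR1 can be estimated precisely, which is what allowed the reports to attribute the difference between 4% and 16% to the timing of the description task.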
Cited by
Estimating the Reproducibility of Psychological Science
Alexander A. Aarts, Joanna E. Anderson, Christopher J. Anderson, Peter Raymond Attridge, +287 more (116 institutions)
TL;DR: A large-scale assessment suggests that experimental reproducibility in psychology leaves a lot to be desired, and correlational tests suggest that replication success was better predicted by the strength of original evidence than by characteristics of the original and replication teams.
Abstract: Reproducibility is a defining feature of science, but the extent to which it characterizes current research is unknown. We conducted replications of 100 experimental and correlational studies published in three psychology journals using high-powered designs and original materials when available. Replication effects were half the magnitude of original effects, representing a substantial decline. Ninety-seven percent of original studies had statistically significant results. Thirty-six percent of replications had statistically significant results; 47% of original effect sizes were in the 95% confidence interval of the replication effect size; 39% of effects were subjectively rated to have replicated the original result; and if no bias in original results is assumed, combining original and replication results left 68% with statistically significant effects. Correlational tests suggest that replication success was better predicted by the strength of original evidence than by characteristics of the original and replication teams.
5,532 citations
A manifesto for reproducible science
TL;DR: This work argues for the adoption of measures to optimize key elements of the scientific process: methods, reporting and dissemination, reproducibility, evaluation, and incentives, in the hope of facilitating action toward improving the transparency, reproducibility, and efficiency of scientific research.
Abstract: Improving the reliability and efficiency of scientific research will increase the credibility of the published scientific literature and accelerate discovery. Here we argue for the adoption of measures to optimize key elements of the scientific process: methods, reporting and dissemination, reproducibility, evaluation and incentives. There is some evidence from both simulations and empirical studies supporting the likely effectiveness of these measures, but their broad adoption by researchers, institutions, funders and journals will require iterative evaluation and improvement. We discuss the goals of these measures, and how they can be implemented, in the hope that this will facilitate action toward improving the transparency, reproducibility and efficiency of scientific research.
1,951 citations
Inside the Turk: Understanding Mechanical Turk as a Participant Pool
TL;DR: This paper discusses the characteristics of Mechanical Turk (MTurk) as a participant pool for psychology and other social sciences, highlighting the traits of MTurk samples, why people become MTurk workers and research participants, and how data quality on MTurk compares to that from other pools and depends on controllable and uncontrollable factors.
Abstract: Mechanical Turk (MTurk), an online labor market created by Amazon, has recently become popular among social scientists as a source of survey and experimental data. The workers who populate this market have been assessed on dimensions that are universally relevant to understanding whether, why, and when they should be recruited as research participants. We discuss the characteristics of MTurk as a participant pool for psychology and other social sciences, highlighting the traits of the MTurk samples, why people become MTurk workers and research participants, and how data quality on MTurk compares to that from other pools and depends on controllable and uncontrollable factors.
1,926 citations
Redefine statistical significance
Institutions: University of Southern California, Duke University, Stockholm School of Economics, University of Virginia, Center for Open Science, University of Amsterdam, University of Pennsylvania, University of North Carolina at Chapel Hill, University of Regensburg, California Institute of Technology, New York University, Research Institute of Industrial Economics, Cardiff University, Mathematica Policy Research, Northwestern University, Ohio State University, University of Sussex, Texas A&M University, Royal Holloway, University of London, University of Zurich, University of Melbourne, University of Wisconsin-Madison, University of Michigan, Stanford University, Rutgers University, Columbia University, University of Washington, University of Edinburgh, National University of Singapore, Utrecht University, Arizona State University, Princeton University, University of California, Los Angeles, Imperial College London, University of Innsbruck, Harvard University, University of Chicago, University of Pittsburgh, University of Notre Dame, University of California, Berkeley, Johns Hopkins University, University of Bristol, University of New South Wales, Dartmouth College, Whitman College, University of Puerto Rico, University of Milan, University of California, Irvine, Paris Dauphine University, University of British Columbia, Ludwig Maximilian University of Munich, Purdue University, Washington University in St. Louis, University of California, Davis, Microsoft
TL;DR: The default P-value threshold for statistical significance is proposed to be changed from 0.05 to 0.005 for claims of new discoveries, in order to reduce the rate of false positives among such claims.
Abstract: We propose to change the default P-value threshold for statistical significance from 0.05 to 0.005 for claims of new discoveries.
1,586 citations
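One practical consequence of the proposal is larger required samples. Here is a worked sketch, using a standard normal-approximation power formula rather than anything from the paper itself, of the per-group sample size a two-sided two-sample comparison needs for 80% power at alpha = .05 versus alpha = .005.

```python
from math import ceil
from scipy.stats import norm

def n_per_group(d: float, alpha: float, power: float = 0.80) -> int:
    """Approximate per-group n for a two-sided two-sample z-test."""
    z_alpha = norm.ppf(1 - alpha / 2)   # critical value for the threshold
    z_beta = norm.ppf(power)            # quantile corresponding to power
    return ceil(2 * ((z_alpha + z_beta) / d) ** 2)

for alpha in (0.05, 0.005):
    print(alpha, n_per_group(d=0.4, alpha=alpha))
# For d = 0.4: roughly 99 vs. 167 per group, i.e. alpha = .005 requires
# about 70% more participants than alpha = .05 at the same power.
```

The roughly 70% increase in sample size shown here is the trade-off the proposal accepts in exchange for fewer false-positive claims of new discoveries.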