Research Commentary---Too Big to Fail: Large Samples and the p-Value Problem

doi:10.1287/ISRE.2013.0480

Journal ArticleDOI

Research Commentary---Too Big to Fail: Large Samples and the p-Value Problem

Mingfeng Lin, +2 more

- 01 Dec 2013 -

Information Systems Research

- Vol. 24, Iss: 4, pp 906-917

TLDR

This research commentary recommends a series of actions the researcher can take to mitigate the p-value problem in large samples and illustrates them with an example of over 300,000 camera sales on eBay.

Abstract:

The Internet has provided IS researchers with the opportunity to conduct studies with extremely large samples, frequently well over 10,000 observations. There are many advantages to large samples, but researchers using statistical inference must be aware of the p-value problem associated with them. In very large samples, p-values go quickly to zero, and solely relying on p-values can lead the researcher to claim support for results of no practical significance. In a survey of large sample IS research, we found that a significant number of papers rely on a low p-value and the sign of a regression coefficient alone to support their hypotheses. This research commentary recommends a series of actions the researcher can take to mitigate the p-value problem in large samples and illustrates them with an example of over 300,000 camera sales on eBay. We believe that addressing the p-value problem will increase the credibility of large sample IS research as well as provide more insights for readers.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Early or Late? Entry timing in online IT service markets and the moderating effects of market characteristics

Xinlin Yao, +3 more

- 01 Jun 2020 -

Journal of Business Research

TL;DR: In this paper, the authors empirically examined entry timing effects in the online IT service market and the moderating effects of market characteristics, and found that both early and late entry resulted in superior performance to that of intermediate entry.

...read moreread less

Journal ArticleDOI

Statistical Issues in Small and Large Sample: Need of Optimum Upper Bound for the Sample Size

Subramanian Chandrasekharan, +2 more

TL;DR: In this paper, the authors investigated the pattern of changes in the estimates and testing results for varying sample sizes and concluded that larger sample does not make differences on the values of descriptive statistics, but has significant impact on the value of inferential statistics and therefore an upper bound for the sample size needs to be fixed.

...read moreread less

Journal ArticleDOI

Bias Testing of the Public Safety Assessment: Error Rate Balance Between Whites and Blacks for New Arrests:

Matthew DeMichele, +1 more

- 01 Nov 2021 -

Crime & Delinquency

TL;DR: This article evaluated predictive bias by race when using a pretrial risk assessment, using data from Kentucky from July 2013 through December 2014 (n = 1,164,597) to evaluate differences ac...

...read moreread less

Journal ArticleDOI

Political freedom, education, and value liberalization and deliberalization: A cross-national analysis of the world values survey, 1981-2014

Tony Huiquan Zhang

- 26 Feb 2020 -

Social Science Journal

TL;DR: This paper argued that value liberalization has been a worldwide trend, but more recently, there was a resurgence of conservativism, and modernization and cultural theories have difficulty explaining it.

...read moreread less

Journal ArticleDOI

Investigating the indoor environmental quality of different workplaces through web-scraping and text-mining of Glassdoor reviews

Giorgia Chinazzo

- 09 Apr 2021 -

Building Research and Information

TL;DR: The analysis of occupants’ perception can improve building indoor environmental quality (IEQ) and the need for IEQ investigations beyond office buildings is highlighted, as well as the potential detrimental effect that uncomfortable IEQ conditions can have on job satisfaction.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Things I Have Learned (So Far).

Jacob Cohen

- 01 Dec 1990 -

American Psychologist

TL;DR: The application of statistics to psychology and the other sociobiomedical sciences has been studied extensively as discussed by the authors, including the principles "less is more" (fewer variables, more highly targeted issues, sharp rounding off), "simple is better" (graphic representation, unit weighting for linear composites), and "some things you learn aren't so."

...read moreread less

Journal ArticleDOI

To Explain or to Predict

Galit Shmueli

- 01 Aug 2010 -

Statistical Science

TL;DR: The distinction between explanatory and predictive models is discussed in this paper, and the practical implications of the distinction to each step in the model- ing process are discussed as well as a discussion of the differences that arise in the process of modeling for an explanatory ver- sus a predictive goal.

...read moreread less

Journal ArticleDOI

Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets

Chris Forman, +2 more

- 26 Jul 2006 -

Social Science Research Network

TL;DR: It is suggested that identity-relevant information about reviewers shapes community members' judgment of products and reviews and shows that shared geographical location increases the relationship between disclosure and product sales, thus highlighting the important role of geography in electronic commerce.

...read moreread less

Journal ArticleDOI

Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets

Chris Forman, +2 more

- 01 Sep 2008 -

Information Systems Research

TL;DR: In this paper, the authors used a unique data set based on both chronologically compiled ratings as well as reviewer characteristics for a given set of products and geographical location-based purchasing behavior from Amazon, and provided evidence that community norms are an antecedent to reviewer disclosure of identity-descriptive information.

...read moreread less

Book

Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models

Eric Vittinghoff, +3 more

TL;DR: McCoch as discussed by the authors provides a unified, in-depth, readable introduction to the multipredictor regression methods most widely used in biostatistics: linear models for continuous outcomes, logistic models for binary outcomes, the Cox model for right-censored survival times, repeated-measures models for longitudinal and hierarchical outcomes, and generalized linear model for counts and other outcomes.

...read moreread less

Collapse

Research Commentary---Too Big to Fail: Large Samples and the p-Value Problem

Citations

Early or Late? Entry timing in online IT service markets and the moderating effects of market characteristics

Statistical Issues in Small and Large Sample: Need of Optimum Upper Bound for the Sample Size

Bias Testing of the Public Safety Assessment: Error Rate Balance Between Whites and Blacks for New Arrests:

Political freedom, education, and value liberalization and deliberalization: A cross-national analysis of the world values survey, 1981-2014

Investigating the indoor environmental quality of different workplaces through web-scraping and text-mining of Glassdoor reviews

References

Things I Have Learned (So Far).

To Explain or to Predict

Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets

Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets

Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models

Related Papers (5)

The ASA's Statement on p-Values: Context, Process, and Purpose

Statistical Power Analysis for the Behavioral Sciences

R: A language and environment for statistical computing.

A power primer.

Sample Selection Bias as a Specification Error

Trending Questions (1)