scispace - formally typeset
Journal ArticleDOI

Research Commentary---Too Big to Fail: Large Samples and the p-Value Problem

TLDR
This research commentary recommends a series of actions the researcher can take to mitigate the p-value problem in large samples and illustrates them with an example of over 300,000 camera sales on eBay.
Abstract
The Internet has provided IS researchers with the opportunity to conduct studies with extremely large samples, frequently well over 10,000 observations. There are many advantages to large samples, but researchers using statistical inference must be aware of the p-value problem associated with them. In very large samples, p-values go quickly to zero, and solely relying on p-values can lead the researcher to claim support for results of no practical significance. In a survey of large sample IS research, we found that a significant number of papers rely on a low p-value and the sign of a regression coefficient alone to support their hypotheses. This research commentary recommends a series of actions the researcher can take to mitigate the p-value problem in large samples and illustrates them with an example of over 300,000 camera sales on eBay. We believe that addressing the p-value problem will increase the credibility of large sample IS research as well as provide more insights for readers.

read more

Citations
More filters
Posted ContentDOI

Confronting p-hacking: Addressing p-value dependence on sample size

TL;DR: A systematic and easy-to-follow protocol that models the p-value as an exponential function to test the existence of real statistical significance and provides a robust assessment of the null hypothesis with accurate values for the minimum data-size needed to reject it is proposed.
Dissertation

The Effect of Physiological Modulators on Resting-state fMRI Functional Connectivity

TL;DR: The results confirm that it is important to account for the effect of physiological noise when examining resting-state fMRI functional connectivity, in the study of healthy brain function, but more importantly, in states of altered brain function.
Journal ArticleDOI

Point and interval forecasts of electricity demand with Reg-SARMA models

TL;DR: In this article, a two-stage approach to forecasting hourly electricity demand by using a linear regression model with serially correlated residuals is presented. But the model is not a white noise series, and the residuals are not a seasonal autoregressive moving average.
Journal ArticleDOI

Open-Source Personality Trait Norms for the United Kingdom and Ireland

TL;DR: In this paper , the authors created open-source norm tables for different age groups (14-17 years, 18-25 years, and 30+ years) within a combined standardization sample from the United Kingdom (UK) and Ireland (N = 18,591).
Journal ArticleDOI

Migrant background and the impact of the COVID-19 pandemic on mental healthcare consultations among children and adolescents in Norway: a nationwide registry study

TL;DR: In this paper , the authors investigated the impact of lockdown and subsequent COVID-19 infection control measures on children's health service use for mental health problems according to migrant background and found that after lockdown, mental health consultation volumes for non-migrant children increased more than for children with migrant background.
References
More filters
Journal ArticleDOI

Things I Have Learned (So Far).

TL;DR: The application of statistics to psychology and the other sociobiomedical sciences has been studied extensively as discussed by the authors, including the principles "less is more" (fewer variables, more highly targeted issues, sharp rounding off), "simple is better" (graphic representation, unit weighting for linear composites), and "some things you learn aren't so."
Journal ArticleDOI

To Explain or to Predict

TL;DR: The distinction between explanatory and predictive models is discussed in this paper, and the practical implications of the distinction to each step in the model- ing process are discussed as well as a discussion of the differences that arise in the process of modeling for an explanatory ver- sus a predictive goal.
Journal ArticleDOI

Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets

TL;DR: It is suggested that identity-relevant information about reviewers shapes community members' judgment of products and reviews and shows that shared geographical location increases the relationship between disclosure and product sales, thus highlighting the important role of geography in electronic commerce.
Journal ArticleDOI

Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets

TL;DR: In this paper, the authors used a unique data set based on both chronologically compiled ratings as well as reviewer characteristics for a given set of products and geographical location-based purchasing behavior from Amazon, and provided evidence that community norms are an antecedent to reviewer disclosure of identity-descriptive information.
Book

Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models

TL;DR: McCoch as discussed by the authors provides a unified, in-depth, readable introduction to the multipredictor regression methods most widely used in biostatistics: linear models for continuous outcomes, logistic models for binary outcomes, the Cox model for right-censored survival times, repeated-measures models for longitudinal and hierarchical outcomes, and generalized linear model for counts and other outcomes.
Trending Questions (1)
What are the positives of large samples in research?

Large samples in research provide researchers with more statistical power, increased generalizability of findings, and the ability to detect smaller effect sizes.