Journal ArticleDOI
Research Commentary---Too Big to Fail: Large Samples and the p-Value Problem
TLDR
This research commentary recommends a series of actions the researcher can take to mitigate the p-value problem in large samples and illustrates them with an example of over 300,000 camera sales on eBay.Abstract:
The Internet has provided IS researchers with the opportunity to conduct studies with extremely large samples, frequently well over 10,000 observations. There are many advantages to large samples, but researchers using statistical inference must be aware of the p-value problem associated with them. In very large samples, p-values go quickly to zero, and solely relying on p-values can lead the researcher to claim support for results of no practical significance. In a survey of large sample IS research, we found that a significant number of papers rely on a low p-value and the sign of a regression coefficient alone to support their hypotheses. This research commentary recommends a series of actions the researcher can take to mitigate the p-value problem in large samples and illustrates them with an example of over 300,000 camera sales on eBay. We believe that addressing the p-value problem will increase the credibility of large sample IS research as well as provide more insights for readers.read more
Citations
More filters
Journal ArticleDOI
Comparison Between Selective Sampling and Random Undersampling for Classification of Customer Defection Using Support Vector Machine
TL;DR: The SS-SVM outperforms RU-S VM in the sense that it is capable to run the process effectively, where the SS-sVM reduces the duration of classification process 3 to 20 h shorter than using RU- SVM, with slightly different accuracy rate.
Journal ArticleDOI
Improving the representativeness of a simple random sample: an optimization model and its application to the Continuous Sample of Working Lives
Vicente A. Núñez Antón,Juan Manuel Pérez-Salamero González,Marta Regúlez Castillo,Carlos Vidal Meliá +3 more
TL;DR: In this article, an optimization model for selecting a large subsample that improves the representativeness of a simple random sample previously obtained from a population larger than the population of interest is presented.
Journal ArticleDOI
Fasting glucose of patients from public health care in the southern region of São Paulo: correlation with glycated hemoglobin and lipid levels.
Luciana Ferreira Franco,Ana Carolina Cintra Nunes Mafra,Mario Maia Bracco,Laércio Joel Franco,Larissa Kozloff Naves,Glória Maria Ferreira Ribeiro,Cristóvão Luis Pitangueira Mangueira +6 more
TL;DR: The high frequency of fasting glucose with abnormal results may reflect the high proportion of exams performed by individuals with diagnosis of diabetes, to evaluate their glycemic control and that this is a population with high cardiovascular risk.
Journal ArticleDOI
Drivers of spatial behaviour of the endangered undulate skate, Raja undulata
Katharina Leeb,Katharina Leeb,David Villegas-Ríos,Gonzalo Mucientes,Manuel E. Garci,Miguel Gilcoto,Alexandre Alonso-Fernández +6 more
Journal ArticleDOI
Expressing uncertainty in Human-Robot interaction.
TL;DR: An empirically verified list of probabilities phrases that HRI researchers can use to complement the numerical values, such as “likely” and “almost certainly not” are provided.
References
More filters
Journal ArticleDOI
Things I Have Learned (So Far).
TL;DR: The application of statistics to psychology and the other sociobiomedical sciences has been studied extensively as discussed by the authors, including the principles "less is more" (fewer variables, more highly targeted issues, sharp rounding off), "simple is better" (graphic representation, unit weighting for linear composites), and "some things you learn aren't so."
Journal ArticleDOI
To Explain or to Predict
TL;DR: The distinction between explanatory and predictive models is discussed in this paper, and the practical implications of the distinction to each step in the model- ing process are discussed as well as a discussion of the differences that arise in the process of modeling for an explanatory ver- sus a predictive goal.
Journal ArticleDOI
Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets
TL;DR: It is suggested that identity-relevant information about reviewers shapes community members' judgment of products and reviews and shows that shared geographical location increases the relationship between disclosure and product sales, thus highlighting the important role of geography in electronic commerce.
Journal ArticleDOI
Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets
TL;DR: In this paper, the authors used a unique data set based on both chronologically compiled ratings as well as reviewer characteristics for a given set of products and geographical location-based purchasing behavior from Amazon, and provided evidence that community norms are an antecedent to reviewer disclosure of identity-descriptive information.
Book
Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models
TL;DR: McCoch as discussed by the authors provides a unified, in-depth, readable introduction to the multipredictor regression methods most widely used in biostatistics: linear models for continuous outcomes, logistic models for binary outcomes, the Cox model for right-censored survival times, repeated-measures models for longitudinal and hierarchical outcomes, and generalized linear model for counts and other outcomes.