scispace - formally typeset
Search or ask a question
Topic

Statistical hypothesis testing

About: Statistical hypothesis testing is a research topic. Over the lifetime, 19580 publications have been published within this topic receiving 1037815 citations. The topic is also known as: statistical hypothesis testing & confirmatory data analysis.


Papers
More filters
Book
10 Mar 1995
TL;DR: In this article, the authors present a matrix algebra review of the linear regression model with a special distribution distribution of functions of random variables sampling theory estimation hypothesis testing prediction and linear regression models.
Abstract: Basic concepts special distributions distributions of functions of random variables sampling theory estimation hypothesis testing prediction the linear regression model other windows on the world. Appendices: matrix algebra review I matrix algebra review II computation statistical tables.

245 citations

Journal ArticleDOI
TL;DR: The statistically principled framework for testing the power-law hypothesis is adapted, a hypothesis test based on the Kolmogorov--Smirnov goodness-of-fit statistic and likelihood ratio tests for comparing against alternative explanations are applied, and the effectiveness of these methods on synthetic binned data with known structure is evaluated.
Abstract: Many man-made and natural phenomena, including the intensity of earthquakes, population of cities and size of international wars, are believed to follow power-law distributions. The accurate identification of power-law patterns has significant consequences for correctly understanding and modeling complex systems. However, statistical evidence for or against the power-law hypothesis is complicated by large fluctuations in the empirical distribution's tail, and these are worsened when information is lost from binning the data. We adapt the statistically principled framework for testing the power-law hypothesis, developed by Clauset, Shalizi and Newman, to the case of binned data. This approach includes maximum-likelihood fitting, a hypothesis test based on the Kolmogorov--Smirnov goodness-of-fit statistic and likelihood ratio tests for comparing against alternative explanations. We evaluate the effectiveness of these methods on synthetic binned data with known structure, quantify the loss of statistical power due to binning, and apply the methods to twelve real-world binned data sets with heavy-tailed patterns.

244 citations

Journal ArticleDOI
TL;DR: The Efron model for accidental bias is used to assess the potential for bias in the estimation of treatment effects due to covariate imbalances and the Blackwell-Hodges model is modified to allow for selection bias only when the investigator is able to discern the next assignment with certainty.

244 citations

Journal ArticleDOI
TL;DR: This tutorial provides an introduction to the hierarchical linear models technique in general terms, and then specifies model notation and assumptions in detail, and elaborate on model interpretation and provide guidelines for model checking.
Abstract: Hierarchical linear models are useful for understanding relationships in hierarchical data structures, such as patients within hospitals or physicians within hospitals. In this tutorial we provide an introduction to the technique in general terms, and then specify model notation and assumptions in detail. We describe estimation techniques and hypothesis testing procedures for the three types of parameters involved in hierarchical linear models: fixed effects, covariance components, and random effects. We illustrate the application using an example from the Type II Diabetes Patient Outcomes Research Team (PORT) study and use two popular PC-based statistical computing packages, HLM/2L and SAS Proc Mixed, to perform two-level hierarchical analysis. We compare output from the two packages applied to our example data as well as to simulated data. We elaborate on model interpretation and provide guidelines for model checking.

244 citations

Posted Content
TL;DR: Abel et al. as discussed by the authors show that a theory which does not generate a complete specification of the regression test is nonetheless to have testable implications, these implications must be robust over the permissible alternative specifications.
Abstract: In most natural sciences (physics, chemistry, biology) theories are validated by controlled experiment. However, in other natural sciences (astronomy, meteorology), and in most social sciences, including economics, the data are characteristically generated not by experiment but by measurement of uncontrolled systems. In economics, theories take the form of restrictions on the models assumed to generate the data, and statistical methods replace experimental controls in testing these restrictions. And here is the difficulty: in economics, particularly macroeconomics, the theory used to derive tests ordinarily does not generate a complete specification of which variables are to be held constant when statistical tests are performed on the relation between the dependent variable and the independent variables of primary interest. Accordingly, in such cases there will be a set of often very different candidate regression-based tests, each of which has equal status with the others since each is based on a different projection of the same underlying multivariate model. Except in the unlikely event that the explanatory variables are mutually orthogonal, the conditional regression coefficients, which generally form the basis for the test statistic, will depend on the conditioning set. We conclude from this that, if a theory which does not generate a complete specification of the regression test is nonetheless to have testable implications, these implications must be robust over the permissible alternative specifications. If the restrictions indicated by the theory are satisfied in some projections, but not in others that have an equal claim to represent implications of the theory, one cannot conclude that the theory has been confirmed. The fact that the observable implications of valid theories must obtain over a broad (but usually incompletely specified) set of regressions rather than for a single regression introduces a large and unavoidable element of imprecision into hypothesis testing in macroeconomics. Generally it appears to be appropriate to weaken the statistical criterion for rejecting theories. Consider, for example, the theory of money demand, which will engage our attention in this paper. The Tobin-Baumol square root formula implies that the elasticity of money demand with respect to the interest rate is exactly one-half. But which interest rate? Should wealth be held constant? Inflation? In view of such uncertainties it would be inappropriate to insist in a literal-minded fashion on rejecting the Tobin-Baumol model if in some regression the measured interest elasticity differed from one-half by more than two standard deviations, and only then. Obviously a more flexible approach is called for. The practice has been to conclude that the statistical evidence is consistent with the Tobin-Baumol model as long as the interest rate coefficient is negative. If it is negative and significant, or negative and insignificantly different from minus one-half, that would provide somewhat stronger confirmation. But a positive coefficient, particularly a significantly positive coefficient, would be viewed as raising questions about the validity of the theory. In macroeconomics generally, as in the money demand application, the typical response to specification uncertainty has been to regard a theory as supported if the signs of the estimated coefficients agree with those expected from theory, and as disconfirmed otherwise. There is no theoretical justification for this procedure, but it seems to be a reasonable course to follow. The point that economic theory ordinarily generates incompletely specified statistical *University of California, Santa Barbara. We have received helpful comments from Andrew Abel, Robert Clower, Michael Darby, Robert Engle, Stephen Goldfeld, David Laidler, Edward Leamer, Robert Lucas, Frederic Mishkin, and Edward Prescott. Thomas Hall provided able research assistance.

243 citations


Network Information
Related Topics (5)
Estimator
97.3K papers, 2.6M citations
88% related
Linear model
19K papers, 1M citations
88% related
Inference
36.8K papers, 1.3M citations
87% related
Regression analysis
31K papers, 1.7M citations
86% related
Sampling (statistics)
65.3K papers, 1.2M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023267
2022696
2021959
2020998
20191,033
2018943