Open Access
Multivariate Analysis from a Statistical Point of View
Kyle Cranmer
- pp 211
TLDR
In this paper, the Neyman-Pearson theory is translated into the language of statistical learning theory, and a formalism for a learning machine is introduced which is general enough to encompass all of the techniques used within high energy physics.
Abstract:
Multivariate Analysis is an increasingly common tool in experimental high energy physics; however, most of the common approaches were borrowed from other fields. Each of these algorithms was developed for its own particular task, and thus they look quite different at their core. It is not obvious that what these different algorithms do internally is optimal for the tasks which they perform within high energy physics. It is also quite difficult to compare these different algorithms due to the differences in the formalisms that were used to derive and/or document them. We clarify what the goal of a multivariate algorithm should be for the search for a new particle and compare different approaches, and we translate the Neyman-Pearson theory into the language of statistical learning theory. In Section 2 we introduce a formalism for a Learning Machine, which is general enough to encompass all of the techniques used within high energy physics. In Sections 3 & 4 we review the statistical statements relevant to new particle searches and translate them into the formalism of statistical learning theory. In the remainder of the note, we look at the main results of statistical learning theory and their relevance to some of the common algorithms used within high energy physics.
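The translation the abstract describes rests on the Neyman-Pearson lemma: at a fixed background efficiency, cutting on the likelihood ratio p(x|signal) / p(x|background) gives the highest signal efficiency of any selection. A minimal sketch of such a selection, assuming hypothetical one-dimensional Gaussian signal and background densities (the means, widths, and function names below are illustrative choices, not from the paper):

```python
import math

def gaussian_pdf(x, mu, sigma):
    """Normal density; stands in for the true signal/background densities."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def likelihood_ratio(x, mu_s=1.0, mu_b=0.0, sigma=1.0):
    """Neyman-Pearson test statistic: p(x | signal) / p(x | background)."""
    return gaussian_pdf(x, mu_s, sigma) / gaussian_pdf(x, mu_b, sigma)

def select(x, threshold=1.0):
    """Accept an event as signal-like when the likelihood ratio exceeds the cut.

    Varying `threshold` traces out the trade-off between signal efficiency
    and background rejection; the lemma says no other statistic does better.
    """
    return likelihood_ratio(x) > threshold
```

In practice the densities are unknown, which is why the note compares multivariate algorithms by how well they approximate (a monotonic function of) this ratio rather than by their internal machinery.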
Citations
Journal ArticleDOI
PhysicsGP: A Genetic Programming approach to event selection
Kyle Cranmer,R. Sean Bowman +1 more
TL;DR: A novel multivariate classification technique based on Genetic Programming is presented that optimizes a set of human-readable classifiers with respect to a user-defined performance measure, offering several advantages over Neural Networks and Support Vector Machines.
Journal ArticleDOI
Estimating a Signal In the Presence of an Unknown Background
Wolfgang Rolke,Alan D. Lopez +1 more
TL;DR: In this paper, the authors describe a method for fitting distributions to data which only requires knowledge of the parametric form of either the signal or the background but not both, and the unknown distribution is fit using a non-parametric kernel density estimator.
References
Book
The Nature of Statistical Learning Theory
TL;DR: Setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?
Book
Genetic Programming: On the Programming of Computers by Means of Natural Selection
TL;DR: This book discusses the evolution of architecture, primitive functions, terminals, sufficiency, and closure, and the role of representation and the lens effect in genetic programming.
Journal ArticleDOI
Multivariate Density Estimation: Theory, Practice, and Visualization
TL;DR: Representation and Geometry of Multivariate Data.