scispace - formally typeset
Open AccessJournal ArticleDOI

The Detection of Influential Subsets in Linear Regression by Using an Influence Matrix

TLDR
In this article, a new method to identify influential subsets in linear regression problems is presented, which uses the eigenstructure of an influence matrix, defined as the matrix of uncentred covariances of the effect on the whole data set of deleting each observation, normalized to include the univariate Cook statistics on the diagonal.
Abstract
This paper presents a new method to identify influential subsets in linear regression problems. The procedure uses the eigenstructure of an influence matrix which is defined as the matrix of uncentred covariances of the effect on the whole data set of deleting each observation, normalized to include the univariate Cook statistics on the diagonal. It is shown that the eigenstructure of the influence matrix is useful to identify influential subsets and a procedure for detecting influential sets is proposed. The method is illustrated with two examples

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Outlier Detection Using Nonconvex Penalized Regression

TL;DR: In this paper, a thresholding based iterative procedure for outlier detection (Θ-IPOD) was proposed to identify outliers and estimate regression coefficients, which is based on hard thresholding and soft thresholding.
Posted Content

Outlier Detection Using Nonconvex Penalized Regression

TL;DR: A thresholding based iterative procedure for outlier detection (Θ–IPOD) based on hard thresholding correctly identifies outliers on some hard test problems and is much faster than iteratively reweighted least squares for large data, because each iteration costs at most O(np) (and sometimes much less), avoiding an O( np2) least squares estimate.
Journal ArticleDOI

Outliers in multilevel data

TL;DR: This article developed several techniques for data exploration for outliers and outlier analysis and then applied these to the detailed analysis of outliers in two large scale multilevel data sets from educational contexts.
Journal ArticleDOI

Self-organizing maps for outlier detection

TL;DR: This paper addresses the problem of multivariate outlier detection using the (unsupervised) self-organizing map (SOM) algorithm introduced by Kohonen with empirical results reported on both artificial and real data.
Journal ArticleDOI

A General Method for Local Sensitivity Analysis With Application to Regression Models and Other Optimization Problems

TL;DR: A method for sensitivity analysis of general applicability based on the well-known duality property of mathematical programming, which states that the partial derivatives of the primal objective function with respect to the constraints on the right side parameters are the negative of the optimal values of the dual problem variables.
References
More filters
Book

Robust Regression and Outlier Detection

TL;DR: This paper presents the results of a two-year study of the statistical treatment of outliers in the context of one-Dimensional Location and its applications to discrete-time reinforcement learning.
Journal ArticleDOI

Unmasking Multivariate Outliers and Leverage Points

TL;DR: This work proposes to compute distances based on very robust estimates of location and covariance, better suited to expose the outliers in a multivariate point cloud, to avoid the masking effect.
Frequently Asked Questions (1)
Q1. What have the authors contributed in "The detection of influential subsets" ?

In this paper, a new method to identify infiuential subsets in linear regression problems is presented, which uses the eigenstructure of an infiuence matrix which is defined as the matrix defined by uncentered covariance.