
On the estimation of the influence curve

Abstract

We prove the asymptotic validity of bootstrap confidence bands for the influence curve from its usual estimator (the sensitivity curve). The proof is based on the use of Gill's generalized delta method for Hadamard differentiable operators. Some statistical applications, in particular to the estimation of the asymptotic variance, are given.


Working Paper 92-35
División de Economía
September 1992

Universidad Carlos III de Madrid
Calle Madrid, 126
28903 Getafe (Spain)
Fax (341) 624 9849
ON THE ESTIMATION OF THE INFLUENCE CURVE

Antonio Cuevas and Juan Romo*
Abstract

We prove the asymptotic validity of bootstrap confidence bands for the influence curve from its usual estimator (the sensitivity curve). The proof is based on the use of Gill's (1989) generalized delta method for Hadamard differentiable operators. The scope and applicability of this result are also discussed.

Key words: Influence curve, sensitivity curve, bootstrap confidence bands, Hadamard differentiability.
*Cuevas, Departamento de Matemáticas, Universidad Autónoma de Madrid, Spain; Romo, Departamento de Estadística y Econometría, Universidad Carlos III de Madrid, Spain.

)
~\
)
,
j

1. INTRODUCTION AND BACKGROUND
It is well-known that, in many cases of practical interest, the estimators can be considered as restrictions of functionals defined on the space 𝓕 of distribution functions. In fact, this idea goes back to the origins of mathematical statistics, since it is implicit in the early notion of consistency proposed by Fisher. In precise terms, let T_n = T_n(X_1, ..., X_n) be (for all n = 1, 2, ...) an estimator taking values in ℝ, defined on random samples X_1, ..., X_n from a univariate distribution. The sequence {T_n} is said to be generated by a functional T : 𝓕_0 ⊂ 𝓕 → ℝ if, for all n and for each sample X_1, ..., X_n, we have T_n(X_1, ..., X_n) = T(F_n), where F_n is the empirical distribution associated with X_1, ..., X_n. Many usual estimators fulfil this condition; this is the case, for instance, of M- and L-estimators [see, e.g., Huber (1981)]. By 𝓕_n we will represent the set of empirical distributions of order n in 𝓕, that is, the set of discrete probability measures in 𝓕 whose atoms have probabilities equal to 1/n or to a multiple of 1/n. Obviously, the domain 𝓕_0 of T has to include 𝓕_n for all n ∈ ℕ.
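As a concrete numerical illustration (not from the paper; a minimal Python sketch with hypothetical helper names `T_mean` and `empirical`), the sample mean is the mean functional T(F) = ∫ x dF restricted to empirical distributions:

```python
import numpy as np

def T_mean(atoms, probs):
    """Mean functional T(F) = integral of x dF(x), evaluated on a discrete F."""
    return float(np.sum(np.asarray(atoms) * np.asarray(probs)))

def empirical(sample):
    """Empirical distribution F_n: one atom at each observation, mass 1/n each."""
    x = np.asarray(sample, dtype=float)
    return x, np.full(len(x), 1.0 / len(x))

sample = [1.0, 2.0, 3.0, 6.0]
atoms, probs = empirical(sample)

# T_n(X_1, ..., X_n) = T(F_n): the sample mean is the mean functional
# applied to the empirical distribution.
assert T_mean(atoms, probs) == np.mean(sample)
```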
In this setting, a natural idea is to use the differentiability properties of the functional T in order to get statistical results for the sequence {T_n}. The works of von Mises (1947) and Kallianpur and Rao (1955) are pioneering contributions on this topic but, in fact, the use of differentiation techniques only became really popular in the late sixties, coinciding with the rapid development of robustness theory. An important example is the so-called influence function T'(F; x) (of a functional T at a distribution F ∈ 𝓕), which is nothing but the partial derivative of T along the direction corresponding to the degenerate distribution δ_x (for each x), that is,

T'(F; x) = lim_{ε→0+} [T((1 − ε)F + ε δ_x) − T(F)] / ε

[see Hampel (1974), Hampel et al. (1987)].
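The limit above can be approximated directly by taking a small ε (a minimal Python sketch, not from the paper; `influence_approx` is a hypothetical helper). For the mean functional the difference quotient is exact for any ε, because the mean is linear in F:

```python
import numpy as np

def T_mean(atoms, probs):
    """Mean functional T(F) = integral of x dF(x) on a discrete F."""
    return float(np.sum(np.asarray(atoms) * np.asarray(probs)))

def influence_approx(T, atoms, probs, x, eps=1e-6):
    """Finite-eps version of T'(F; x) = lim [T((1-eps)F + eps*delta_x) - T(F)] / eps."""
    atoms2 = np.append(atoms, x)                       # add the contaminating atom at x
    probs2 = np.append((1 - eps) * np.asarray(probs), eps)
    return (T(atoms2, probs2) - T(atoms, probs)) / eps

sample = np.array([1.0, 2.0, 3.0, 6.0])   # mean(F) = 3.0
probs = np.full(4, 0.25)

# For the mean, the influence curve is T'(F; x) = x - mean(F).
val = influence_approx(T_mean, sample, probs, x=10.0)  # ≈ 10 - 3 = 7
```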
If we assume that the sequence {T_n} of estimators generated by T is consistent, in probability under G (for each G), to T(G), then ε T'(F; x) gives (for small values of ε) the approximate asymptotic bias introduced by a contamination of type (1 − ε)F + ε δ_x at the distribution F. Some quantitative measures of robustness (gross-error sensitivity, local-shift sensitivity, rejection points) are also defined from the influence curve.
However, in order to get a deeper insight into the meaning of the influence function, we need to impose (on T) further differentiability assumptions, stronger than the mere existence of T'(F; x). This situation is similar to that of classical analysis for functions f : ℝ^p → ℝ: the true significance of the gradient ∇f (which is the analog of the influence function) arises when we assume that f is differentiable since, in this case, ∇f defines the best local linear approximant of f.
The general concept of differentiability, for operators or functionals, is inspired by the same idea: let 𝓖 and 𝓓 be normed spaces and let V : 𝓖 → 𝓓 be an operator. We will say that V is differentiable at G ∈ 𝓖, with respect to a collection 𝓢 of subsets of 𝓖, if there exists a linear and continuous map DV(G; ·) : 𝓖 → 𝓓 (which we will call the differential of V at G) such that, for Δ in some neighbourhood of zero,

V(G + Δ) = V(G) + DV(G; Δ) + R(G + Δ),

where the remainder R satisfies

lim_{t→0} R(G + tΔ)/t = 0,

uniformly in Δ ∈ S, for every S ∈ 𝓢.
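To see the definition at work on a nonlinear functional (a numerical sketch, not from the paper; all names are hypothetical), take T(F) = (∫ x dF)² on discrete signed measures. Its candidate differential at F in the direction Δ is DT(F; Δ) = 2(∫ x dF)(∫ x dΔ), and the remainder divided by t vanishes linearly in t:

```python
import numpy as np

def moment(atoms, weights):
    """First moment of a discrete signed measure: sum of x * w(x)."""
    return float(np.sum(np.asarray(atoms) * np.asarray(weights)))

def T(atoms, weights):
    """Nonlinear functional T(F) = (integral of x dF)^2."""
    return moment(atoms, weights) ** 2

# A distribution F and a direction Delta = H - F (signed measure, total mass 0)
f_atoms = np.array([0.0, 1.0, 2.0]); f_w = np.array([0.3, 0.4, 0.3])
h_atoms = np.array([0.0, 5.0]);      h_w = np.array([0.5, 0.5])
d_atoms = np.concatenate([h_atoms, f_atoms])
d_w     = np.concatenate([h_w, -f_w])

mu_F = moment(f_atoms, f_w)                  # integral of x dF = 1.0
DT = 2 * mu_F * moment(d_atoms, d_w)         # DT(F; Delta) = 2 mu_F * integral of x dDelta

def remainder_over_t(t):
    """R(F + t*Delta)/t, where R = T(F + t*Delta) - T(F) - t*DT(F; Delta)."""
    atoms = np.concatenate([f_atoms, d_atoms])
    w = np.concatenate([f_w, t * d_w])       # weights of F + t*Delta
    return (T(atoms, w) - T(f_atoms, f_w) - t * DT) / t

# Halving t halves R/t, so R/t -> 0 and DT is indeed the differential.
r1, r2 = remainder_over_t(1e-3), remainder_over_t(5e-4)
```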
The most interesting particular cases correspond to the following choices of 𝓢: 𝓢 = all singletons of 𝓖, 𝓢 = all compact subsets of 𝓖, and 𝓢 = all bounded subsets of 𝓖. They lead, respectively, to the concepts of Gâteaux, Hadamard (or compact) and Fréchet differentiability.
The application of these concepts (borrowed from functional analysis) to statistical functionals T : 𝓕 → ℝ presents an obvious hurdle: 𝓕 is not a normed space. A simple device to overcome this difficulty is to embed 𝓕 in the space 𝓖 = {λ(F − H) : F, H ∈ 𝓕, λ ∈ ℝ}, endowed with the supremum norm.
The statistical functionals can often be extended in a natural way to the space 𝓖 (or to appropriate subspaces of it). In such cases the use of the above notions of differentiability is a very useful tool which allows one to consider the influence function from a different perspective. Moreover, if the functional T is Fréchet (or Hadamard) differentiable at F and the differential can be expressed in the form

DT(F; Δ) = ∫_{−∞}^{+∞} Ψ(x) dΔ(x),

then it is not difficult to prove [see Boos and Serfling (1980)] that Ψ(x) coincides with the influence curve, and the sequence {T_n} of estimators generated by T is asymptotically normal with asymptotic variance

∫_{−∞}^{+∞} T'(F; x)² dF(x).
This is, perhaps, the most important point in connection with the influence curve: under standard conditions the asymptotic variance can be expressed in terms of T'(F; x). In particular, estimates of the influence curve are potentially useful in the estimation of the asymptotic variance [see Presedo (1991)].
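For instance (a minimal Python sketch, not from the paper), plugging the empirical influence values of the mean into ∫ T'(F; x)² dF(x) recovers the usual plug-in variance estimate:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=3.0, size=2000)

# Empirical influence values for the mean: T'(F_n; x_i) = x_i - mean(F_n)
ic = x - x.mean()

# Plug-in estimate of the asymptotic variance: integral of T'(F; x)^2 dF(x),
# approximated by the average of the squared influence values.
avar = np.mean(ic ** 2)

# For the mean this coincides with the (biased) sample variance.
assert np.isclose(avar, np.var(x))
```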
The choice between the Hadamard and Fréchet differentials in each particular application is usually guided by technical considerations. In general terms, the Fréchet differential is more natural and easier to handle. Some applications can be found in Kallianpur and Rao (1955), Boos and Serfling (1980), Clarke (1986), Parr (1985) and Arcones and Giné (1992). Nevertheless, compact differentiation has, in principle, a broader applicability since it imposes a weaker (less restrictive) condition; it is, in fact, the weakest notion of differential which is still manageable in the sense of fulfilling the chain rule. For applications, see Fernholz (1983), Esty et al. (1985) and Gill (1989).
In this paper we use the Hadamard differential to prove (in Section 2 below) the validity of bootstrap confidence bands for the standard estimator of the influence curve. The basic tools used in the proof are the results on the bootstrap of empirical processes [see Giné and Zinn (1990)] and the generalized delta method established by Gill (1989). Section 3 contains some final remarks.
2. BOOTSTRAP CONFIDENCE BANDS FOR THE INFLUENCE CURVE
We consider now the problem of estimating the influence curve T'(F; x) from a random sample X_1, ..., X_n of F. Three estimators have been considered in the literature: the sensitivity curve, the empirical influence curve, and the jackknife approximation [see Hampel et al. (1987), p. 92]. The first one is perhaps the most popular: it is defined by

SC_n(x) = [T((1 − 1/n)F_{n−1} + (1/n)δ_x) − T(F_{n−1})] / (1/n).
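The sensitivity curve, and a bootstrap sup-norm band around it, can be sketched as follows (a minimal Python illustration with hypothetical helpers `sensitivity_curve` and `bootstrap_band`; the band construction only mimics the resampling idea studied here and is not the paper's exact procedure):

```python
import numpy as np

def sensitivity_curve(T, sample, xs):
    """SC_n(x) = [T((1-1/n) F_{n-1} + (1/n) delta_x) - T(F_{n-1})] / (1/n),
    i.e. n times (T of the sample with x appended, minus T of the sample)."""
    s = np.asarray(sample, dtype=float)
    n = len(s) + 1
    return np.array([n * (T(np.append(s, x)) - T(s)) for x in xs])

def bootstrap_band(T, sample, xs, level=0.95, B=200, seed=0):
    """Sketch of a bootstrap band: resample the data, recompute SC*_n on each
    bootstrap sample, and take a quantile of sup_x |SC*_n(x) - SC_n(x)|."""
    rng = np.random.default_rng(seed)
    s = np.asarray(sample, dtype=float)
    sc = sensitivity_curve(T, s, xs)
    sups = [np.max(np.abs(sensitivity_curve(T, rng.choice(s, size=len(s), replace=True), xs) - sc))
            for _ in range(B)]
    half = float(np.quantile(sups, level))
    return sc - half, sc + half

sample = np.random.default_rng(1).normal(size=50)
xs = np.linspace(-3.0, 3.0, 13)

# For T = mean, SC_n(x) reduces to x - mean of the sample.
sc = sensitivity_curve(np.mean, sample, xs)
lo, hi = bootstrap_band(np.mean, sample, xs)
```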
Curiously enough, the asymptotic properties of this estimate have received little attention; maybe the reason is that the influence curve is often used for descriptive aims, in order to get a general idea of the behavior of the sequence

References

Hampel, F. R., Ronchetti, E. M., Rousseeuw, P. J. and Stahel, W. A. Robust Statistics: The Approach Based on Influence Functions.
Pollard, D. Convergence of Stochastic Processes.
Hampel, F. R. The Influence Curve and Its Role in Robust Estimation.
Hall, P. The Bootstrap and Edgeworth Expansion.

The authors will use bootstrap methodology, that is, the authors will approximate the distribution of Dn under F by that of its bootstrap versionD~ = sup,Jn ISC~(x) - SCn(x) I,:r; under Fn, where SC~(x) denotes the sensitivity curve SCn(x) calculated from the bootstrap sample Xi, ... , X~_l (whose empirical distribution is 'J represented by F:_ The author), which is drawn by resampling from the original data Xl,'" ,Xn - l .