What is the advantage of the technique used?

An advantage of the technique used (discriminant analysis) is that individual bands are selected by the method, showing that high order noisy MNF bands contain signal that impacts applications such as classification.

What is the simplest way to estimate the noise in MNF coordinates?

In order to study the effects of filtration on SNRs in MNF coordinates, a estimate suggested by Schowengerdt is used:SNR = σ2S σ2N ,where σ2S is the variance of the signal and σ 2 N is the variance of the noise [15].

What is the definition of adaptive filter?

An adaptive filter can alter the size of the filter kernel (spatial domain) or change the frequencies filtered (frequency domain) depending on image characteristics and noise levels.

What is the eigenvalue of a b-band image?

The eigenvalues contained in Λ are the estimated variance of the signal (σS) divided by the estimated variance of the noise (σN ), and therefore the diagonal element λb in Λ is an approximation for the SNR of band b in the transformed image.

When was the AVIRIS 224-band imagery acquired?

Three flight-lines of Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) 224-band imagery were acquired in the winter of 1999.

What are the main reasons why hyperspectral images are noisy?

these images tend to be noisy as a result of the fine discretization and other factors such as the method of acquisition (small aircraft) [1].

(Open Access) An Adaptive Noise-Filtering Algorithm for AVIRIS Data With Implications for Classification Accuracy (2009) | Rhonda D. Phillips

Q: What are the contributions in "An adaptive noise filtering algorithm for aviris data with implications for classification accuracy" ?

This paper describes a new algorithm used to adaptively filter a remote sensing dataset based on signal-to-noise ratios ( SNRs ) once the maximum noise fraction ( MNF ) has been applied. The proposed adaptive filters are applied to a hyperspectral image generated by the AVIRIS sensor, and results are given for the identification of three different pine species located within the study area.

Q: How do you reduce the variance of a small window?

Spatial median filters work by decreasing the variance within a small window (kernel) by assigning a pixel the median value of the surrounding pixels.

An Adaptive Noise Filtering Algorithm for AVIRIS Data

with Implications for Classiﬁcation Accuracy

Rhonda D. Phillips Student Member, IEEE, Christine E. Blinn Member, IEEE,

Layne T. Watson, Fellow, IEEE, and Randolph H. Wynne, Member, IEEE

Abstract—This paper describes a new algorithm used to adaptively ﬁlter a remote sensing dataset based on

signal-to-noise ratios (SNRs) once the maximum noise fraction (MNF) has been applied. This algorithm uses

Hermite splines to calculate the approximate area underneath the SNR curve as a function of band number, and

that area is used to place bands into “bins” with other bands having similar SNRs. A median ﬁlter with a

variable sized kernel is then applied to each band, with the same size kernel used for each band in a particular

bin. The proposed adaptive ﬁlters are applied to a hyperspectral image generated by the AVIRIS sensor, and

results are given for the identiﬁcation of three different pine species located within the study area. The adaptive

ﬁltering scheme improves image quality as shown by estimated SNRs, and classiﬁcation accuracies improved

by more than 10% on the sample study area, indicating that the proposed methods improve the image quality,

thereby aiding in species discrimination.

I. Background

Hyperspectral images provide a powerful tool as the wave spectrum is ﬁnely discretized using hundreds of channels

on a scanner. The large dimensionality of a hyperspectral dataset often requires a data transformation such as principal

components analysis (PCA) or the singular value decomposition (SVD) to reduce the number of variables, or bands,

within an image prior to further processing. Furthermore, these images tend to be noisy as a result of the ﬁne

discretization and other factors such as the method of acquisition (small aircraft) [1]. Green et al. ﬁrst proposed the

maximum noise transform (alternately called the minimum noise transform, minimum noise fraction, or MNF) to align

a dataset in order of decreasing signal-to-noise ratio (SNR) using an eigenvalue decomposition similar to PCA [2]. Lee

et al. equivalently deﬁned the MNF (or noise adjusted PCA) as two PCA transformations, and used the MNF to reduce

the noise level in an image [3]. The MNF can be used to reduce noise and the number of dimensions in an image.

Reduction in noise in imagery is essential to many remote sensing applications, as Landgrebe has documented the

relationship between noise in imagery and classiﬁcation errors [4]. Furthermore, certain applications require a minimum

SNR, for example, estimating foliar biochemical concentrations [5].

Noise can be reduced using a variety of ﬁlters deﬁned on the frequency or spatial domains [6]. While certain

frequency domain ﬁlters (using the Fourier transform) have been shown to be more effective than spatial ﬁlters with

respect to speciﬁc types of noise, a spatial ﬁlter such as a median ﬁlter can produce similar results and requires

signiﬁcantly less computation [7]. An adaptive ﬁlter can alter the size of the ﬁlter kernel (spatial domain) or change

the frequencies ﬁltered (frequency domain) depending on image characteristics and noise levels. Lennon et al. used an

adaptive median ﬁlter on data transformed to MNF coordinates [8], and Pok et al. vary the kernel size between three

and ﬁve depending on the detected noise in a particular window in a three-band image [9]. King et al. present an

adaptive frequency domain ﬁlter used on medical imagery [10].

The properties of the MNF are well suited to an adaptive ﬁlter, yet adaptive spatial ﬁltering is not commonly used

on MNF transformed data, although the idea was proposed by [8]. Typical data processing using the MNF truncates

the data, resulting in loss of signal; uses all bands in the MNF coordinate system without noise removal, or applies a

spatial convolution with a uniform kernel size across all bands despite all bands having drastically different SNRs. This

paper introduces an algorithm for an adaptive median ﬁlter applied to data transformed using the MNF in which the

ﬁlter support size varies with noise. To demonstrate the effectiveness of this technique, a real dataset is ﬁltered using

R.D. Phillips and L.T. Watson are with the Departments of Computer Science and Mathematics, Virginia Polytechnic Institute and State University,

Blacksburg, VA 24061.

C.E. Blinn and R.H. Wynne are with the Department of Forestry, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061.

the algorithm presented in this paper, and the dataset is subsequently used to identify three different species of pines

within a forest, an application for which the high spectral resolution imagery is well-suited.

II. Maximum Noise Fraction

Data transformations such as PCA and SVD transform an image to a new coordinate system without taking factors

such as noise into consideration [2]. PCA uses the eigenvectors (V ) resulting from an eigen decomposition:

Σ = V ΛV

−1

where Σ is the covariance matrix of the image, and Λ is a diagonal matrix containing the eigenvalues corresponding

to V , as a new coordinate basis for the image. This transformed image has the property that each successive band is

aligned along an axis of decreasing overall variance in the original image; that is, as the component number increases,

the variance within the component decreases. When PCA is used for data reduction, ideally these higher order bands

with decreasing variances are not necessary to represent the majority of the original image, and these components

can be removed, resulting in a data reduction. Unfortunately with datasets that are particularly noisy (the case with

hyperspectral data), the ﬁrst few components are not sufﬁcient to represent the image as they capture much of the noise

as well as the signal. The MNF is similar in spirit to PCA with the additional quality that it considers image noise

when selecting a new coordinate system. While the PCA aligns the axes along directions of the maximum variance in

the original image, the MNF aligns the axes along directions of the maximum SNR.

Theoretically, the MNF orders the data along the axes of maximum SNR using the eigen decomposition:

−1

= V ΛV

−1

where Σ

is the covariance matrix of the signal, Σ

is the covariance matrix of the noise, V is an (orthogonal) matrix

containing the eigenvectors of Σ

−1

, and Λ is a diagonal matrix containing eigenvalues that correspond to V . V

provides the basis for the transformed dataset. In practice, Σ

and Σ

are unknown and must be estimated from the

data [2]. Σ

is generally taken to be the covariance matrix of the image, and Σ

can be estimated using various

procedures [2][3]. The eigenvalues contained in Λ are the estimated variance of the signal (σ

) divided by the estimated

variance of the noise (σ

), and therefore the diagonal element λ

in Λ is an approximation for the SNR of band b in

the transformed image.

III. Adaptive Filter

The MNF is commonly used in remote sensing for data reduction and noise removal. The MNF of an image can be

truncated while still preserving most of the information within the image, which is especially useful in the hyperspectral

image processing domain as images contain hundreds of highly correlated bands and noise. The higher order bands that

are truncated commonly contain very low SNRs, and truncating the MNF can have the added effect of eliminating much

of the noise without losing much signal. Determining the precise location to truncate the MNF is problematic, and a

judgement call is often made by looking at a plot of the eigenvalues relative to the band number and determining where

this eigenvalue curve begins to approach an asymptote (λ = 1). In practice, this truncation is performed as a means

of reducing the overall noise within the image, but this method does not fully take advantage of the properties of the

MNF. If the truncation includes too many bands, too much noise is left in the image, and if the truncation includes too

few bands, useful signal may be excluded from the resulting image. A likely scenario would be that truncation includes

noise in the bands that are kept while discarding good signal with the higher order bands that are discarded.

Green et al. suggest that with low SNR bands, all values can be replaced with the mean of the band, and the MNF

image can be retransformed to the original subspace, resulting in a less noisy image [2]. This is an example of a rather

extreme mean ﬁlter. Another approach to reducing the noise in an image is to apply a small (typically a three by three

window) spatial ﬁlter such as a mean or median ﬁlter. However, applying a ﬁlter uniformly to all bands within the MNF

will not take advantage of the speciﬁc ordering of the bands. Bands with lower SNRs might beneﬁt from a ﬁlter with a

larger window, while bands with high SNRs require little or no ﬁltration. Bands with low SNRs have comparatively

low signal relative to noise, yet may have enough signal to warrant smoothing of the noise. A large ﬁlter will degrade

that signal, but will hopefully affect the noise more, resulting in a greater signal relative to noise.

Spatial median ﬁlters work by decreasing the variance within a small window (kernel) by assigning a pixel the

median value of the surrounding pixels. For example, using a 3 × 3 window, a median ﬁlter would assign a pixel the

median value of itself and its eight immediate neighbors (top, left, right, bottom, and four diagonal locations). As

geographic data is highly correlated, the variance of the signal within such a window should be small, and noise should

be random and not correlated within a neighborhood, making a large variance probable. With the assumption that the

2 3

4 5

fHxL

Fig. 1. Typical MNF eigenvalue curve shape.

2 3

4 5

fHxL

2 3

4 5

fHxL

Fig. 2. Dividing area under curve into bins.

variance of the noise is larger than that of the signal in these small windows, a spatial ﬁlter such as a median ﬁlter will

preserve most of the signal while eliminating much of the noise. A ﬁlter with a larger window has a more dramatic

smoothing effect over a ﬁlter with a small window, resulting in a larger SNR at the expense of the signal. A median

ﬁlter has the property of preserving original values unlike a mean ﬁlter.

The MNF is ordered such that for any two bands numbered m and n (assume n > m), SNR

≥ SNR

. Recall

that the eigenvalues associated with the MNF are estimates of SNR, meaning λ

≥ λ

. Based on the reasoning offered

above, the size K of the ﬁlter kernel for band m should not be greater than that for band n, K

≤ K

. Similarly, the

same size ﬁlter should be applied to bands with the most similar eigenvalues.

Consider the shape of the typical MNF eigenvalue curve, shown in Fig. 1. The ﬁrst few bands with the largest

decrease in slope should be grouped together in smaller groups than the last bands with very similar small negative

slopes. In order to divide the bands into bins in this manner, the area underneath the eigenvalue curve can be divided

evenly into a number of bins corresponding to the number of different sized ﬁlters to be applied, as shown in Fig. 2.

The three colors represent three different bins and three different kernel sizes. A formal algorithm for the adaptive ﬁlter

(AF) is given below.

Algorithm AF(M, Λ, nb)

input/output:

M (image transformed to MNF coordinates,

ﬁltered upon exit)

input:

Λ (B eigenvalues where B is number of bands in M )

nb (number of bins to use)

1 begin

2 approximate f (b) = λ with C(b) = λ

3 area :=

C(b)db

4 area

per bin :=

area

5 sare a := 0

6 for i := 1 step 1 until B − 1 do

7 begin

8 sare a := sarea +

i+1

C(b)db

9 bin := ceiling



sarea

area per bin



10 kernel

:= 2 · (bin − 1) + 1

11 end

12 kernel

:= ker nel

B−1

13 for i := 1 step 1 until B do

14 apply kernel

× kernel

median ﬁlter

to band

in image M

15 end

Calculating the area under the curve will require a function to approximate the eigenvalues as a function of band

number, as indicated in line 2 of the above algorithm. A description of Hermite splines, which are recommended given

their suitability to this particular application, is included in Section 3.1. The assignment of bins occurs in line 9, and

warrants further explanation. Starting with the ﬁrst band, the area under the curve is calculated as

C(b)db.

The total area under the curve through band i is therefore

i+1

C(b)db.

The results of previously calculated integrals are stored in sarea to prevent redundant calculations. Taking the ceiling

of the cumulative area under the curve divided by the area per bin results in bands one through B − 1 being placed in

bins one through nb, and band B is placed in the same bin as B − 1. Line 10 continues with the conversion of a bin

number to the size of a spatial ﬁlter kernel that corresponds to bin number. The bands in bin one should have no ﬁlter

(equivalent to a kernel of size one) applied, and the bands in bin two should have a 3 × 3 ﬁlter applied.

This approach is valid for convex eigenvalue curves that are similarly shaped to Fig. 1, which is usually the case.

The properties of the MNF dictate that the eigenvalue function is strictly decreasing, but in the event that the eigenvalue

function is not convex, dividing up the area under the curve of the derivative of the function will group the most similar

eigenvalues and their corresponding bands together. Consider ﬁnding the area under the curve of the derivative:

′

(x)dx = f (b) − f(a)

according to the fundamental theorem of calculus. This calculation requires no approximation of the function as the

actual functions’ values can be used. Because the eigenvalue function is monotonically decreasing, the area underneath

the curve will be negative, and therefore the area will be negated to produce a positive result necessary for bin

determination. The above algorithm is modiﬁed to produce the following adaptive ﬁlter with derivative (AFD) algorithm

using the area underneath the curve of the derivative to determine the location of bins.

Algorithm AFD(M, Λ, nb)

input/output:

M (image transformed to MNF coordinates,

ﬁltered upon exit)

input:

Λ (B eigenvalues where B is number of bands in M )

nb (number of bins to use)

1 begin

2 area := Λ(1) − Λ(B)

3 area pe r bin :=

area

4 sare a := 0

5 for i := 1 step 1 until B − 1 do

6 begin

7 sare a := sarea + Λ(i) − Λ(i + 1)

8 bin := ceiling



sarea

area per bin



9 kernel

:= 2 · (bin − 1) + 1

10 end

11 kernel

:= ker nel

B−1

12 for i := 1 step 1 until B do

13 apply kernel

× kernel

median ﬁlter

to band

in image M

14 end

Finally, either of the above variations on the adaptive ﬁltering algorithm may be used on a particular range of

bands. For example, if prior knowledge or analysis of the MNF transformed dataset indicates that there is no usable

signal beyond a speciﬁc band, the MNF image can still be truncated and ﬁltered adaptively. The value of B would be

changed from the total number of bands in the image to the number of bands desired after truncation. This is different

from simply truncating the MNF because the bands that are kept would be ﬁltered to decrease the noise, and the number

of bands kept could be larger to ensure that very little signal is lost in the truncation.

A. Hermite Splines

The ﬁltering algorithm requires a function that approximates the eigenvalue curve generated by the MNF. Cubic

splines are piecewise cubic polynomials that produce a visually appealing curve and interpolate a given set of points. In

particular, Hermite cubic splines have only one continuous derivative (standard cubic splines have two) and produce a

monotone cubic spline curve interpolating a monotonic function, rendering this type of spline ideal for interpolating the

monotonic SNR curve. The Hermite cubic spline C(x) is composed of 2n basis functions, c

(x), ˆc

(x), i = 1, . . ., n,

where n is the number of interpolation points. The function

C(x) =

i=1

(x) + d

ˆc

(x)

interpolates the points (x

, y

), i = 1, . . . , n if

) = 1, c

) = 0, j 6= i,

ˆc

) = 0, for all i, j.

Furthermore,

′

) = 0, for all i, j,

ˆc

′

) = 1, ˆc

′

) = 0, j 6= i,

making

′

) = d

Only x

, y

, and d

, i = 1, . . ., n are required to deﬁne a Hermite cubic spline, and the d

are chosen to make C(x)

monotone (theoretically always possible for monotone data y

). Refer to [11] for a more detailed description of Hermite

cubic splines including deﬁnitions of the basis functions c

), ˆc

The derivative and the deﬁnite integral of Hermite cubic splines can be easily obtained as the cubic polynomials

(and basis functions) are easily differentiated or integrated analytically. Included in [11] is a set of subroutines designed

to deﬁne, evaluate, and integrate Hermite cubic splines, PCHEZ, PCHEV, and PCHQA, respectively. PCHEZ deﬁnes

continuous derivatives, d

, that result in a visually appealing function, PCHEV evaluates the function and the derivative at

a set of points, and PCHQA returns the deﬁnite integral of the function between two points, a and b.

An Adaptive Noise-Filtering Algorithm for AVIRIS Data With Implications for Classification Accuracy

Figures

Citations

The use of remote sensing in soil and terrain mapping — A review

Recent Advances on Spectral–Spatial Hyperspectral Image Classification: An Overview and New Guidelines

Communication systems: An introduction to signals and noise in electrical communication

Hyperspectral Image Classification Based on Structured Sparse Logistic Regression and Three-Dimensional Wavelet Texture Features

Dimension Reduction Using Spatial and Spectral Regularized Local Discriminant Embedding for Hyperspectral Image Classification

References

A transformation for ordering multispectral data in terms of image quality with implications for noise removal

Remote sensing, models, and methods for image processing

Numerical methods and software

Enhancement of high spectral resolution remote-sensing data by a noise-adjusted principal components transform

Communication systems: an introduction to signals and noise in electrical communication

Related Papers (5)

Spectral and Spatial Classification of Hyperspectral Data Using SVMs and Morphological Profiles

Classification of hyperspectral remote sensing images with support vector machines

A transformation for ordering multispectral data in terms of image quality with implications for noise removal

Composite kernels for hyperspectral image classification

Imaging Spectroscopy and the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS)

Frequently Asked Questions (10)

Q1. What are the contributions in "An adaptive noise filtering algorithm for aviris data with implications for classification accuracy" ?

Q2. What is the important reason for the large dimensionality of a hyperspectral dataset?

Q3. What is the advantage of the technique used?

Q4. What is the simplest way to estimate the noise in MNF coordinates?

Q5. What is the definition of adaptive filter?

Q6. What is the eigenvalue of a b-band image?

Q7. When was the AVIRIS 224-band imagery acquired?

Q8. How do you reduce the variance of a small window?

Q9. What are the main reasons why hyperspectral images are noisy?

Q10. How many bins can be divided into the MNF?