3D Invariants with High Robustness to Local
Deformations for Automated Pollen Recognition
Olaf Ronneberger, Qing Wang, and Hans Burkhardt
Albert-Ludwigs-Universität Freiburg, Institut für Informatik, Lehrstuhl für
Mustererkennung und Bildverarbeitung, Georges-Köhler-Allee Geb. 052,
79110 Freiburg, Germany
{ronneber,qwang,burkhardt}@informatik.uni-freiburg.de
Abstract. We present a new technique for the extraction of features
from 3D volumetric data sets based on group integration. The features
are invariant to translation, rotation and global radial deformations.
They are robust to local arbitrary deformations and nonlinear gray value
changes, but are still sensitive to fine structures. On a data set of 389 con-
focally scanned pollen from 26 species we get a precision/recall of 99.2%
with a simple 1NN classifier. On volumetric transmitted light data sets of
about 180,000 airborne particles, containing about 22,700 pollen grains
from 33 species, recorded with a low-cost optic in a fully automated
online pollen monitor, the mean precision for allergenic pollen is 98.5%
(recall: 86.5%) and for the other pollen 97.5% (recall: 83.4%).
1 Introduction
Nearly all worldwide pollen forecasts are still based on manual counting of pollen
in air samples under the microscope. Within the BMBF-funded project “OMNIBUSS”
a first demonstrator of a fully automated online pollen monitor was
developed that integrates the collection, preparation and microscopic analysis
of air samples. Due to commercial interests, no details of the developed pattern
recognition algorithms were published within the last three years. This is the
first time that we show how this machine works behind the scenes.
Challenges in pollen recognition. Due to the great intra-class variability and
only very subtle inter-class differences, automated pollen recognition is a very chal-
lenging but still largely unsolved problem. As most pollen grains are nearly spher-
ical and the subtle differences are mainly found near the surface, a pollen expert
needs the full 3D information (usually by “focussing through” the transparent
pollen grain). An additional difficulty is that pollen grains are often agglomerated
and that the air samples contain lots of other airborne particles. For a reliable
measurement of highly allergenic pollen (e.g. Artemisia; a few such pollen grains per m³
of air can already cause allergic reactions), the avoidance of false positives is one
of the most important requirements for a fully automated system.
State of the art. Almost all published articles concerning pollen recognition
deal with very low numbers of pollen grains from only a few species and use

manually prepared pure pollen samples, e.g. [1]. Only [4] used a data set from
real air samples containing a reasonable number of pollen grains (3686) from
27 species. But even on a reduced data set containing only 8 species and dust
particles, the recall was only 64.9% with a precision of 30%.
Main Contribution. In this paper we describe the extension of the Haar-
integration framework [9,6,7,8] (further denoted as “HI framework”) to global
and local deformations. This is achieved by creating synthetic channels con-
taining the segmentation borders and employing special parameterized kernel
functions. Due to the sparsity of non-zero-values in the synthetic channels the
resulting integral features are highly localized in the real space, while the frame-
work automatically guarantees the desired invariance properties.
For efficient computation of these integrals we make use of the sparsity of
the data in the synthetic channels and use a Fourier or spherical harmonics
(“SH”) series expansion (for the desired rotation invariance) to compute multiple
features at the same time.
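For intuition only, the following minimal sketch (not the kernel functions or the implementation used in the paper) shows the Fourier-series trick behind planar rotation invariance: values sampled on a ring around the object center form an angular profile, a rotation of the object only shifts this profile cyclically, and the magnitudes of its Fourier coefficients are therefore (approximately) rotation invariant. Function name, radius and coefficient counts are illustrative assumptions.

```python
import numpy as np
from scipy.ndimage import map_coordinates

def ring_fourier_features(img, center, radius, n_angles=64, n_coeffs=8):
    """Sample a 2D image on a ring around `center` and return the magnitudes
    of the first Fourier coefficients of the angular profile.  Rotating the
    image about `center` only shifts the profile cyclically, so the
    magnitudes are (up to resampling error) invariant to planar rotation."""
    angles = np.linspace(0.0, 2.0 * np.pi, n_angles, endpoint=False)
    rows = center[0] + radius * np.sin(angles)
    cols = center[1] + radius * np.cos(angles)
    profile = map_coordinates(img.astype(float), [rows, cols],
                              order=1, mode='nearest')
    return np.abs(np.fft.rfft(profile))[:n_coeffs]

# one call per radius yields a small array of rotation-invariant features
```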
Fig. 1. 3D recordings of Betula pollen grains: (a) volume rendering of a confocal
data set; (b) horizontal and vertical cuts of a confocal data set; (c) horizontal and
vertical cuts of a transmitted light data set. In transmitted light microscopy the
recording properties in z-direction (the direction of the optical axis) are significantly
different from those in the xy-direction, because the effects of diffraction, refraction
and absorption depend on the direction of the transmitted light. Furthermore there
is a significant loss of information in z-direction due to the low-pass property of the
optical transfer function.
2 Material and Methods
Data Sets. To demonstrate the generality of the proposed invariants and com-
pare them to earlier results, we use two different pollen data sets in this article.
Both contain 3D volumetric recordings of pollen grains.
The “confocal data set” contains 389 pollen grains from 26 German pollen
taxa, recorded with a confocal laser scanning microscope (fig 1a,b). For further
details on this data set refer to [6].
The “pollen monitor data set” contains about 180,000 airborne particles in-
cluding about 22,700 pollen grains from air samples that were collected, prepared

and recorded with transmitted light microscopy from the online pollen monitor
from March to September 2006 in Freiburg and Zürich (fig. 1c). All 180,000
particles were manually labeled by pollen experts.
Segmentation. To find the 3D surface of the pollen grains in the confocal data
set, we use the graph cut algorithm described in [2]. The original data were first
scaled down. The edge costs to source and sink were modeled by a Gaussian
distribution relative to the mean and minimum gray value. We added voxel-to-
voxel edges to the 124 neighborhood, where the weight was a Gaussian of the
gray differences. The resulting binary mask was then smoothly scaled up to the
original size.
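As a rough illustration of this step, the terminal costs and neighbour weights of such a graph could be set up as sketched below; this is a minimal sketch under assumed parameter values, not the project's code. The 6-neighbourhood and the sigma values are simplifying assumptions (the paper uses a 124-neighbourhood), and an actual max-flow solver would still have to compute the cut.

```python
import numpy as np

def graphcut_weights(vol, sigma_terminal=0.15, sigma_edge=0.1):
    """Set up edge weights for a graph-cut segmentation of a volume `vol`
    normalized to [0, 1]: costs to the two terminals are Gaussians relative
    to the mean and the minimum gray value, and neighbour weights are
    Gaussians of the gray-value differences.  Only the 6-neighbourhood along
    each axis is built here; the sigma values are illustrative."""
    cost_mean = np.exp(-0.5 * ((vol - vol.mean()) / sigma_terminal) ** 2)
    cost_min = np.exp(-0.5 * ((vol - vol.min()) / sigma_terminal) ** 2)
    neighbour = [np.exp(-0.5 * (np.diff(vol, axis=a) / sigma_edge) ** 2)
                 for a in range(vol.ndim)]
    return cost_mean, cost_min, neighbour

# A max-flow solver (e.g. one of the algorithms compared in [2]) would then
# be run on this graph; the resulting binary mask is rescaled to the
# original resolution as described above.
```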
The first step in processing the pollen monitor data set is the detection of
circular objects with voxel-wise vector based gray-scale invariants, similar to
those in [8]. For each detected circular object the precise border in the sharpest
layer is searched: As parts of the object border are often missing or not clear, we
use snakes to find a smooth and complete border. To avoid the common problem
of snakes being attracted to undesired edges (if plain gradient magnitude is used
as force field), we take the steps depicted in fig. 2.
Fig. 2. Segmentation of transmitted light microscopic images: (a) sharpest layer;
(b) found edges; (c) weighted edges; (d) final snake. The segmentation proceeds in
three steps:
1. Applying modified Canny edge detection. As pollen grains have a nearly round
shape, the edges that are approximately perpendicular to the radial direction are
more relevant. We replace the gradient with its radial component in the original
Canny edge detection algorithm (see the sketch below).
2. Model-based weighting of the edges. The curvatures and relative locations of the
edges are analyzed and each edge is given a different weight. Some edges are even
eliminated. As a result, a much clearer weighted edge image is obtained.
3. Employing snakes to find the final border. The initial contour is chosen to be the
circle found in the detection step. The external force field is the so-called “gradient
vector flow” [10] computed from the weighted edge image.
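The radial-component modification in step 1 can be sketched as follows; this is illustrative code assuming the grain center from the detection step is known, not the authors' implementation.

```python
import numpy as np

def radial_gradient_component(img, center):
    """Project the image gradient onto the radial direction from `center`.
    Used in place of the plain gradient inside a Canny-style detector, this
    favours edges roughly perpendicular to the radial direction, as expected
    for nearly round pollen grains."""
    gy, gx = np.gradient(img.astype(float))
    yy, xx = np.mgrid[0:img.shape[0], 0:img.shape[1]]
    ry, rx = yy - center[0], xx - center[1]
    norm = np.hypot(ry, rx) + 1e-9        # avoid division by zero at the centre
    return (gy * ry + gx * rx) / norm     # signed radial derivative
```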
2.1 Construction of Invariants
For the construction of invariants we use the combination of a normalization
and Haar-integration [9,6,7,8] (see eq. (1)) over a transformation group con-
taining rotations and deformations (Haar-integration has nothing to do with
Haar wavelets). In contrast to the very general approach in [6], we now use the

object center and the outer border found in the segmentation step to extract
more distinctive features describing certain regions of the object.
$$T[f](\mathbf{X}) := \int_G f(g\mathbf{X})\,dg \qquad (1)$$
$G$: transformation group
$g$: one element of the transformation group
$dg$: Haar measure
$f$: nonlinear kernel function
$\mathbf{X}$: $n$-dimensional, multi-channel data set
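A group integral of this kind can be approximated numerically by averaging the kernel value over sampled group elements. The sketch below does this for planar rotations only, using a generic resampling-based rotation; it is a conceptual illustration, not the efficient series-expansion evaluation used in the paper.

```python
import numpy as np
from scipy.ndimage import rotate

def haar_integral_planar(X, f, n_rot=36):
    """Approximate T[f](X) = integral over G of f(gX) dg for the group of
    planar rotations by averaging the kernel value over sampled angles.
    `X` is a 2D array, `f` any (nonlinear) kernel mapping an array to a
    scalar; the number of samples is an illustrative choice."""
    values = [f(rotate(X, angle, reshape=False, order=1, mode='nearest'))
              for angle in np.linspace(0.0, 360.0, n_rot, endpoint=False)]
    return float(np.mean(values))

# example two-point kernel: product of the gray values at two fixed positions
# T = haar_integral_planar(img, lambda Y: Y[10, 30] * Y[40, 20])
```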
Invariance to translations. Invariance to translations is achieved by moving
the center of mass of the segmentation mask to the origin. The final features are
quite insensitive to errors in this normalization step, because they are computed
“far” away from this center and only the direction to it is used.
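A minimal sketch of this normalization, assuming a numpy volume and a binary segmentation mask (the function name is illustrative):

```python
import numpy as np
from scipy.ndimage import center_of_mass, shift

def center_volume(vol, mask):
    """Shift the volume so that the centre of mass of the binary segmentation
    mask lies at the centre of the array (standing in for the origin)."""
    offset = (np.array(vol.shape) - 1) / 2.0 - np.array(center_of_mass(mask))
    return shift(vol, offset, order=1, mode='nearest')
```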
Invariance to rotation. Invariance to rotation around the object center is
achieved by integration over the rotation group. In the confocal data set we can
model a 3D rotation of a real-world object by a 3D rotation of the recorded volu-
metric data set (see fig. 1b). In contrast to this, the transmitted light microscopic
image stacks from the pollen monitor data set show very different characteristics
in xy- and z-direction (see fig. 1c). A rotation around the x- or y-axis of the
real-world object results in such different gray value distributions that it is more
reasonable to model only the rotation around the z-axis, resulting in a planar
rotation invariance.
Invariance to global Deformations and Robustness to local Deformations.
The deformation model consists of two parts. The global deformations are modeled
by a simple shift in the radial direction $\mathbf{e}_r$, which depends only on the
angular coordinates (see figure 3a). For full 3D rotations described in spherical
coordinates $\mathbf{x} = (x_r, x_\varphi, x_\vartheta)$ this model is
$$\mathbf{x}' = \mathbf{x} + \boldsymbol{\gamma}(\mathbf{x}) \quad\text{with}\quad \boldsymbol{\gamma}(\mathbf{x}) = \gamma(x_\varphi, x_\vartheta) \cdot \mathbf{e}_r(x_\varphi, x_\vartheta)\,. \qquad (2)$$
For rotations around the z-axis described in cylindrical coordinates $\mathbf{x} = (x_r, x_\varphi, x_z)$
we get
$$\mathbf{x}' = \mathbf{x} + \boldsymbol{\gamma}(\mathbf{x}) \quad\text{with}\quad \boldsymbol{\gamma}(\mathbf{x}) = \gamma(x_\varphi) \cdot \mathbf{e}_r(x_\varphi)\,. \qquad (3)$$
Please note that this deformation is well defined only for $r > \gamma(\varphi)$, which is
no problem in the present application, because the features are computed “far”
away from the center.
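To make the radial model concrete, the sketch below applies one fixed global deformation of the form (3) to a 2D slice by resampling. The specific γ in the usage line is an arbitrary illustrative choice; the invariants integrate over all such deformations rather than applying a single one.

```python
import numpy as np
from scipy.ndimage import map_coordinates

def radial_deform(img, center, gamma):
    """Apply the global deformation x' = x + gamma(phi) * e_r of eq. (3) to a
    2D slice by resampling: a pixel at radius r', angle phi in the output is
    taken from radius r' - gamma(phi) in the input."""
    yy, xx = np.mgrid[0:img.shape[0], 0:img.shape[1]]
    dy, dx = yy - center[0], xx - center[1]
    r = np.hypot(dy, dx)
    phi = np.arctan2(dy, dx)
    r_src = np.maximum(r - gamma(phi), 0.0)        # inverse radial shift
    rows = center[0] + r_src * np.sin(phi)
    cols = center[1] + r_src * np.cos(phi)
    return map_coordinates(img.astype(float), [rows, cols],
                           order=1, mode='nearest')

# usage: deformed = radial_deform(img, (cy, cx), lambda phi: 5.0 * np.cos(2 * phi))
```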
The smaller local deformations are described by an arbitrary displacement
field D(x) such that
$$\mathbf{x}' = \mathbf{x} + \mathbf{D}(\mathbf{x}) \qquad (4)$$
(see fig. 3b). For the later partial Haar-integration [3] over all possible realizations
of this displacement field, it is sufficient to know only the probability for the
occurrence of a certain relative displacement r within this field as
$$p\big(\mathbf{D}(\mathbf{x}+\mathbf{d}) - \mathbf{D}(\mathbf{x}) = \mathbf{r}\big) = p_d(\mathbf{r};\mathbf{d}) \qquad \forall\, \mathbf{x}, \mathbf{d} \in \mathbb{R}^3\,, \qquad (5)$$
where we select $p_d(\mathbf{r};\mathbf{d})$ to be a rotationally symmetric Gaussian distribution
with a standard deviation $\sigma = \|\mathbf{d}\| \cdot \sigma_d$.

Fig. 3. Possible realizations of the deformation models: (a) global deformation model
(radial); (b) local deformation model (arbitrary).
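A direct transcription of this displacement model (with an illustrative value for the free robustness parameter σ_d) might look like:

```python
import numpy as np

def p_d(r, d, sigma_d=0.1):
    """Density of eq. (5): the relative displacement r between two points a
    distance d apart is modelled as a rotationally symmetric Gaussian with
    standard deviation sigma = ||d|| * sigma_d (assumes d != 0; the value of
    the free parameter sigma_d is illustrative)."""
    r = np.asarray(r, dtype=float)
    sigma = np.linalg.norm(d) * sigma_d
    return (np.exp(-0.5 * np.dot(r, r) / sigma ** 2)
            / (2.0 * np.pi * sigma ** 2) ** (r.size / 2.0))
```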
While we achieve full invariance to radial deformations by full Haar-integration
we can only reach robustness to local deformations by partial Haar-integration.
But this non-invariance in the second case is exactly the desired behavior. In com-
bination with appropriate kernel functions this results in a continuous mapping of
objects (with weak or strong local deformations) into the feature space.
The kernel functions. Instead of selecting a certain fixed number of kernel
functions, we introduce parameterized kernel functions here. Embedded into the
HI framework, each new combination of kernel parameters results in a new in-
variant feature. For multiple kernel parameters, we now have a multidimensional
invariant feature array describing the object.
Robustness to gray value transformations. To become robust to gray value trans-
formations the information is split into gradient direction (which is very robust
even under nonlinear gray value transformations) and gradient magnitude. This
was already successfully applied to the HI framework in [8] and to confocal pollen
data sets in [5].
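A simple sketch of this split for a 3D volume, using plain finite differences (the actual gradient computation in the paper may differ):

```python
import numpy as np

def gradient_direction_and_magnitude(vol, eps=1e-9):
    """Split the volume gradient into a field of unit direction vectors
    (robust even under nonlinear, monotonic gray-value changes) and a
    separate magnitude channel."""
    grads = np.stack(np.gradient(vol.astype(float)))   # shape (3, Z, Y, X)
    magnitude = np.sqrt((grads ** 2).sum(axis=0))
    direction = grads / (magnitude + eps)
    return direction, magnitude
```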
Synthetic channels with segmentation results. To feed the segmentation informa-
tion into the HI framework we simply render the surface (confocal data set) or
the contour of the sharpest layer (transmitted light data set) as delta-peaks into
a new channel S and extend the kernel-function with two additional points that
sense the gray value in this channel. The only condition for this technique is
that the computation of the synthetic channel and the action of the transformation
group can be exchanged without changing the result (i.e., we must get the
same result if we first extract the surface and then rotate and deform the volume
and vice versa).
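For the confocal case, such a synthetic channel could be produced from a binary segmentation mask roughly as follows; this is a sketch based on morphological erosion, and the paper's rendering of the surface may differ in detail.

```python
import numpy as np
from scipy.ndimage import binary_erosion

def segmentation_channel(mask):
    """Render the border of a binary segmentation mask as a synthetic channel
    S: ones on the one-voxel-thick surface, zeros elsewhere."""
    mask = mask.astype(bool)
    surface = mask & ~binary_erosion(mask)
    return surface.astype(np.float32)
```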
Resulting kernel function. To achieve the requested properties we construct 4-point
kernels, where 2 points of the kernel, $\mathbf{a}_1$ and $\mathbf{a}_2$, sense the segmentation

References
– An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision
– Snakes, shapes, and gradient vector flow
– Automatic recognition of biological particles in microscopic images
– Invariant features for gray scale images
Frequently Asked Questions

Q1. What are the contributions in “3D invariants with high robustness to local deformations for automated pollen recognition”?

The authors present a new technique for the extraction of features from 3D volumetric data sets based on group integration.

From the training set only the “clean” (not agglomerated, not contaminated) pollen and the “non-pollen” particles from a few samples were used to train the support vector machine (SVM), using the RBF kernel (radial basis function) and the one-vs-rest multi-class approach.

For the application on the pollen monitor data set (rotational invariance only around the z-axis), $\mathbf{q}$ is split into a radial distance $q_r$ to the segmentation border and the z-distance $q_z$ to the central plane.

The best sampling of the parameter space of the kernel functions (corresponding to the inner-class deformations of the objects) was found by cross-validation on the training data set, resulting in $N_{q_r} \times N_{q_z} \times N_c \times n = 31 \times 11 \times 16 \times 16 = 87296$ “structural” features (using kernel function $k_1$) and 8 “shape” features (using kernel function $k_2$).

For the group of arbitrary deformations $G_D$ and the group of rotations $G_R$ the final Haar integral becomes
$$T = \int_{G_R}\!\int_{G_\gamma}\!\int_{G_D} f\big(g_R g_\gamma g_D S,\; g_R g_\gamma g_D \mathbf{X}\big)\, p(\mathbf{D})\; dg_D\, dg_\gamma\, dg_R\,, \qquad (8)$$
where $p(\mathbf{D})$ is the probability for the occurrence of the local displacement field $\mathbf{D}$. The transformation of the data set is described by $(g\mathbf{X})(\mathbf{x}) =: \mathbf{X}(\mathbf{x}')$, where $\mathbf{x}' = \underbrace{R\mathbf{x}}_{\text{rotation}} + \underbrace{\boldsymbol{\gamma}(R\mathbf{x})}_{\text{global deformation}} + \ldots$

For 3D rotations this framework uses a spherical-harmonics series expansion, and for planar rotations around the z-axis it is simplified to a Fourier series expansion.