
Automated image-based tracking and its application in ecology

Anthony I. Dell 1, John A. Bender 2, Kristin Branson 3, Iain D. Couzin 4, Gonzalo G. de Polavieja 5, Lucas P.J.J. Noldus 6, Alfonso Pérez-Escudero 5, Pietro Perona 7, Andrew D. Straw 8, Martin Wikelski 9,10, and Ulrich Brose 1

1 Systemic Conservation Biology, Department of Biology, Georg-August University Göttingen, Göttingen, Germany
2 HasOffers Inc., 2220 Western Ave, Seattle, WA, USA
3 Howard Hughes Medical Institute, Janelia Farm Research Campus, Ashburn, VA, USA
4 Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
5 Instituto Cajal, CSIC, Av. Doctor Arce, 37, Madrid, Spain
6 Noldus Information Technology BV, Nieuwe Kanaal 5, 6709 PA Wageningen, The Netherlands
7 Computation and Neural Systems Program, California Institute of Technology, Pasadena, CA, USA
8 Research Institute of Molecular Pathology (IMP), Vienna, Austria
9 Max Planck Institute for Ornithology, Radolfzell, Germany
10 Biology Department, University of Konstanz, Konstanz, Germany
The behavior of individuals determines the strength and outcome of ecological interactions, which drive population, community, and ecosystem organization. Bio-logging, such as telemetry and animal-borne imaging, provides essential individual viewpoints, tracks, and life histories, but requires capture of individuals and is often impractical to scale. Recent developments in automated image-based tracking offer opportunities to remotely quantify and understand individual behavior at scales and resolutions not previously possible, providing an essential supplement to other tracking methodologies in ecology. Automated image-based tracking should continue to advance the field of ecology by enabling better understanding of the linkages between individual and higher-level ecological processes, via high-throughput quantitative analysis of complex ecological patterns and processes across scales, including analysis of environmental drivers.
Measuring behavior

Individual behavior (see Glossary) underlies almost all aspects of ecology [1–5]. Accurate and highly resolved behavioral data are therefore critical for obtaining a mechanistic and predictive understanding of ecological systems [5]. Historically, direct observation by trained biologists was used to quantify behavior [6,7]. However, the extent and resolution to which direct observations can be made is highly constrained [8], and the number of individuals that can be observed simultaneously is small. In addition, an exact record of events is not preserved, only the biologist's subjective account of them.

Recent technological advances in tracking now make it possible to collect large amounts of highly precise and accurate behavioral data. For many organisms, equipment can be attached that provides information about the individuals' spatiotemporal position, orientation, and physiology. This 'bio-logging' allows remote reconstruction of behavior over large spatiotemporal extents, providing essential individual viewpoints, tracks, and life histories, and thus important ecological and evolutionary insights [9–11].

Glossary

Background subtraction: a method used by software to compare the current video frame with a stored picture of the background; any pixel of the current frame that is significantly different from the corresponding pixel in the background is likely to be associated with the body of an animal. Useful in situations where the background is unchanging, for example, when the surface of the background is rigid and lighting does not change.

Behavior: the actions of individuals, often in response to stimuli. Behavior can involve movement of the individual's body through space, such as walking or chasing, or can occur while the animal is stationary, such as grooming or eating.

Bio-logging: attachment or implantation of equipment to organisms to provide information about their identity, location, behavior, or physiology (e.g., global positioning systems, accelerometers, video cameras, telemetry tags).

Ecological interaction: any interaction between an organism and its environment, or between two organisms (i.e., including interactions between conspecifics).

Fingerprinting: a method used to identify unmarked individuals using natural variation in their physical and/or behavioral appearance. The method works by transforming the images of each individual into a characteristic 'fingerprint', which can then be used to distinguish individual organisms both within and across videos.

FPS (frames per second): the number of frames in an image sequence collected per second.

Image: any measurement of the spatiotemporal position or pose of organisms that can be recast into a digital image and analyzed using computer vision techniques (see Box 2).

Machine learning: a set of techniques that allow computer software to learn from empirical data, user assumptions, or manual annotation. These approaches are becoming increasingly common in the analysis of behavior, where users can tag behavior in short sequences of images and the software can predict occurrences of these behaviors throughout the entire image sequence.

Marking: the attachment of artificial 'marks' to organisms to maintain their identity, such as paint or barcodes.

Occlusion: when the view of any individual in an image is disrupted either by another individual or physical habitat (i.e., the occluding object lies in a straight line between the focus individual and the camera).

Pixel: a physical point in a 2D digital image, and therefore the smallest controllable element of a picture represented on the screen. The equivalent of a pixel in 3D space is a voxel.

Pose: any additional geometrical quantity of interest other than the center of the main body of the animal, such as orientation, wing positions, body curvature, etc.

Position: the center of body mass of an individual in time and space.

Resolution: the number of pixels/voxels in a digital image.

0169-5347/ © 2014 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.tree.2014.05.004
Corresponding author: Dell, A.I. (adell@gwdg.de, tonyidell@gmail.com).
Keywords: behavior; bio-logging; ecological interactions; tracking; automated image-based tracking.
Trends in Ecology & Evolution, July 2014, Vol. 29, No. 7
Image-based tracking, for example with video, is another tracking method that shows great potential in ecology. Similar to bio-logging, image-based tracking involves digital recording of data, meaning an objective view of events is maintained, increasing repeatability of studies and allowing biologists to mine data for quantities not originally considered. Image-based tracking can be used when individuals are too small to attach bio-loggers, or if the equipment itself changes behavior, and all visible and sufficiently resolved individuals within the imaged area can be tracked, not just those with loggers attached. Also, image-based tracking generally allows for higher spatiotemporal resolution of behavioral data than bio-logging, and many imaging methods allow extraction of quantitative information about the environment, such as its temperature or topography.

Currently, constraints on the acquisition, processing, and storage of digital information limit the spatiotemporal extent of image-based tracking, and extracting the position and pose of every individual in each image is difficult in complex habitat and at high densities. Nonetheless, constraints are rapidly being overcome, and image-based tracking now provides a valuable tool to undertake rigorous hypothesis-driven research in ecology (Box 1). Here we review the state-of-the-art of image-based tracking, its strengths and limitations when applied to ecological research, and its application to solve relevant ecological questions.
Automated image-based tracking

Initial applications of image-based tracking required manual analysis [12,13], which is effort intensive, often leads to poor spatiotemporal resolution, and is open to observer effects such as subjective decisions about which information to record. Recent advances in automation are overcoming these issues [14–16], and there now exist several image-based systems capable of extracting individual behavior with minimal or zero manual intervention (Table S1 in the supplementary material online). Tracking over ecologically relevant spatiotemporal scales is becoming easier, owing to advances in imaging and computing technologies, and by the development of software that can track in real time [17–19] and recognize individuals across image sequences [20,21]. Biologists now employ a wide range of imaging methods (e.g., near infrared, thermal infrared, sonar, 3D) that permit tracking in environments where optical video is unsuitable (Box 2).

To date, automated image-based tracking has primarily been undertaken in the laboratory, where biologists have examined genetic and physiochemical drivers of behavior in model species (Table S1 in the supplementary material online) (Box 1). However, the past decade has seen expansion of these methods into the field, and automated image-based tracking has now been undertaken on a wide diversity of species, including plants, worms, spiders, insects, fish, birds, mammals, and more (Table S1 in the supplementary material online).

Automated image-based tracking involves three main steps (Figure 1): (i) acquisition of image sequences (Box 2); (ii) detection of individuals and their pose in each image and appropriate 'linking' of detections in consecutive images to create trajectories through time (Box 3); and (iii) analysis of behavioral data (Box 4).
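Step (iii) often begins with simple kinematic quantities derived from trajectories, such as the per-frame body velocity plotted in Figure 1. A minimal sketch of such a computation (the function name and toy trajectory are illustrative, not from the article):

```python
import numpy as np

def body_velocity(positions, fps):
    """Estimate per-frame body velocity (units/s) from a trajectory.

    positions: (n, 2) sequence of x, y centroids, one row per frame.
    fps: frames per second of the image sequence.
    """
    positions = np.asarray(positions, dtype=float)
    # Displacement between consecutive frames, scaled to units per second.
    step = np.diff(positions, axis=0)           # (n-1, 2)
    return np.linalg.norm(step, axis=1) * fps   # (n-1,)

# A stationary-then-moving toy trajectory sampled at 25 FPS.
track = [(0, 0), (0, 0), (1, 0), (2, 0)]
print(body_velocity(track, fps=25))  # per-frame speeds: 0, 25, 25 units/s
```

From such per-frame speeds one can build the frequency histograms and behavioral categorizations sketched in Figure 1, for example by thresholding speed to separate stationary from moving behaviors.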
Real-time tracking is performed as images are acquired, removing the need for storing large amounts of digital information [17–19] and allowing researchers to influence the animal's environment in real time through virtual reality, robotics, or other dynamical stimulus regimes [22–24]. Even under controlled laboratory conditions with small numbers of individuals, automated image-based tracking is a difficult computer vision problem. Biological organisms are highly deformable objects which behave in unconstrained and variable ways [25], and the environments within which they exist are complex and dynamic. Ultimately, in automated image-based tracking there is a trade-off between the difficulty of the tracking problem (horizontal axis in Figure 2) and the quality of tracking output (vertical axis in Figure 2).

Box 1. Ecological insights from automated image-based tracking

We see three key areas where considerable intellectual progress has been made in ecology using automated image-based tracking. First, the kinematics of animal behavior [17–19,23,24,34,42,57,66,69,70,74,76–81], including the role of the internal state of animals, such as their physiology or genes, and the external environment. Recent breakthroughs in remote quantification of physical landscapes [58–60] and 3D imaging [29] should be especially helpful for these questions. Second, collective behavior in animal groups [1,26,33,38,40,43,45,62,82,83], including understanding how information about the physical and biological environment transfers between individuals. Generally, this research centers on intraspecific groups comprising large numbers of similar-sized individuals. Third, determinants of social behavior [8,27–29,31,53,54,67,71,73,84]. Research in this last category usually focuses on a small number of individuals, because identifying the detailed pose required for automated behavioral analysis is difficult in larger groups. Tracking over short durations (minutes) has aided in our understanding of the genetic basis of social behavior, such as aggression or courtship [8,85], where the high throughput that automation allows provides enhanced power for uncovering patterns in behavioral data [27]. Research over longer times can uncover complex temporal linkages between social behaviors [8,28], and experiments over the order of weeks provide unique insight into the social and behavioral development of individuals in intraspecific groups [31,53,54].

Enormous potential exists for automated image-based tracking to address other key issues in ecology. One area in which we expect significant growth is the study of interspecific interactions, which are critical to ecological systems [1–5]. For example, biologists recently used automated analysis of sonar images to reveal how coordinated hunting by predators leads to increased fragmentation and irregularities in the spatial structure of prey groups, and thus inhibition of information transfer among prey [4]. Laboratory research alone provides much scope for experimentally testing basic ideas about ecology, such as the role of body size or predator density in determining trophic interaction strength (Movie S3 in the supplementary material online) (A.I. Dell, unpublished). Image-based tracking can also address more applied questions, such as the role of fragmentation in population dynamics (A.I. Dell, unpublished) or determining the size of animal populations that are historically difficult to measure [52]. Integrating automated tracking techniques into images already collected by trigger-based cameras to assess species occurrence and population abundances [21] would provide important information about the behavior of organisms in natural ecosystems.
Difficulty of the tracking problem

Tracking is easiest in laboratory-based systems with a simple environmental landscape and low numbers of individuals (left panel in Figure 2), and most difficult in the field, where many individuals from many different species interact across a complex environmental landscape (right panel in Figure 2).
From individuals to interactions

Monitoring the behavior of individuals as they interact with each other is difficult for several reasons.
Box 2. Obtaining an image sequence

The first step in automated image-based tracking involves obtaining a machine-readable sequence of images that accurately represents the real world. This translation between the real and digital world is a critical step, and time spent optimizing the image (such as ensuring sufficient contrast between foreground and background) pays substantial dividends during subsequent steps (see Figure 1 in main text). Optical video is commonly used owing to its accessibility and low cost, but other imaging technologies considerably expand the range of environmental contexts within which tracking can be undertaken (Figure I). These include infrared (Figure IA,B), thermal infrared [50] (Figure IC; Movie S7 in the supplementary material online), X-ray microtomography [55] (Figure ID), and sonar [4] (Figure IE; Movie S9 in the supplementary material online). Light-field (Figure IF) and multi-scale gigapixel [86] (Figure IG) imaging should permit tracking and scene reconstruction in 3D from a single image viewpoint. Although frame rates of gigapixel cameras are increasing (S.D. Feller, unpublished), at three frames per minute [86] they are currently too slow for most automated tracking applications. Light-field cameras work at higher frame rates, and several laboratories are exploring whether they can be successfully incorporated into automated tracking systems (I.D. Couzin and G.G. de Polavieja, unpublished).

Ultimately, decisions about which imaging method to use should be determined by the specific needs of the project. Automated tracking generally requires a high-contrast image so that computer vision algorithms can adequately discern organisms and their appendages from the surrounding background (Box 3). A common and low-cost method of obtaining such images is to construct an artificial arena for tracking experiments, which is often colored in contrast with the animals, and brightly and uniformly lit with diffuse lighting (Figure IA,B). Deciding on the spatial and temporal resolution of images is also a key consideration. Higher resolutions generally result in better tracking results and more precise quantification of behavior, but bottlenecks during the transmission, storage, and processing of digital information can limit high temporal resolution to low spatial resolution and/or short durations.
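The scale of this bottleneck is easy to estimate from the quantities defined in the Glossary (resolution and FPS): the uncompressed data rate is simply pixels per frame × frames per second × bytes per pixel. A back-of-the-envelope sketch (the camera parameters are illustrative, not from the article):

```python
def raw_data_rate_mb_s(width, height, fps, bytes_per_pixel=1):
    """Uncompressed data rate of an image stream, in decimal MB/s."""
    return width * height * fps * bytes_per_pixel / 1e6

# An 8-bit monochrome 2048x2048 sensor recording at 100 FPS.
rate = raw_data_rate_mb_s(2048, 2048, 100)
print(f"{rate:.0f} MB/s, {rate * 3600 / 1000:.1f} GB per hour")
# → 419 MB/s, 1509.9 GB per hour
```

Numbers of this magnitude explain why codec choice and real-time tracking (which avoids storing raw frames at all) matter so much in practice.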
Constraints on low spatial resolutions can be overcome by integrating output from multiple cameras [18] and should become less important as technology advances. Recording software is another important consideration, such as the choice of codec for encoding and compressing digital data, or ensuring that accurate time stamps are obtained and that frames are not silently dropped; robust open source [87,88] and commercial [Noldus Information Technology, Media Recorder, 2013 (http://www.noldus.com/media-recorder); Norpix, StreamPix, 2013 (http://www.norpix.com/products/streampix/streampix.php)] options are available.
Figure I. A growing number of technologies allow capturing of digital images for automated image-based tracking. (A) The most common is optical or near-infrared video, most often used in simple 2D laboratory settings (left panel in Figure 1) (Movie S1–S4, Movie S5, Movie S10, Movie S11, Movie S14, and Movie S17 in the supplementary material online). (B) Images from multiple cameras allow tracking in 3D, even with some degree of habitat complexity present (Movie S6 and Movie S15 in the supplementary material online). (C) Thermal imaging allows tracking in complete darkness, but requires that tracked animals have a surface temperature different from the surrounding landscape (Movie S7 in the supplementary material online). (D) High-resolution X-ray microtomography permits imaging through complex habitat structure, such as soil (burrowing invertebrate highlighted by red arrow). (E) Acoustic imaging (sonar) can also image in habitats where optical video would be unusable, such as this image of predators foraging for schooling bait fish in a turbid estuary [4] (Movie S9 in the supplementary material online). (F) Light-field cameras allow for post-hoc selection of focal points, thus potentially allowing tracking and construction of the scene in 3D from a single image point. The three panels in (F) were obtained from a single light-field image, each panel representing a different focal point (highlighted by a red arrow). (G) Newly developed gigapixel technologies also permit capturing of images from a single image point with very high spatial resolutions and at multiple scales, again allowing for 3D tracking from a single image point [86]. The three lower panels in (G) are enlarged sections of the main image. See Acknowledgments for credits and permissions.

First, organisms often move rapidly when interacting (Movie S13 in the supplementary material online), requiring data with high spatiotemporal resolution. Second, because multiple individuals are involved, interactions are prone to occlusions, made especially worse because interactions often involve close physical contact. Occlusions cause identity errors, which are not local but propagate throughout the remaining image sequence. Manual corrections of these errors are labor intensive. Customized automated algorithms which predict identity based on the relative speed and direction of movement can reduce mistakes, and thus dramatically reduce the number of manual interventions needed [26,27], but error propagation is still unavoidable because of the stochastic behavior of organisms [15] (Box 3). 'Fingerprinting' somewhat resolves this problem (see below), but maintaining identities always becomes more difficult as the number of close individuals scales with increasing density. Tracking individuals during occlusions is an additional problem and can be partly overcome when prior knowledge about the shape of the organisms is incorporated into the system [26–28]. Recent approaches utilizing multiple 3D depth cameras are especially useful in this regard [29] (Movie S22 in the supplementary material online) and could eventually be integrated with fingerprinting to assist in resolving identities during occlusions.
Most current attempts to track multiple individuals involve organisms that are similar in size and shape (Table S1 in the supplementary material online). In nature, however, interactions between species often involve individuals that differ greatly in size and shape [30] (Movie S13 in the supplementary material online). Although such differences can be useful for distinguishing individuals [8,20], many tracking systems rely on knowledge about the typical shape of individuals to aid in the segmentation and analysis of images [27,28,31]. Even if shape issues are overcome, it remains a difficult task for computer vision algorithms to separate small animals from the body and appendages of larger animals. Algorithm features allowing tracking of differently sized and shaped organisms, such as more sophisticated contour representations or fingerprinting, would greatly enhance the usefulness of image-based tracking to ecologists (Box 5).
Tracking in three dimensions

Automated image-based tracking in 2D environments is substantially more straightforward than in 3D (Figure 2). Therefore, many tracking systems are limited to simple 2D arenas and either involve organisms that naturally move in 2D or quasi-2D, or work by constraining normally 3D individuals to only move in 2D. This latter method can be achieved by modifying organisms directly, such as by wing clipping [27], or by using physical boundaries to constrain behavior to near 2D [1,20,27,32,33] (Movie S1, Movie S4, Movie S5, and Movie S10 in the supplementary material online). In nature, however, most organisms incorporate at least some degree of movement in 3D, which influences ecological interactions [3]. Tracking systems designed for 2D can provide some resolution for behavior in a third spatial dimension [34], but ultimately developers must produce tracking systems that can successfully track large numbers of animals in 3D space (Movie S8 in the supplementary material online).

Tracking unconstrained flying or swimming animals can be achieved in several ways, but most often multiple cameras are employed [18,29,35–45] (Movie S6, Movie S8, and Movie S22 in the supplementary material online). Although only two calibrated cameras taking images of the same point in space are required for triangulation, information from additional cameras can incrementally improve localization, especially if some cameras are limited by occlusion or low contrast [18].
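The triangulation step itself is standard multi-view geometry: given each camera's 3×4 projection matrix and the pixel coordinates of the same point in both views, the 3D position is the least-squares solution of a small linear system (the direct linear transform). A minimal numpy sketch under those assumptions (the toy camera matrices and point are illustrative, not from the article):

```python
import numpy as np

def triangulate(P1, P2, uv1, uv2):
    """Linear (DLT) triangulation of one 3D point from two calibrated views.

    P1, P2: 3x4 camera projection matrices.
    uv1, uv2: (u, v) pixel coordinates of the same point in each image.
    """
    A = np.vstack([
        uv1[0] * P1[2] - P1[0],
        uv1[1] * P1[2] - P1[1],
        uv2[0] * P2[2] - P2[0],
        uv2[1] * P2[2] - P2[1],
    ])
    # The homogeneous solution is the right singular vector of A with
    # the smallest singular value.
    _, _, vt = np.linalg.svd(A)
    X = vt[-1]
    return X[:3] / X[3]

# Two toy cameras: one at the origin, one shifted 1 unit along x.
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])
X_true = np.array([0.5, 0.2, 4.0])
uv = lambda P, X: (P @ np.append(X, 1.0))[:2] / (P @ np.append(X, 1.0))[2]
print(triangulate(P1, P2, uv(P1, X_true), uv(P2, X_true)))
# recovers approximately [0.5, 0.2, 4.0]
```

With more than two cameras, extra rows are simply appended to A, which is how additional views incrementally improve localization.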
Synchronizing multiple cameras requires additional hardware and more complicated software that relates equivalent objects between image sequences; however, this complexity can be hidden from the user by dedicated multi-camera systems [18]. Triangulation is optimized when cameras are positioned with maximally divergent locations, which in the field can introduce problems because arranging unobstructed cameras at multiple locations can be difficult, as can be obtaining multiple views of every location of interest. Some technologies allow 3D tracking from a single imaging device, which could solve many of these issues. For example, 3D images can be reconstructed from a single image of reflections or shadows on a 3D surface [46,47],
[Figure 1 panel annotations: (i) Imaging — data saved as digital image sequences with defined spatial (pixel) and temporal (FPS) resolutions. (ii) Tracking — tracking software uses computer vision algorithms to isolate individuals (foreground) from the surrounding landscape (background), using methods such as background subtraction (see Box 3); positions and orientations of individuals are then integrated across image sequences to form trajectories through time. (iii) Analysis — analysis of trajectories quantifies individual (e.g., body velocity, turning rates, search strategy) and inter-individual (e.g., attack distance) traits; further analysis, which can be automated if behaviors are stereotyped, can condense these high-dimensional quantities into behavioral categories (e.g., sleeping, mating, foraging, eating, walking).]
Figure 1. The three general steps involved in automated image-based tracking of behavior are: (i) imaging (Box 2); (ii) detection of individuals and their pose in the image and appropriate 'linking' of detections to create separate tracks through time for each individual (Box 3); and (iii) analysis of trajectory and behavioral data (Box 4). To date, imaging is often done in the laboratory (left panel), which can more easily provide a clean, crisp image that minimizes tracking errors. Each of these steps is strongly interlinked, and time spent optimizing one step (e.g., imaging) can pay huge dividends in time and effort saved at later steps (e.g., reducing tracking errors).

Box 3. Identifying individuals and behaviors in images

Once a set of suitable images has been obtained (Box 2), the position of individuals, and often their pose, must be automatically computed to form trajectories through time. First, the software must determine whether and where individuals are present in each image. How easily this is done varies with the type and quality of images (Box 2), as well as how accurately each individual's position can be predicted from its previous behavior (see below). Detection is straightforward when the contrast between individuals and the background is substantial and when the background is known or does not change throughout the entire image sequence; in this case it is most easily performed by background subtraction (Figure IA–C). The physical complexity of natural systems will ultimately require more advanced techniques, such as those which constantly update their background image [18], or visual recognition methods [21,63–67], where the distinctive pattern associated with an individual's body and its motion can be recognized against the clutter of the background. The output of the detection stage is an estimate of the pixels associated with individuals in each image.
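The glossary's definition of background subtraction reduces to a per-pixel threshold on the difference between the current frame and a stored background image. A minimal numpy sketch, assuming a static background and a fixed threshold (the function name and values are illustrative):

```python
import numpy as np

def detect_foreground(frame, background, threshold=30):
    """Flag pixels that differ substantially from a stored background image.

    frame, background: 2D uint8 grayscale arrays of equal shape.
    Returns a boolean mask where True marks likely animal (foreground) pixels.
    """
    # Cast to int before subtracting to avoid uint8 wraparound.
    diff = np.abs(frame.astype(int) - background.astype(int))
    return diff > threshold

# A dark animal on a uniformly bright, unchanging background.
background = np.full((4, 4), 200, dtype=np.uint8)
frame = background.copy()
frame[1:3, 1:3] = 40  # a 2x2 dark blob
mask = detect_foreground(frame, background)
print(mask.sum())  # → 4 foreground pixels
```

Production systems layer much more on top of this (adaptive backgrounds, morphological cleanup, shadow handling), but the core comparison is the same.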
The position and pose of organisms with stiff and simple-shaped bodies can be computed by fitting a shape contour to the image of the organism [8,27] (Figure ID), including determining whether clumps of pixels should be separated into multiple individuals (Figure IE–I). The situation is more complex when the body is flexible and multiple degrees of freedom are of interest, such as wing angles or head orientation (Figure IJ). Algorithms for learning and computing an individual's pose are an active area of research, and involve either explicit modeling of the body, or learning associations between image brightness patterns and pose parameters [68,72,76].
Finally, the position of each individual must be linked over multiple frames to form trajectories (Figure IL–P). This is relatively simple for single individuals, although false and missed detections become more likely when detection is problematic. Constructing trajectories for multiple individuals often involves parameterization of a movement model which includes information from previous frames, such as the acceleration of each individual or their preferred direction of motion [89,90]. Movement models also improve the detection phase of tracking, but ultimately suffer from error propagation and thus can be labor intensive.
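One of the simplest movement models of this kind predicts each individual's next position from its most recent displacement (constant velocity) and then greedily claims the nearest detection. A sketch under those assumptions (all names and thresholds are illustrative; published trackers use far more elaborate models and assignment schemes [89,90]):

```python
import numpy as np

def link_frame(tracks, detections, max_dist=10.0):
    """Greedily assign new detections to tracks via predicted positions.

    tracks: list of trajectories, each a list of (x, y) positions.
    detections: (m, 2) array of centroids in the current frame.
    """
    detections = [np.asarray(d, float) for d in detections]
    unused = set(range(len(detections)))
    for track in tracks:
        if not unused:
            break
        pts = np.asarray(track, float)
        # Predict: last position plus last displacement (constant velocity).
        pred = pts[-1] + (pts[-1] - pts[-2] if len(pts) > 1 else 0.0)
        j = min(unused, key=lambda k: np.linalg.norm(detections[k] - pred))
        if np.linalg.norm(detections[j] - pred) <= max_dist:
            track.append(tuple(map(float, detections[j])))
            unused.remove(j)
    return tracks

# Two individuals moving right and left, respectively; the detections
# arrive in arbitrary order but are assigned by predicted position.
tracks = [[(0, 0), (1, 0)], [(10, 0), (9, 0)]]
link_frame(tracks, np.array([[8.2, 0.0], [2.1, 0.0]]))
print(tracks[0][-1], tracks[1][-1])  # → (2.1, 0.0) (8.2, 0.0)
```

The failure mode the main text describes is visible here: once a wrong assignment is made (e.g., after an occlusion), every subsequent prediction builds on it, so the identity error propagates through the rest of the sequence.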
Fingerprinting identifies individuals from their image structure (see main text) and therefore recovers identities after occlusion [20] (Figure IK; Movie S5 in the supplementary material online).
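The published fingerprinting system cited here [20] builds rich intensity and contrast maps for each individual; as a deliberately simplified illustration of the idea — reduce each individual's image to a compact signature, then match new detections against stored signatures — consider this sketch (all names and parameters are illustrative, not the method of [20]):

```python
import numpy as np

def fingerprint(image_patch, bins=16):
    """Reduce an individual's image patch to a normalized intensity histogram.

    A crude stand-in for the richer intensity/contrast 'fingerprints' used
    by published systems, which exploit far more image structure.
    """
    hist, _ = np.histogram(image_patch, bins=bins, range=(0, 256))
    return hist / hist.sum()

def match(query, references):
    """Return the index of the reference fingerprint closest to the query."""
    dists = [np.abs(query - r).sum() for r in references]
    return int(np.argmin(dists))

rng = np.random.default_rng(0)
# Two 'individuals' with subtly different brightness distributions.
a = rng.normal(90, 10, (20, 20)).clip(0, 255)
b = rng.normal(130, 10, (20, 20)).clip(0, 255)
refs = [fingerprint(a), fingerprint(b)]
query = rng.normal(130, 10, (20, 20)).clip(0, 255)  # a new view of 'b'
print(match(fingerprint(query), refs))  # → 1
```

Because the signature depends only on the image, not on the track history, identities can be re-established after an occlusion, which is exactly the property that makes fingerprinting complementary to movement-model linking.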
Figure I. After imaging (Box 2), computer vision software must automatically detect the position, and sometimes pose, of individuals in the image to create trajectories. (A–C) A common approach for detecting individuals is background subtraction, where detection of individuals in raw images is achieved by removing an estimated background-only image, resulting in isolation of foreground pixels. (D) Contours, denoting individuals, can then be mapped onto clusters of these foreground pixels. How many individuals are within a pixel cluster can be determined in a number of ways. The cluster of pixels in (E–H) can be grouped as one, two, three, or four individuals, with (I) the optimal grouping being three individuals based on some quantifiable measure. When overlaps are large or body shapes are non-rigid, other methods using past and future dynamics are more suitable (see main text). (J) More complex contours can precisely map the pose of individuals, such as swimming in Caenorhabditis elegans [19] (Movie S2 in the supplementary material online), wing positioning in Drosophila [8] (Movie S14 in the supplementary material online), or body posturing of mice during social interactions [28] (Movie S11 in the supplementary material online). (K) Fingerprinting allows for maintenance of identities through time by analyzing the complete image structure, often using differences between individuals that are undetectable to the human eye, such as these zebrafish [20] (Movie S5 in the supplementary material online). Once individuals are detected and identified, their positions are linked across frames to form trajectories. (L) This could be a single individual in a 2D landscape [27], (M) a single individual in a 3D landscape (shown here with some habitat complexity) [18] (Movie S6 in the supplementary material online), (N) multiple individuals in a simple 2D landscape [27] (Movie S1 in the supplementary material online), or (O) multiple individuals in a 3D landscape (Movie S8 in the supplementary material online). (P) Trajectories throughout complex habitat can also be obtained, such as this woodlouse navigating for 1 h between two habitat patches connected by a dispersal corridor (A.I. Dell, unpublished). See Acknowledgments for credits and permissions.

Citations
Journal ArticleDOI

DeepLabCut: markerless pose estimation of user-defined body parts with deep learning

TL;DR: Using a deep learning approach to track user-defined body parts during various behaviors across multiple species, the authors show that their toolbox, called DeepLabCut, can achieve human accuracy with only a few hundred frames of training data.
Journal ArticleDOI

Using DeepLabCut for 3D markerless pose estimation across species and behaviors

TL;DR: This protocol describes how to use an open-source toolbox, DeepLabCut, to train a deep neural network to precisely track user-defined features with limited training data, which allows noninvasive behavioral tracking of movement.
Journal ArticleDOI

Toward a Science of Computational Ethology

TL;DR: This work explores the opportunities and long-term directions of research in the new field of Computational Ethology, made possible by advances in technology, mathematics, and engineering that allow scientists to automate the measurement and the analysis of animal behavior.
Journal ArticleDOI

DeepPoseKit, a software toolkit for fast and robust animal pose estimation using deep learning

TL;DR: A new easy-to-use software toolkit, DeepPoseKit, is introduced that addresses animal pose estimation problems using an efficient multi-scale deep-learning model, called Stacked DenseNet, and a fast GPU-based peak-detection algorithm for estimating keypoint locations with subpixel precision.
Journal ArticleDOI

Applications of machine learning in animal behaviour studies

TL;DR: This review aims to introduce animal behaviourists unfamiliar with machine learning (ML) to the promise of these techniques for the analysis of complex behavioural data and illustrate key ML approaches by developing data analytical pipelines for three different case studies that exemplify the types of behavioural and ecological questions ML can address.
References
Book

Multiple view geometry in computer vision

TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.

Journal ArticleDOI

Observational study of behavior: sampling methods.

TL;DR: Seven major types of sampling for observational studies of social behavior have been found in the literature and the major strengths and weaknesses of each method are pointed out.
Proceedings ArticleDOI

KinectFusion: Real-time dense surface mapping and tracking

TL;DR: A system for accurate real-time mapping of complex and arbitrary indoor scenes in variable lighting conditions, using only a moving low-cost depth camera and commodity graphics hardware, which fuse all of the depth data streamed from a Kinect sensor into a single global implicit surface model of the observed scene in real- time.
Frequently Asked Questions (16)
Q1. What are the contributions in "Automated image-based tracking and its application in ecology" ?

Automated image-based tracking and its application in ecology. Anthony I. Dell, John A. Bender, Kristin Branson, Iain D. Couzin, Gonzalo G. de Polavieja, Lucas P.J.J. Noldus, Alfonso Pérez-Escudero, Pietro Perona, Andrew D. Straw, Martin Wikelski, and Ulrich Brose. (1) Systemic Conservation Biology, Department of Biology, Georg-August University Göttingen, Göttingen, Germany; (2) HasOffers Inc., 2220 Western Ave, Seattle, WA, USA; (3) Howard Hughes Medical Institute, Janelia Farm Research Campus, Ashburn, VA, USA; (4) Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA; (5) Instituto Cajal, CSIC, Av. Doctor Arce, 37, Madrid, Spain; (6) Noldus Information Technology BV, Nieuwe Kanaal 5, 6709 PA Wageningen, The Netherlands; (7) Computation and Neural Systems Program, California Institute of Technology, Pasadena, CA, USA; (8) Research Institute of Molecular Pathology (IMP), Vienna, Austria; (9) Max Planck Institute for Ornithology, Radolfzell, Germany; (10) Biology Department, University of Konstanz, Konstanz, Germany.

Unsupervised techniques offer the advantage of decreased subjectivity, and increased throughput, repeatability, and the chance of finding rare behaviors [68,74,75]. 

(A–C) A common approach for detecting individuals is background subtraction, where detection of individuals in raw images is achieved by removing an estimated background-only image, resulting in isolation of foreground pixels. 

(F) Light-field cameras allow for post-hoc selection of focal points, thus potentially allowing tracking and construction of the scene in 3D from a single image point. 

Methods for quantifying the physical structure of 3D landscapes are rapidly advancing [58– 60] and can be used for rendering features of natural habitats, such as trees or streams. 

The final step in automated image-based tracking is analysis, where position and pose data are analyzed to understand relevant biological, and ecological, patterns and processes. 

In addition, the storage and management issues that arise from the huge amounts of digital data that are easily produced by imaging must be addressed. 

Image-based tracking can also address more applied questions, such as the role of fragmentation in population dynamics (A.I. Dell, unpublished) or determining the size of animal populations that are historically difficult to measure [52]. 

This often involves application of artificial markings; however, natural variation in the morphology of individuals can also be used to maintain identities throughout image sequences, even following occlusion (Table S1 in the supplementary material online). 

Remote quantification of the environment can easily be accomplished by imaging in the appropriate sensory regime, such as optical video cameras for quantifying light conditions and thermal cameras for quantifying thermal landscapes.

Light-field cameras work at higher frame rates and several laboratories are exploring whether they can be successfully incorporated into automated tracking systems (I.D. Couzin and G.G. de Polavieja, unpublished).

Once coordinates (and pose estimates if available) are produced, then even very simple analysis can address basic ecological questions such as where and how animals behave and interact [4,8] (Figure IA–C). 
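As a concrete (and hypothetical) illustration of how simple such analysis can be, per-step speeds and total path length follow directly from the trajectory coordinates once the frame rate is known:

```python
import numpy as np

def trajectory_stats(xy, fps):
    """Per-step speeds and total path length from an (n, 2) coordinate array.

    xy:  sequence of (x, y) positions, one per frame.
    fps: frames per second of the recording.
    """
    xy = np.asarray(xy, dtype=float)
    steps = np.linalg.norm(np.diff(xy, axis=0), axis=1)  # displacement per frame
    speeds = steps * fps                                 # distance units per second
    return speeds, steps.sum()

# Hypothetical 2D track recorded at 25 frames per second.
track = [(0, 0), (0, 3), (4, 3), (4, 7)]
speeds, path_length = trajectory_stats(track, fps=25)
print(path_length)    # 11.0
print(speeds.mean())  # about 91.67
```

From the same coordinate arrays one can equally derive turning angles, home ranges, or inter-individual distances, which is why trajectories are such a flexible currency for the ecological questions discussed in the main text.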

As in 2D, multiple 3D imaging cameras can be employed simultaneously to provide additional resolution and to cope with occlusions [29]. 

The position and pose of organisms with stiff and simple-shaped bodies can be computed by fitting a shape contour to the image of the organism [8,27] (Figure ID), including determining whether clumps of pixels should be separated into multiple individuals (Figure IE–I).

Constraints on the acquisition, processing, and storage of digital information limit the spatiotemporal extent of image-based tracking, and extracting the position and pose of every individual in each image is difficult in complex habitat and at high densities.

General traits can be sufficient for maintaining identities at low densities or when individuals vary greatly in size or shape, but in many other instances in ecology individuals are likely to be similarly sized or shaped.