Q2. What are the future works in "Improved data association and occlusion handling for vision-based people tracking by mobile robots" ?
Such a solution has obvious pitfalls that should be addressed in future work, such as proper handling of misclassification errors, wrong assignments after occlusions, and uniformly dressed people.
Q3. How long does it take to remove particles from a tracker?
The authors keep the particles of a totally occluded tracker for a short time (a value of 8 frames is used) so that, when quick occlusions occur, the velocity of the particles may still allow the occlusion to be resolved.
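This grace period can be sketched as a per-tracker counter. The 8-frame limit comes from the text; the `Tracker` class and the occlusion test around it are hypothetical illustration, not the authors' implementation.

```python
# Sketch: keep a fully occluded tracker alive for a few frames so that
# its particles' velocities can carry it through a quick occlusion.
# The 8-frame limit is from the paper; the surrounding structure is assumed.
OCCLUSION_TIMEOUT = 8  # frames a totally occluded tracker survives

class Tracker:
    def __init__(self):
        self.occluded_frames = 0

    def update_occlusion(self, fully_occluded: bool) -> bool:
        """Return True if the tracker should be kept, False to remove it."""
        if fully_occluded:
            self.occluded_frames += 1
        else:
            self.occluded_frames = 0  # visible again: reset the counter
        return self.occluded_frames <= OCCLUSION_TIMEOUT
```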
Q4. How did the authors obtain the ground truth data?
To obtain the ground truth data the authors used a flood-fill segmentation algorithm corrected afterwards by hand using the ViPER-GT tool [3].
Q5. How is the weight of the detection particles penalised?
To avoid multiple detections in the same or similar regions, the weight of detection particles is penalised by a factor ψ_d < 1 in cases where particles cross already detected areas.
Q6. What are the two kinds of metric used to indicate the quality of the tracking procedure?
The authors use two kinds of metrics that indicate the quality of the tracking procedure: detection metrics (counting persons) and localisation metrics (area matching).
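The paper does not spell out the area-matching formula here; intersection-over-union is a common instance of such a localisation metric, so the following is a sketch under that assumption.

```python
def iou(a, b):
    """Intersection-over-union of two axis-aligned boxes (x1, y1, x2, y2).

    Offered as one plausible area-matching localisation metric; the paper's
    exact definition may differ.
    """
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))  # overlap width
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))  # overlap height
    inter = ix * iy
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```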
Q7. What is the weight update equation for established tracking filters?
The weight update equation for established tracking filters is changed to w^i_t ∝ p(z_t | x_t = x^i_t)·ψ, where ψ = e^(−ρ·g_im) and g_im expresses the amount of overlap between particle i and region m, which is multiplied by a factor ρ in the exponent of the penalty term.
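This penalised update can be written directly, assuming the likelihood p(z_t | x_t = x^i_t) and the overlap g_im are already computed; the function name is illustrative.

```python
import math

def penalised_weight(likelihood: float, overlap: float, rho: float) -> float:
    """w^i_t ∝ p(z_t | x_t = x^i_t) * psi with psi = exp(-rho * overlap).

    `likelihood` is p(z_t | x_t = x^i_t), `overlap` is g_im (amount of
    overlap between particle i and region m), and `rho` scales the overlap
    in the exponent of the penalty term.
    """
    psi = math.exp(-rho * overlap)
    return likelihood * psi
```

With no overlap the penalty vanishes (ψ = 1); larger overlaps shrink the weight exponentially.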
Q8. How long does it take to calculate a step of the tracking procedure?
Calculating one step of the tracking procedure takes about twice as long when using all three moments compared to the tracker based on thermal information only (around 30 Hz on a 2.00 GHz processor when using 1000 samples).
Q9. What is the weight update equation for the ith detection particle?
The weight update equation for the ith detection particle is modified to w^i_t ∝ p(z_t | x_t = x^i_t)·ψ, where ψ = ψ_d if particle i overlaps with other detected regions and ψ = 1 otherwise.
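A minimal sketch of this rule follows; the default value 0.5 for ψ_d is an assumed placeholder (the paper only requires ψ_d < 1), and the function name is illustrative.

```python
def detection_weight(likelihood: float, overlaps_detected: bool,
                     psi_d: float = 0.5) -> float:
    """w^i_t ∝ p(z_t | x_t = x^i_t) * psi.

    psi = psi_d (< 1) if the particle overlaps an already detected region,
    psi = 1 otherwise. The value 0.5 for psi_d is a placeholder, not from
    the paper.
    """
    psi = psi_d if overlaps_detected else 1.0
    return likelihood * psi
```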
Q10. What is the fitness value for each sample i?
A fitness value f^i for each sample i is then calculated as the sum of all gradients multiplied with individual weights α_j for each region: f^i = Σ_{j=1}^{m} α_j·∆^i_j.
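The weighted sum can be sketched as follows, assuming the m per-region gradients ∆^i_j and weights α_j are already available as sequences; the function name is illustrative.

```python
def fitness(gradients, weights):
    """f^i = sum over j = 1..m of alpha_j * Delta^i_j.

    `gradients` holds the m region gradients Delta^i_j for sample i,
    `weights` the corresponding region weights alpha_j.
    """
    assert len(gradients) == len(weights), "one weight per region"
    return sum(a * d for a, d in zip(weights, gradients))
```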
Q11. What is the importance of the ellipse?
To calculate the importance weight w^i_t of a sample i with state x^i_t, the authors divide the ellipses into m = 7 different regions (see Fig. 2), and for each region j the image gradient ∆^i_j between pixels in the inner and outer parts of the ellipse is calculated.
Q12. How can the authors determine the region corresponding to a person on the colour image?
By using the affine transformation the authors are able to determine the region corresponding to a person on the colour image (see Fig. 3).
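The paper does not give the transformation's coefficients; as a sketch, mapping a point from the thermal image to the colour image with a 2×3 affine matrix [A | t] (values assumed to come from an offline calibration) looks like this:

```python
import numpy as np

def map_point(affine: np.ndarray, point) -> np.ndarray:
    """Apply a 2x3 affine matrix [A | t] to a 2D point: p' = A @ p + t.

    The matrix is assumed to be calibrated between the thermal and
    colour cameras; any concrete values here are illustrative only.
    """
    A, t = affine[:, :2], affine[:, 2]
    return A @ np.asarray(point, dtype=float) + t
```

Mapping the corner points of a person's bounding region this way yields the corresponding region in the colour image.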
Q13. How is the order of persons determined?
The order of the persons from front to back is then determined by a sort procedure requiring M_O · log(M_O) comparisons, where M_O specifies the number of overlapping persons.
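Any comparison sort achieves this M_O · log(M_O) bound; the sketch below assumes each person carries a single scalar depth score standing in for the combined thermal/colour ordering features, which is a simplification of the paper's feature set.

```python
def front_to_back(persons):
    """Sort overlapping persons front-to-back with a comparison sort,
    which needs on the order of M_O * log(M_O) comparisons.

    Each person is a (person_id, depth_score) pair; a smaller depth score
    means closer to the camera. The scalar score is an assumed stand-in
    for the paper's thermal and colour ordering features.
    """
    return sorted(persons, key=lambda p: p[1])
```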
Q14. What is the trade-off between time requirements and performance of the tracker?
A good trade-off between time requirements and tracker performance for their setup is a representation using just the first moment of the colour distribution (46% more time compared to the gradient-based tracker).
Q15. What features are used to indicate the order of overlapping persons in the image?
There are several features that could indicate the order of overlapping persons in the image, from which the authors have chosen a set of three thermal and three colour features.