Author

Marc Lalonde

Bio: Marc Lalonde is an academic researcher from McGill University. The author has contributed to research in the topics of image processing and video tracking. The author has an h-index of 9, and has co-authored 27 publications receiving 876 citations.

Papers
Journal ArticleDOI
TL;DR: Reports on the design and test of an image processing algorithm for localizing the optic disk (OD) in low-resolution (about 20 µm/pixel) color fundus images; a confidence level associated with the final detection indicates the "level of difficulty" the detector had in identifying the OD position and shape.
Abstract: Reports on the design and test of an image processing algorithm for the localization of the optic disk (OD) in low-resolution (about 20 µm/pixel) color fundus images. The design relies on the combination of two procedures: 1) a Hausdorff-based template matching technique on edge maps, guided by 2) a pyramidal decomposition for large-scale object tracking. The two approaches are tested against a database of 40 images of various visual quality and retinal pigmentation, as well as of normal and small pupils. An average error of 7% on OD center positioning is reached with no false detection. In addition, a confidence level is associated with the final detection that indicates the "level of difficulty" the detector had in identifying the OD position and shape.

413 citations
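The Hausdorff-based matching named above compares a template's edge points to an image's edge points. A minimal sketch of the symmetric Hausdorff distance on two point sets (an illustration only, not the paper's implementation, which combines this measure with a pyramidal search over the full fundus image):

```python
import numpy as np

def directed_hausdorff(A, B):
    """Max over points of A of the distance to the nearest point of B."""
    # Pairwise distance matrix between the two point sets (|A| x |B|).
    d = np.linalg.norm(A[:, None, :] - B[None, :, :], axis=2)
    return d.min(axis=1).max()

def hausdorff(A, B):
    """Symmetric Hausdorff distance between edge-point sets A and B."""
    return max(directed_hausdorff(A, B), directed_hausdorff(B, A))

template = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
edges    = np.array([[0.1, 0.0], [1.0, 0.1], [0.0, 0.9]])
print(hausdorff(template, edges))  # small value -> good template match
```

A template position is scored by the distance between its edge points and the nearby image edges; low distance means a likely OD location.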

Proceedings ArticleDOI
03 Jul 2001
TL;DR: An overview of the design and test of an image processing procedure for detecting all important anatomical structures (optic disk, macula, and retinal vessel network) in color fundus images.
Abstract: We present an overview of the design and test of an image processing procedure for detecting all important anatomical structures in color fundus images. These structures are the optic disk, the macula and the retinal network. The algorithm proceeds through five main steps: (1) automatic mask generation using pixel value statistics and color thresholds, (2) visual image quality assessment using histogram matching and Canny edge distribution modeling, (3) optic disk localization using pyramidal decomposition, Hausdorff-based template matching and confidence assignment, (4) macula localization using pyramidal decomposition and (5) vessel network tracking using recursive dual edge tracking and connectivity recovering. The procedure has been tested on a database of about 40 color fundus images acquired from a digital non-mydriatic fundus camera. The database is composed of images of various types (macula- and optic disk-centered) and of various visual quality (with or without abnormal bright or dark regions, blurred, etc.). © 2001 SPIE--The International Society for Optical Engineering.

191 citations
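Steps (3) and (4) both rely on pyramidal decomposition: localize coarsely at a reduced scale, then refine. A toy sketch of the coarse-to-fine idea using a simple box-filter pyramid (illustrative only; the paper's actual decomposition and matching differ):

```python
import numpy as np

def downsample(img):
    """Average 2x2 blocks and keep every second pixel (one pyramid step)."""
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2
    img = img[:h, :w]
    return (img[0::2, 0::2] + img[1::2, 0::2] +
            img[0::2, 1::2] + img[1::2, 1::2]) / 4.0

def pyramid(img, levels):
    """List of images from full resolution down to the coarsest level."""
    out = [img]
    for _ in range(levels - 1):
        out.append(downsample(out[-1]))
    return out

img = np.zeros((16, 16))
img[4:8, 4:8] = 1.0                    # a bright square standing in for the OD
coarse = pyramid(img, 3)[-1]           # 4x4 coarsest level
y, x = np.unravel_index(coarse.argmax(), coarse.shape)
print((y * 4, x * 4))                  # coarse estimate mapped to full res
```

Searching the coarsest level first shrinks the search space; the estimate is then refined at finer levels around the mapped-back position.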

Proceedings ArticleDOI
28 May 2007
TL;DR: This paper reports on the implementation of a GPU-based, real-time eye blink detector on very low contrast images acquired under near-infrared illumination that is part of a multi-sensor data acquisition and analysis system for driver performance assessment and training.
Abstract: This paper reports on the implementation of a GPU-based, real-time eye blink detector on very low contrast images acquired under near-infrared illumination. This detector is part of a multi-sensor data acquisition and analysis system for driver performance assessment and training. Eye blinks are detected inside regions of interest that are aligned with the subject's eyes at initialization. Alignment is maintained through time by tracking SIFT feature points that are used to estimate the affine transformation between the initial face pose and the pose in subsequent frames. The GPU implementation of the SIFT feature point extraction algorithm ensures real-time processing. An eye blink detection rate of 97% is obtained on a video dataset of 33,000 frames showing 237 blinks from 22 subjects.

123 citations
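The pose tracking described above amounts to estimating an affine transformation from matched feature points. A hedged least-squares sketch of that step (the GPU SIFT extraction itself is not reproduced; point sets here are made up):

```python
import numpy as np

def fit_affine(src, dst):
    """Solve dst ~= A @ src + t in least squares; return the 2x3 map [A | t]."""
    n = len(src)
    X = np.hstack([src, np.ones((n, 1))])   # homogeneous source points, n x 3
    # One least-squares solve covers both output coordinates at once.
    M, *_ = np.linalg.lstsq(X, dst, rcond=None)
    return M.T                              # 2 x 3 affine matrix

src = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
dst = src * 2.0 + np.array([3.0, 5.0])      # a known scale + translation
M = fit_affine(src, dst)
print(np.round(M, 3))                       # recovers [[2, 0, 3], [0, 2, 5]]
```

Applying the fitted transform to the initial eye regions keeps them aligned with the face in later frames.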

Proceedings ArticleDOI
27 Apr 2007
TL;DR: The paper reports on the development of a software module for autonomous object detection, recognition and tracking in outdoor urban environments, and on its operational uses within the commercial system.
Abstract: The paper reports on the development of a software module that allows autonomous object detection, recognition and tracking in an outdoor urban environment. The purpose of the project was to endow a commercial PTZ (pan-tilt-zoom) camera with object tracking and recognition capability to automate some surveillance tasks. The module can discriminate between various moving objects, identify the presence of pedestrians or vehicles, track them, and zoom on them, in near real-time. The paper gives an overview of the module characteristics and its operational uses within the commercial system.

47 citations

Proceedings ArticleDOI
25 Aug 1996
TL;DR: A system that allows the user to input maps into a geographic information system (GIS) by using automatic symbol and line recognition based on the Hausdorff distance and neural networks is presented.
Abstract: We present a system that allows the user to input maps into a geographic information system (GIS) by using automatic symbol and line recognition. The system is composed of a user interface, a symbol recognition engine, a knowledge base and a database. The recognition is based on the Hausdorff distance and neural networks, where our main contribution is to make the recognition efficient and robust for handling very large maps and many symbols of different scales and orientations. The system allows for efficient and coherent management of maps, recognition processes and recognition results.

28 citations


Cited by
Journal ArticleDOI
TL;DR: A method is presented for automated segmentation of vessels in two-dimensional color images of the retina based on extraction of image ridges, which coincide approximately with vessel centerlines, which is compared with two recently published rule-based methods.
Abstract: A method is presented for automated segmentation of vessels in two-dimensional color images of the retina. This method can be used in computer analyses of retinal images, e.g., in automated screening for diabetic retinopathy. The system is based on extraction of image ridges, which coincide approximately with vessel centerlines. The ridges are used to compose primitives in the form of line elements. With the line elements an image is partitioned into patches by assigning each image pixel to the closest line element. Every line element constitutes a local coordinate frame for its corresponding patch. For every pixel, feature vectors are computed that make use of properties of the patches and the line elements. The feature vectors are classified using a kNN-classifier and sequential forward feature selection. The algorithm was tested on a database consisting of 40 manually labeled images. The method achieves an area under the receiver operating characteristic curve of 0.952. The method is compared with two recently published rule-based methods of Hoover et al. and Jiang et al. The results show that our method is significantly better than the two rule-based methods (p<0.01). The accuracy of our method is 0.944 versus 0.947 for a second observer.

3,416 citations
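The per-pixel kNN classification step above can be sketched in a few lines. A toy illustration with two hypothetical feature dimensions (the paper's actual ridge-derived features are richer and selected by sequential forward selection):

```python
import numpy as np

def knn_predict(train_X, train_y, query, k=3):
    """Label a query feature vector by majority vote of its k nearest samples."""
    d = np.linalg.norm(train_X - query, axis=1)   # Euclidean distances
    nearest = train_y[np.argsort(d)[:k]]          # labels of k closest points
    return np.bincount(nearest).argmax()          # majority vote

# Made-up 2-D features, e.g. a ridge-strength-like and a contrast-like value.
train_X = np.array([[0.9, 0.8], [0.8, 0.9], [0.1, 0.2], [0.2, 0.1]])
train_y = np.array([1, 1, 0, 0])                  # 1 = vessel, 0 = background
print(knn_predict(train_X, train_y, np.array([0.85, 0.85])))  # -> 1
```

Running this vote for every pixel's feature vector yields the vessel/background segmentation map.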

Proceedings ArticleDOI
29 Sep 2007
TL;DR: This paper uses a bag of words approach to represent videos, and presents a method to discover relationships between spatio-temporal words in order to better describe the video data.
Abstract: In this paper we introduce a 3-dimensional (3D) SIFT descriptor for video or 3D imagery such as MRI data. We also show how this new descriptor is able to better represent the 3D nature of video data in the application of action recognition. This paper will show how 3D SIFT is able to outperform previously used description methods in an elegant and efficient manner. We use a bag of words approach to represent videos, and present a method to discover relationships between spatio-temporal words in order to better describe the video data.

1,757 citations
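The bag-of-words video representation mentioned above quantizes local descriptors against a codebook and counts word occurrences. A minimal sketch with toy 2-D stand-ins for 3D SIFT descriptors:

```python
import numpy as np

def bag_of_words(descriptors, codebook):
    """L1-normalized histogram of nearest-codeword assignments."""
    # Distance from every descriptor to every codeword (n x k matrix).
    d = np.linalg.norm(descriptors[:, None, :] - codebook[None, :, :], axis=2)
    words = d.argmin(axis=1)                      # hard assignment
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / hist.sum()

codebook = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]])
descs = np.array([[0.1, 0.0], [0.9, 1.1], [1.1, 0.9], [2.0, 0.1]])
print(bag_of_words(descs, codebook))  # [0.25, 0.5, 0.25]
```

Each video becomes one such histogram, so clips of different lengths can be compared with a single fixed-size vector.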

Journal ArticleDOI
TL;DR: A supervised blood vessel detection method that uses a neural network scheme for pixel classification, representing each pixel by a 7-D vector of gray-level and moment invariants-based features; it is suitable for retinal image computer analyses such as automated screening for early diabetic retinopathy detection.
Abstract: This paper presents a new supervised method for blood vessel detection in digital retinal images. This method uses a neural network (NN) scheme for pixel classification and computes a 7-D vector composed of gray-level and moment invariants-based features for pixel representation. The method was evaluated on the publicly available DRIVE and STARE databases, widely used for this purpose, since they contain retinal images where the vascular structure has been precisely marked by experts. Method performance on both sets of test images is better than other existing solutions in the literature. The method proves especially accurate for vessel detection in STARE images. Its application to this database (even when the NN was trained on the DRIVE database) outperforms all analyzed segmentation approaches. Its effectiveness and robustness with different image conditions, together with its simplicity and fast implementation, make this blood vessel segmentation proposal suitable for retinal image computer analyses such as automated screening for early diabetic retinopathy detection.

913 citations

Book
20 Apr 2009
TL;DR: This book and the accompanying website focus on template matching, a subset of object recognition techniques of wide applicability, which has proved to be particularly effective for face recognition applications.
Abstract: The detection and recognition of objects in images is a key research topic in the computer vision community. Within this area, face recognition and interpretation has attracted increasing attention owing to the possibility of unveiling human perception mechanisms, and for the development of practical biometric systems. This book and the accompanying website focus on template matching, a subset of object recognition techniques of wide applicability, which has proved to be particularly effective for face recognition applications. Using examples from face processing tasks throughout the book to illustrate more general object recognition approaches, Roberto Brunelli: examines the basics of digital image formation, highlighting points critical to the task of template matching; presents basic and advanced template matching techniques, targeting grey-level images, shapes and point sets; discusses recent pattern classification paradigms from a template matching perspective; illustrates the development of a real face recognition system; explores the use of advanced computer graphics techniques in the development of computer vision algorithms. Template Matching Techniques in Computer Vision is primarily aimed at practitioners working on the development of systems for effective object recognition such as biometrics, robot navigation, multimedia retrieval and landmark detection. It is also of interest to graduate students undertaking studies in these areas.

721 citations

Book ChapterDOI
06 Sep 2014
TL;DR: A Convolutional Neural Network classifier for text spotting in natural images, combined with a method for automated data mining of Flickr that generates word- and character-level annotations, is used to form an end-to-end, state-of-the-art text spotting system.
Abstract: The goal of this work is text spotting in natural images. This is divided into two sequential tasks: detecting word regions in the image, and recognizing the words within these regions. We make the following contributions: first, we develop a Convolutional Neural Network (CNN) classifier that can be used for both tasks. The CNN has a novel architecture that enables efficient feature sharing (by using a number of layers in common) for text detection, character case-sensitive and insensitive classification, and bigram classification. It exceeds the state-of-the-art performance for all of these. Second, we make a number of technical changes over traditional CNN architectures, including no downsampling for a per-pixel sliding window, and multi-mode learning with a mixture of linear models (maxout). Third, we develop a method for automated data mining of Flickr that generates word- and character-level annotations. Finally, these components are used together to form an end-to-end, state-of-the-art text spotting system. We evaluate the text-spotting system on two standard benchmarks, the ICDAR Robust Reading data set and the Street View Text data set, and demonstrate improvements over the state-of-the-art on multiple measures.

681 citations
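The "mixture of linear models (maxout)" mentioned above takes the elementwise maximum over several affine maps of the input. A small sketch with arbitrary illustrative shapes (not the paper's trained network):

```python
import numpy as np

def maxout(x, W, b):
    """Maxout unit: W has shape (pieces, out, in), b has shape (pieces, out).
    Applies one affine map per piece, then takes the max across pieces."""
    z = np.einsum('poi,i->po', W, x) + b   # (pieces, out) pre-activations
    return z.max(axis=0)                   # elementwise max over pieces

# With two opposite-signed linear pieces, maxout reproduces |x|.
W = np.zeros((2, 1, 1))
W[0, 0, 0], W[1, 0, 0] = 1.0, -1.0
b = np.zeros((2, 1))
print(maxout(np.array([-2.0]), W, b))  # [2.]
```

Because the max of affine pieces is a learned piecewise-linear activation, maxout subsumes common fixed nonlinearities such as ReLU and absolute value.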