scispace - formally typeset
Search or ask a question
Author

Bruno Steux

Bio: Bruno Steux is an academic researcher from Mines ParisTech. The author has contributed to research in topics: AdaBoost & Object detection. The author has an hindex of 14, co-authored 34 publications receiving 821 citations.

Papers
More filters
Proceedings ArticleDOI
30 Sep 2008
TL;DR: A first experimental evaluation conducted on a publicly available set of low-resolution videos in a commercial mall shows very promising inter-camera person re-identification performances and the matching method is very fast, making re- identification among hundreds of persons computationally feasible in less than ~ 1/5 second.
Abstract: We present and evaluate a person re-identification scheme for multi-camera surveillance system. Our approach uses matching of signatures based on interest-points descriptors collected on short video sequences. One of the originalities of our method is to accumulate interest points on several sufficiently time-spaced images during person tracking within each camera, in order to capture appearance variability. A first experimental evaluation conducted on a publicly available set of low-resolution videos in a commercial mall shows very promising inter-camera person re-identification performances (a precision of 82% for a recall of 78%). It should also be noted that our matching method is very fast: ~ 1/8s for re-identification of one target person among 10 previously seen persons, and a logarithmic dependence with the number of stored person models, making re- identification among hundreds of persons computationally feasible in less than ~ 1/5 second.

272 citations

Proceedings ArticleDOI
01 Dec 2010
TL;DR: A Laser-SLAM algorithm which can be programmed in less than 200 lines C-language program and shows the possibility to perform complex tasks using simple and easily programmable algorithms.
Abstract: This paper presents a Laser-SLAM algorithm which can be programmed in less than 200 lines C-language program. The first idea aimed to develop and implement a simple SLAM algorithm providing good performances without exceeding 200 lines in a C-language program. We use a robotic platform called MinesRover, a six wheels robot with several sensors. We based our work and calculations on a laser sensor and the odometry of the robot. The article presents the different capabilities of the platform and the way we use them in order to improve our programs. We also illustrates the difficulties encountered during the programming and testing of the algorithm. This work shows the possibility to perform complex tasks using simple and easily programmable algorithms.

85 citations

Posted Content
TL;DR: In this article, a mathematical explanation of the ubiquity of PID controllers in the industry has been provided by comparing their sampling with the one of intelligent PID controllers, which were recently introduced.
Abstract: The ubiquity of PID controllers in the industry has remained mysterious until now. We provide here a mathematical explanation of this strange phenomenon by comparing their sampling with the the one of "intelligent" PID controllers, which were recently introduced. Some computer simulations nevertheless confirm the superiority of the new intelligent feedback design.

56 citations

Proceedings ArticleDOI
14 Jun 2004
TL;DR: A highly-efficient algorithm for classification of pedestrian images using a learned set of features, each feature based on a 5/spl times/5 pixels shape, done using AdaBoost and genetic-like algorithms is presented.
Abstract: We present a system for pedestrian detection and impact prediction, from a frontal camera situated on a moving vehicle. The system combines together the output of several algorithms to form a reliable detection and positioning of pedestrians. One of the important contributions of this paper is a highly-efficient algorithm for classification of pedestrian images using a learned set of features, each feature based on a 5/spl times/5 pixels shape. The learning of the features is done using AdaBoost and genetic-like algorithms. The described application was developed as a part of the CAMELLIA project, thus all the algorithms used in this application are designed to use a special set of low level image processing operations provided by the smart imaging core developed in the project. Fusion of the various algorithms results and tracking of pedestrians is done using particle filtering, providing a good tool to predict the future movement of pedestrians, in order to estimate impact probability.

55 citations

Proceedings ArticleDOI
23 Jun 2010
TL;DR: A mathematical explanation of the ubiquity of PID controllers in the industry is provided by comparing their sampling with the one of “intelligent” PID controllers, which were recently introduced.
Abstract: The ubiquity of PID controllers in the industry has remained mysterious until now. We provide here a mathematical explanation of this strange phenomenon by comparing their sampling with the the one of “intelligent” PID controllers, which were recently introduced. Some computer simulations nevertheless confirm the superiority of the new intelligent feedback design.

47 citations


Cited by
More filters
Proceedings ArticleDOI
07 Jun 2015
TL;DR: This paper proposes an effective feature representation called Local Maximal Occurrence (LOMO), and a subspace and metric learning method called Cross-view Quadratic Discriminant Analysis (XQDA), and presents a practical computation method for XQDA.
Abstract: Person re-identification is an important technique towards automatic search of a person's presence in a surveillance video. Two fundamental problems are critical for person re-identification, feature representation and metric learning. An effective feature representation should be robust to illumination and viewpoint changes, and a discriminant metric should be learned to match various person images. In this paper, we propose an effective feature representation called Local Maximal Occurrence (LOMO), and a subspace and metric learning method called Cross-view Quadratic Discriminant Analysis (XQDA). The LOMO feature analyzes the horizontal occurrence of local features, and maximizes the occurrence to make a stable representation against viewpoint changes. Besides, to handle illumination variations, we apply the Retinex transform and a scale invariant texture operator. To learn a discriminant metric, we propose to learn a discriminant low dimensional subspace by cross-view quadratic discriminant analysis, and simultaneously, a QDA metric is learned on the derived subspace. We also present a practical computation method for XQDA, as well as its regularization. Experiments on four challenging person re-identification databases, VIPeR, QMUL GRID, CUHK Campus, and CUHK03, show that the proposed method improves the state-of-the-art rank-1 identification rates by 2.2%, 4.88%, 28.91%, and 31.55% on the four databases, respectively.

2,209 citations

Proceedings ArticleDOI
13 Jun 2010
TL;DR: An appearance-based method for person re-identification that consists in the extraction of features that model three complementary aspects of the human appearance: the overall chromatic content, the spatial arrangement of colors into stable regions, and the presence of recurrent local motifs with high entropy.
Abstract: In this paper, we present an appearance-based method for person re-identification. It consists in the extraction of features that model three complementary aspects of the human appearance: the overall chromatic content, the spatial arrangement of colors into stable regions, and the presence of recurrent local motifs with high entropy. All this information is derived from different body parts, and weighted opportunely by exploiting symmetry and asymmetry perceptual principles. In this way, robustness against very low resolution, occlusions and pose, viewpoint and illumination changes is achieved. The approach applies to situations where the number of candidates varies continuously, considering single images or bunch of frames for each individual. It has been tested on several public benchmark datasets (ViPER, iLIDS, ETHZ), gaining new state-of-the-art performances.

1,674 citations

Posted Content
TL;DR: The history of person re-identification and its relationship with image classification and instance retrieval is introduced and two new re-ID tasks which are much closer to real-world applications are described and discussed.
Abstract: Person re-identification (re-ID) has become increasingly popular in the community due to its application and research significance. It aims at spotting a person of interest in other cameras. In the early days, hand-crafted algorithms and small-scale evaluation were predominantly reported. Recent years have witnessed the emergence of large-scale datasets and deep learning systems which make use of large data volumes. Considering different tasks, we classify most current re-ID methods into two classes, i.e., image-based and video-based; in both tasks, hand-crafted and deep learning systems will be reviewed. Moreover, two new re-ID tasks which are much closer to real-world applications are described and discussed, i.e., end-to-end re-ID and fast re-ID in very large galleries. This paper: 1) introduces the history of person re-ID and its relationship with image classification and instance retrieval; 2) surveys a broad selection of the hand-crafted systems and the large-scale methods in both image- and video-based re-ID; 3) describes critical future directions in end-to-end re-ID and fast retrieval in large galleries; and 4) finally briefs some important yet under-developed issues.

984 citations

Journal ArticleDOI
TL;DR: This paper provides a review of the literature in on-road vision-based vehicle detection, tracking, and behavior understanding, and discusses the nascent branch of intelligent vehicles research concerned with utilizing spatiotemporal measurements, trajectories, and various features to characterize on- road behavior.
Abstract: This paper provides a review of the literature in on-road vision-based vehicle detection, tracking, and behavior understanding. Over the past decade, vision-based surround perception has progressed from its infancy into maturity. We provide a survey of recent works in the literature, placing vision-based vehicle detection in the context of sensor-based on-road surround analysis. We detail advances in vehicle detection, discussing monocular, stereo vision, and active sensor-vision fusion for on-road vehicle detection. We discuss vision-based vehicle tracking in the monocular and stereo-vision domains, analyzing filtering, estimation, and dynamical models. We discuss the nascent branch of intelligent vehicles research concerned with utilizing spatiotemporal measurements, trajectories, and various features to characterize on-road behavior. We provide a discussion on the state of the art, detail common performance metrics and benchmarks, and provide perspective on future research directions in the field.

862 citations

Proceedings ArticleDOI
01 Jul 2017
TL;DR: A new deep learning framework for person search that jointly handles pedestrian detection and person re-identification in a single convolutional neural network and converges much faster and better than the conventional Softmax loss.
Abstract: Existing person re-identification benchmarks and methods mainly focus on matching cropped pedestrian images between queries and candidates. However, it is different from real-world scenarios where the annotations of pedestrian bounding boxes are unavailable and the target person needs to be searched from a gallery of whole scene images. To close the gap, we propose a new deep learning framework for person search. Instead of breaking it down into two separate tasks—pedestrian detection and person re-identification, we jointly handle both aspects in a single convolutional neural network. An Online Instance Matching (OIM) loss function is proposed to train the network effectively, which is scalable to datasets with numerous identities. To validate our approach, we collect and annotate a large-scale benchmark dataset for person search. It contains 18,184 images, 8,432 identities, and 96,143 pedestrian bounding boxes. Experiments show that our framework outperforms other separate approaches, and the proposed OIM loss function converges much faster and better than the conventional Softmax loss.

757 citations