Showing papers in "Computer Vision and Image Understanding in 2004"

PDF

Open Access

Journal Article•DOI•

A linear-time component-labeling algorithm using contour tracing technique

[...]

Fu Chang¹, Chun-Jen Chen¹, Chi-Jen Lu¹•Institutions (1)

01 Feb 2004-Computer Vision and Image Understanding

TL;DR: A new linear-time algorithm is presented in this paper that simultaneously labels connected components and their contours in binary images and extracts component contours and sequential orders of contour points, which can be useful for many applications.

...read moreread less

599 citations

Journal Article•DOI•

Cast shadow segmentation using invariant color features

[...]

Elena Salvador, Andrea Cavallaro¹, Touradj Ebrahimi•Institutions (1)

Queen Mary University of London¹

01 Aug 2004-Computer Vision and Image Understanding

TL;DR: A new cast shadow segmentation algorithm is proposed that exploits spectral and geometrical properties of shadows in a scene to perform this task and is robust and efficient in detecting shadows for a large class of scenes.

...read moreread less

408 citations

Journal Article•DOI•

Silhouette and stereo fusion for 3D object modeling

[...]

Carlos Hernández Esteban¹, Francis Schmitt¹•Institutions (1)

École Normale Supérieure¹

01 Dec 2004-Computer Vision and Image Understanding

TL;DR: A new approach to high quality 3D object reconstruction, based on a deformable model, which defines the framework where texture and silhouette information can be fused by defining two external forces based on the images: a texture driven force and a silhouette driven force.

...read moreread less

406 citations

Journal Article•DOI•

Layered representations for learning and inferring office activity from multiple sensory channels

[...]

Nuria Oliver¹, Ashutosh Garg², Eric Horvitz¹•Institutions (2)

Microsoft¹, University of Illinois at Urbana–Champaign²

01 Nov 2004-Computer Vision and Image Understanding

TL;DR: The use of layered probabilistic representations for modeling human activities is presented, and how the representation is used to do sensing, learning, and inference at multiple levels of temporal granularity and abstraction and from heterogeneous data sources is described.

...read moreread less

370 citations

Journal Article•DOI•

Video-based event recognition: activity representation and probabilistic recognition methods

[...]

Somboon Hongeng¹, Ram Nevatia¹, Francois Bremond¹•Institutions (1)

University of Southern California¹

01 Nov 2004-Computer Vision and Image Understanding

TL;DR: A new representation and recognition method for human activities that recognizes multi-agent events by propagating the constraints and likelihood of event threads in a temporal logic network and presents results on real-world data and performance characterization on perturbed data.

...read moreread less

351 citations

Journal Article•DOI•

Moment invariants for recognition under changing viewpoint and illumination

[...]

Florica Mindru¹, Tinne Tuytelaars¹, Luc Van Gool², Theo Moons³•Institutions (3)

Katholieke Universiteit Leuven¹, École Polytechnique Fédérale de Lausanne², Catholic University of Brussels³

01 Apr 2004-Computer Vision and Image Understanding

TL;DR: Although the generalised color moment invariants are extracted from planar surface patches, it is argued that invariant neighbourhoods offer a concept through which they can also be used to deal with 3D objects and scenes.

...read moreread less

279 citations

Journal Article•DOI•

Region-based image retrieval using integrated color, shape, and location index

[...]

B. G. Prasad¹, Kanad K. Biswas¹, S. K. Gupta¹•Institutions (1)

Indian Institutes of Technology¹

01 Apr 2004-Computer Vision and Image Understanding

TL;DR: Results obtained show that retrieval effectiveness increases in non-cascaded region-based querying by combined index, and also based on a combined color shape location index.

...read moreread less

162 citations

Journal Article•DOI•

Registration without ICP

[...]

Helmut Pottmann¹, Stefan Leopoldseder¹, Michael Hofer¹•Institutions (1)

Vienna University of Technology¹

01 Jul 2004-Computer Vision and Image Understanding

TL;DR: A new approach to the geometric alignment of a point cloud to a surface and to related registration problems which relies on instantaneous kinematics and on the geometry of the squared distance function of a surface is presented.

...read moreread less

159 citations

Journal Article•DOI•

Real-time 3D shape reconstruction, dynamic 3D mesh deformation, and high fidelity visualization for 3D video

[...]

Takashi Matsuyama¹, Xiaojun Wu¹, Takeshi Takai¹, Shohei Nobuhara¹•Institutions (1)

Kyoto University¹

01 Dec 2004-Computer Vision and Image Understanding

TL;DR: Experimental results with quantitative performance evaluations demonstrate the effectiveness of a PC cluster system for real-time reconstruction of dynamic 3D object action from multiview video images, a deformable 3D mesh model for reconstructing the accurate dynamic 2D object shape, and an algorithm of rendering natural-looking texture on the3D object surface from the multi-view video images.

...read moreread less

144 citations

Journal Article•DOI•

Image retrieval using color histograms generated by Gauss mixture vector quantization

[...]

Sangoh Jeong¹, Chee Sun Won², Robert M. Gray¹•Institutions (2)

Stanford University¹, Dongguk University²

01 Apr 2004-Computer Vision and Image Understanding

TL;DR: Results show that the histograms made by GMVQ with a penalized log-likelihood (LL) distortion yield better retrieval performance for color images than the conventional methods of uniform quantization and VQ with squared error distortion.

...read moreread less

131 citations

Journal Article•DOI•

Temporal spatio-velocity transform and its application to tracking and interaction

[...]

Koichi Sato¹, Jake K. Aggarwal¹•Institutions (1)

University of Texas at Austin¹

01 Nov 2004-Computer Vision and Image Understanding

TL;DR: The TSV transform provides an efficient way to remove noise by focusing on stable velocities, and constructs noise-free blobs, and is applied to tracking human figures in a sidewalk environment and extended to an interaction recognition system.

...read moreread less

Journal Article•DOI•

An automatic road sign recognition system based on a computational model of human recognition processing

[...]

Chiung-Yao Fang¹, Chiou-Shann Fuh², P. S. Yen¹, Shen Cherng³, Sei-Wang Chen¹ - Show less +1 more•Institutions (3)

National Taiwan Normal University¹, National Taiwan University², Cheng Shiu University³

01 Nov 2004-Computer Vision and Image Understanding

TL;DR: An automatic road sign detection and recognition system that is based on a computational model of human visual recognition processing and the experimental results revealed both the feasibility of the proposed computational model and the robustness of the developedRoad sign detection system.

...read moreread less

Journal Article•DOI•

Fast Euclidean distance transformation in two scans using a 3 × 3 neighborhood

[...]

Frank Y. Shih¹, Yi-Ta Wu¹•Institutions (1)

New Jersey Institute of Technology¹

01 Feb 2004-Computer Vision and Image Understanding

TL;DR: This work proposes a new, simple and fast EDT in two scans using a 3 × 3 neighborhood, and develops an optimal two-scan algorithm to achieve the EDT correctly and efficiently in a constant time without iterations.

...read moreread less

Journal Article•DOI•

Selection weighted vector directional filters

[...]

Rastislav Lukac¹, Bogdan Smolka², Konstantinos N. Plataniotis¹, Anastasios N. Venetsanopoulos¹•Institutions (2)

University of Toronto¹, Silesian University of Technology²

01 Apr 2004-Computer Vision and Image Understanding

TL;DR: The proposed angular optimization algorithms take advantage of adaptive stack filters design and weighted median filtering framework and are able to remove image noise, while maintaining excellent signal-detail preservation capabilities and sufficient robustness for a variety of signal and noise statistics.

...read moreread less

Journal Article•DOI•

Classifying offensive sites based on image content

[...]

Will Archer Arentz¹, Bjørn Olstad¹•Institutions (1)

Norwegian University of Science and Technology¹

01 Apr 2004-Computer Vision and Image Understanding

TL;DR: This paper proposes a method for helping to identify adult web sites by using the imagecontent as means of detecting erotic material, and proves to be quite successful in tests where all 20 sites where classified correctly.

...read moreread less

Journal Article•DOI•

Error analysis of pattern recognition systems: the subsets bootstrap

[...]

Ruud M. Bolle¹, Nalini K. Ratha¹, Sharath Pankanti¹•Institutions (1)

IBM¹

01 Jan 2004-Computer Vision and Image Understanding

TL;DR: It is argued that biometric match score accuracy is best expressed in terms of a curve, the Receiver Operating Characteristic curve, and confidence intervals, or margins of error, should be provided for this curve for determining whether accuracy differences between systems are really statistically significant.

...read moreread less

Journal Article•DOI•

Automatic description of complex buildings from multiple images

[...]

ZuWhan Kim¹, Ramakant Nevatia¹•Institutions (1)

University of Southern California¹

01 Oct 2004-Computer Vision and Image Understanding

TL;DR: 3-D rooftop boundary hypotheses are found from the line and junction features of the images by applying consecutive grouping procedures and are verified with evidence collected from the images and the elevation data.

...read moreread less

Journal Article•DOI•

Tri-view morphing

[...]

Jiangjian Xiao¹, Mubarak Shah¹•Institutions (1)

University of Central Florida¹

01 Dec 2004-Computer Vision and Image Understanding

TL;DR: This paper presents an efficient image-based approach to navigate a scene based on only three wide-baseline uncalibrated images without the explicit use of a 3D model, and demonstrates three applications of the tri-view morphing algorithm.

...read moreread less

Journal Article•DOI•

3-D reconstruction of static human body shape from image sequence

[...]

Fabio Remondino¹•Institutions (1)

École Polytechnique Fédérale de Lausanne¹

01 Jan 2004-Computer Vision and Image Understanding

TL;DR: The core of the presented work describes the calibration and orientation of the images, mostly based on photogrammetric techniques, and the reconstruction of the 3-D body model in point cloud form.

...read moreread less

Journal Article•DOI•

Synchronization of oscillations for machine perception of gaits

[...]

Jeffrey E. Boyd¹•Institutions (1)

University of Calgary¹

01 Oct 2004-Computer Vision and Image Understanding

TL;DR: The possibility of an alternative model for motion perception based on synchronization with the transient oscillations of temporal band-pass filters that is consistent with other proposed models for human perception is discussed.

...read moreread less

Journal Article•DOI•

Hierarchical Markovian segmentation of multispectral images for the reconstruction of water depth maps

[...]

J. N. Provost¹, Christophe Collet, P. Rostaing, Patrick Pérez², Patrick Bouthemy² - Show less +1 more•Institutions (2)

Centre national de la recherche scientifique¹, French Institute for Research in Computer Science and Automation²

01 Feb 2004-Computer Vision and Image Understanding

TL;DR: The designed segmentation method can be extended to images for which it is required to segment a region of interest using an unsupervised approach, and is applied to Satellite Pour l'Observation de la Terre remote multispectral images.

...read moreread less

Journal Article•DOI•

The analysis and applications of adaptive-binning color histograms

[...]

Wee Kheng Leow¹, Rui Li¹•Institutions (1)

National University of Singapore¹

01 Apr 2004-Computer Vision and Image Understanding

TL;DR: This paper defines a new dissimilarity measure that is more reliable than the Euclidean distance and yet computationally less expensive than EMD, and a mathematically sound definition of mean histogram can be defined for histogram clustering applications.

...read moreread less

Journal Article•DOI•

Detecting image orientation based on low-level visual content

[...]

Yongmei Michelle Wang¹, Hongjiang Zhang²•Institutions (2)

Yale University¹, Microsoft²

01 Mar 2004-Computer Vision and Image Understanding

TL;DR: This paper presents automatic image orientation detection algorithms based on both the luminance (structural) and chrominance (color) low-level content features based on the statistical learning support vector machines (SVMs) as the classifiers.

...read moreread less

Journal Article•DOI•

Image matching with scale adjustment

[...]

Yves Dufournaud¹, Cordelia Schmid¹, Radu Horaud¹•Institutions (1)

French Institute for Research in Computer Science and Automation¹

01 Feb 2004-Computer Vision and Image Understanding

TL;DR: This paper shows how to represent and extract interest points at variable scales and devise a method allowing the comparison of two images at two different resolutions, using a photometric- and rotation-invariant descriptors and an image matching strategy based on local constraints and on the robust estimation of this geometric model.

...read moreread less

Journal Article•DOI•

Topological model for two-dimensional image representation: definition and optimal extraction algorithm

[...]

Guillaume Damiand, Yves Bertrand, Christophe Fiorio

01 Feb 2004-Computer Vision and Image Understanding

TL;DR: The two-dimensional topological map is defined, a model which represents both topological and geometrical information of aTwo-dimensional labeled image which is minimal, complete, and unique and can be used to define efficient image processing algorithms.

...read moreread less

Journal Article•DOI•

Maximum entropy model-based baseball highlight detection and classification

[...]

Yihong Gong, Mei Han, Wei Hua, Wei Xu

01 Nov 2004-Computer Vision and Image Understanding

TL;DR: This paper proposes a novel system that is able to automatically detect and classify baseball highlights by seamlessly integrating image, audio, and speech clues using a unique framework based on maximum entropy model (MEM).

...read moreread less

Journal Article•DOI•

Real time repeated video sequence identification

[...]

Kok Meng Pua¹, John M. Gauch¹, Susan Gauch¹, Jedrzej Zdzislaw Miadowicz¹•Institutions (1)

University of Kansas¹

01 Mar 2004-Computer Vision and Image Understanding

TL;DR: A real time system for detecting repeated video clips from a live video source such as news broadcasts that utilizes customized temporal video segmentation techniques to automatically partition the digital video signal into semantically sensible shots and scenes.

...read moreread less

Journal Article•DOI•

Linear color segmentation and its implementation

[...]

Dmitry P. Nikolaev¹, Petr P. Nikolayev¹•Institutions (1)

Russian Academy of Sciences¹

01 Apr 2004-Computer Vision and Image Understanding

TL;DR: A framework for color image segmentation is presented, which combines color histogram analysis and region merging approach, and testing this algorithm with both artificially generated and real images shows quite reliable results.

...read moreread less

Journal Article•DOI•

A multi-modal system for the retrieval of semantic video events

[...]

Arnon Amir¹, Sankar Basu¹, Giridharan Iyengar¹, Ching-Yung Lin¹, Milind Naphade¹, John R. Smith¹, Savitha Srinivasan¹, Belle L. Tseng¹ - Show less +4 more•Institutions (1)

IBM¹

01 Nov 2004-Computer Vision and Image Understanding

TL;DR: A framework for event detection is proposed where events, objects, and other semantic concepts are detected from video using trained classifiers and integration of content-based and concept-based querying in the search process is integrated.

...read moreread less