scispace - formally typeset
Search or ask a question
Author

Thanarat H. Chalidabhongse

Bio: Thanarat H. Chalidabhongse is an academic researcher from Chulalongkorn University. The author has contributed to research in topics: Background subtraction & Object detection. The author has an hindex of 12, co-authored 55 publications receiving 2279 citations. Previous affiliations of Thanarat H. Chalidabhongse include King Mongkut's Institute of Technology Ladkrabang.


Papers
More filters
Journal ArticleDOI
TL;DR: A real-time algorithm for foreground-background segmentation that can handle scenes containing moving backgrounds or illumination variations, and it achieves robust detection for different types of videos is presented.
Abstract: We present a real-time algorithm for foreground-background segmentation. Sample background values at each pixel are quantized into codebooks which represent a compressed form of background model for a long image sequence. This allows us to capture structural background variation due to periodic-like motion over a long period of time under limited memory. The codebook representation is efficient in memory and speed compared with other background modeling techniques. Our method can handle scenes containing moving backgrounds or illumination variations, and it achieves robust detection for different types of videos. We compared our method with other multimode modeling techniques. In addition to the basic algorithm, two features improving the algorithm are presented-layered modeling/detection and adaptive codebook updating. For performance evaluation, we have applied perturbation detection rate analysis to four background subtraction algorithms and two videos of different types of scenes.

1,552 citations

Proceedings ArticleDOI
24 Oct 2004
TL;DR: A new fast algorithm for background modeling and subtraction that can handle scenes containing moving backgrounds or illumination variations (shadows and highlights), and it achieves robust detection for compressed videos.
Abstract: We present a new fast algorithm for background modeling and subtraction. Sample background values at each pixel are quantized into codebooks which represent a compressed form of background model for a long image sequence. This allows us to capture structural background variation due to periodic-like motion over a long period of time under limited memory. Our method can handle scenes containing moving backgrounds or illumination variations (shadows and highlights), and it achieves robust detection for compressed videos. We compared our method with other multimode modeling techniques.

412 citations

Proceedings ArticleDOI
01 Dec 2006
TL;DR: A vision system that can extract 2D and 3D visual properties of mango such as size, projected area, volume, and surface area from images and use them in sorting and the results show the technique could be a good alternative and more feasible method for sorting mango comparing to human's manual sorting.
Abstract: This paper describes a vision system that can extract 2D and 3D visual properties of mango such as size (length, width, and thickness), projected area, volume, and surface area from images and use them in sorting. The 2D/3D visual properties are extracted from multiple view images of mango. The images are first segmented to extract the silhouette regions of mango. The 2D visual properties are then measured from the top view silhouette as explained by Yimyam et al. (2005). The 3D mango volume reconstruction is done using volumetric caving on multiple silhouette images. First the cameras are calibrated to obtain the intrinsic and extrinsic camera parameters. Then the 3D volume voxels are crafted based on silhouette images of the fruit in multiple views. After craving all silhouettes, we obtain the coarse 3D shape of the fruit and then we can compute the volume and surface area. We then use these features in automatic mango sorting which we employ a typical backpropagation neural networks. In this research, we employed the system to evaluate visual properties of a mango cultivar called "Nam Dokmai". There were two sets total of 182 mangoes in three various sizes sorted by weights according to a standard sorting metric for mango export. Two experiments were performed. One is for showing the accuracy of our vision-based feature extraction and measurement by comparing results with the measurements using various instruments. The second experiment is to show the sorting accuracy by comparing to human sorting. The results show the technique could be a good alternative and more feasible method for sorting mango comparing to human's manual sorting.

50 citations

Proceedings ArticleDOI
30 Jun 2004
TL;DR: The paper presents a statistical adaptive realtime background subtraction algorithm that is very robust to moving shadows and dynamic scene environment, and proposes a novel "vivacity factor" to measure the activities of foreground objects.
Abstract: The paper presents a statistical adaptive realtime background subtraction algorithm that is very robust to moving shadows and dynamic scene environment The algorithm enhances the previously developed method reported by T Horprascrt et al (see Proc IEEE ICCV'99 Frame-rate Workshop, 1999) by adding adaptation of the model corresponding to a dynamic background using adaptive brightness and color distortion In addition, we propose a novel "vivacity factor" to measure the activities of foreground objects It is used to delay the adaptation rate for the area of often-occurring moving foregrounds Our method provides a solution to real-time moving object and shadow detection in the dynamic background scene of a video stream We also develop the learning-rate control mechanism that is not addressed by most background subtraction algorithms

39 citations

Proceedings ArticleDOI
01 Dec 2010
TL;DR: The algorithms for players tracking and ball detection for an automatic broadcast tennis video annotation using a robust non-parametric procedure for estimating density gradients called the mean shift algorithm can precisely classify the players' action into back hand ground stroke and forehand ground stroke.
Abstract: This paper describes our algorithms for players tracking and ball detection for an automatic broadcast tennis video annotation. The system detects and tracks the players using a robust non-parametric procedure for estimating density gradients called the mean shift algorithm. The basic mean shift tracking algorithm assumes that the target object has to separate sufficiently from background, but this assumption is not always true especially when tracking is carried out in dynamic backgrounds such as in sport videos. To cope with this problem, in our proposed system, we embrace the motion segmentation and use the 8×8×8 color histogram to be feature distribution for mean shift tennis players tracking. In order to determine the players' actions precisely, the system also detect and track ball positions using frame differencing as well as applying some correlation techniques to eliminate false detections. Based on both players' motion patterns and ball positions, the system can precisely classify the players' action into backhand ground stroke and forehand ground stroke. Videos of broadcast tennis games downloaded from the Internet have been tested. The results show our system is able to precisely classify the player's actions with 83.7% precision and 82% recall rates.

29 citations


Cited by
More filters
01 Jan 2004
TL;DR: Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance and describes numerous important application areas such as image based rendering and digital libraries.
Abstract: From the Publisher: The accessible presentation of this book gives both a general view of the entire computer vision enterprise and also offers sufficient detail to be able to build useful applications. Users learn techniques that have proven to be useful by first-hand experience and a wide range of mathematical methods. A CD-ROM with every copy of the text contains source code for programming practice, color images, and illustrative movies. Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance. Topics are discussed in substantial and increasing depth. Application surveys describe numerous important application areas such as image based rendering and digital libraries. Many important algorithms broken down and illustrated in pseudo code. Appropriate for use by engineers as a comprehensive reference to the computer vision enterprise.

3,627 citations

Journal ArticleDOI
TL;DR: This survey reviews recent trends in video-based human capture and analysis, as well as discussing open problems for future research to achieve automatic visual analysis of human movement.

2,738 citations

Journal ArticleDOI
TL;DR: Efficiency figures show that the proposed technique for motion detection outperforms recent and proven state-of-the-art methods in terms of both computation speed and detection rate.
Abstract: This paper presents a technique for motion detection that incorporates several innovative mechanisms. For example, our proposed technique stores, for each pixel, a set of values taken in the past at the same location or in the neighborhood. It then compares this set to the current pixel value in order to determine whether that pixel belongs to the background, and adapts the model by choosing randomly which values to substitute from the background model. This approach differs from those based upon the classical belief that the oldest values should be replaced first. Finally, when the pixel is found to be part of the background, its value is propagated into the background model of a neighboring pixel. We describe our method in full details (including pseudo-code and the parameter values used) and compare it to other background subtraction techniques. Efficiency figures show that our method outperforms recent and proven state-of-the-art methods in terms of both computation speed and detection rate. We also analyze the performance of a downscaled version of our algorithm to the absolute minimum of one comparison and one byte of memory per pixel. It appears that even such a simplified version of our algorithm performs better than mainstream techniques.

1,777 citations

Journal ArticleDOI
TL;DR: A novel and efficient texture-based method for modeling the background and detecting moving objects from a video sequence that provides many advantages compared to the state-of-the-art.
Abstract: This paper presents a novel and efficient texture-based method for modeling the background and detecting moving objects from a video sequence. Each pixel is modeled as a group of adaptive local binary pattern histograms that are calculated over a circular region around the pixel. The approach provides us with many advantages compared to the state-of-the-art. Experimental results clearly justify our model.

1,355 citations

Book
14 Dec 2016
TL;DR: Whether you want to build simple or sophisticated vision applications, Learning OpenCV is the book any developer or hobbyist needs to get started, with the help of hands-on exercises in each chapter.
Abstract: Learning OpenCV puts you in the middle of the rapidly expanding field of computer vision. Written by the creators of the free open source OpenCV library, this book introduces you to computer vision and demonstrates how you can quickly build applications that enable computers to "see" and make decisions based on that data.The second edition is updated to cover new features and changes in OpenCV 2.0, especially the C++ interface.Computer vision is everywherein security systems, manufacturing inspection systems, medical image analysis, Unmanned Aerial Vehicles, and more. OpenCV provides an easy-to-use computer vision framework and a comprehensive library with more than 500 functions that can run vision code in real time. Whether you want to build simple or sophisticated vision applications, Learning OpenCV is the book any developer or hobbyist needs to get started, with the help of hands-on exercises in each chapter.This book includes:A thorough introduction to OpenCV Getting input from cameras Transforming images Segmenting images and shape matching Pattern recognition, including face detection Tracking and motion in 2 and 3 dimensions 3D reconstruction from stereo vision Machine learning algorithms

1,222 citations