Showing papers in "Pattern Recognition Letters in 2001"
••
TL;DR: The advantages and shortcomings of the performance measures currently used in CBIR are discussed and proposals for a standard test suite similar to that used in IR at the annual Text REtrieval Conference (TREC), are presented.
598 citations
••
TL;DR: The effects of five feature normalization methods on retrieval performance are discussed and two likelihood ratio-based similarity measures that perform significantly better than the commonly used geometric approaches like the Lp metrics are described.
450 citations
••
TL;DR: A two-phase clustering algorithm for outliers detection is proposed, which first modify the traditional k-means algorithm in Phase 1 by using a heuristic “if one new input pattern is far enough away from all clusters' centers, then assign it as a new cluster center”.
345 citations
••
TL;DR: This work describes a scheme that is able to classify audio segments into seven categories consisting of silence, single speaker speech, music, environmental noise, multiple speakers' speech, simultaneous speech and music, and speech and noise, and shows that cepstral-based features such as the Mel-frequency cep stral coefficients (MFCC) and linear prediction coefficients (LPC) provide better classification accuracy compared to temporal and spectral features.
315 citations
••
TL;DR: A new standard is currently being developed, the JPEG2000, which is not only intended to provide rate-distortion and subjective image quality performance superior to existing standards, but also to provide functionality that current standards can either not address efficiently or not address at all.
269 citations
••
TL;DR: The classification shows an improvement of the online experiment and the temporal determination of minimal classification error compared to linear classification methods.
251 citations
••
TL;DR: Experimental results indicate that the incorporation of colour information enhances the performance of the texture analysis techniques examined, and the classification accuracy is determined using a neural network classifier based on Learning Vector Quantization.
230 citations
••
TL;DR: A new graph distance metric is proposed for measuring similarities between objects represented by attributed relational graphs that can be computed by a straightforward extension of any algorithm that implements error-correcting graph matching, when run under an appropriate cost function, and the extension only takes time linear in the size of the graphs.
214 citations
••
TL;DR: Experiments show that the factors of PTF are easier to interpret than those produced by methods based on the singular value decomposition, which might contain negative values.
191 citations
••
TL;DR: It is proved that these combination rules are equivalent when using two classifiers and the sum of the estimates of the a posteriori probabilities is equal to one.
173 citations
••
TL;DR: In this article, the 3D and grey level comparison algorithms were designed to be integrated in security applications in which individuals cooperate, and the residual error after 3D matching was used as a first similarity measure.
••
TL;DR: This paper investigates a method for two-dimensional image fusion based on a novel multi-resolution transform called steerable pyramids, which combines the multi-scale decomposition with differential measurements, which is very useful for feature extraction.
••
TL;DR: An existing graph distance metric based on maximum common subgraph has been extended by a proposal to define the problem size with the union of the two graphs being measured, rather than the larger of theTwo graphs used in the existing metric.
••
TL;DR: A novel singular value decomposition (SVD)- and vector quantization (VQ)-based image hiding scheme to hide image data is presented, showing good compression ratio and satisfactory image quality.
••
TL;DR: A comparison between the new deslanting technique and the method proposed by Bozinovic and Srihari was made by measuring the performance of both methods within a word recognition system tested on different databases.
••
TL;DR: Experiments show that the new features proposed can catch salient edge/structure information and improve the retrieval performance and are more generally applicable than texture or shape features.
••
TL;DR: A lower bound of the box size is found and the reason for having it is provided and indicates the need for limiting the box sizes within certain bounds.
••
TL;DR: This paper presents a novel, information-theoretic algorithm for feature selection, which finds an optimal set of attributes by removing both irrelevant and redundant features and is applicable to datasets of a mixed nature.
••
TL;DR: A mixture-of-Gaussians modeling of the color space, provides a robust representation that can accommodate large color variations, as well as highlights and shadows, in face-color modeling and segmentation.
••
TL;DR: Comparisons with other text location methods are presented; indicating that the proposed system has a better accuracy.
••
TL;DR: It is argued that the use of predictive accuracy for basic probability assignments can improve the overall system performance when compared to `traditional' mass assignment techniques.
••
TL;DR: The proposed online system distinguishes crop from weeds based on multi-spectal reflectance gathered with an imaging spectrograph under field conditions were recognized herbicide reductions of up to 90%.
••
TL;DR: This work addresses the same problem using the framework of heuristic search strategies to find the shortest path in a graph and shows that the complexity of the algorithm is close to O( P 2 ).
••
TL;DR: The experimental results demonstrate that the proposed two-step circle detection algorithm using pairs of chords can detect the circles effectively.
••
TL;DR: A fast and novel technique for color quantization using reduction of color space dimensionality and a fast pixel mapping algorithm based on the proposed data clustering algorithm are presented.
••
TL;DR: This paper comprises a complete system for content-based retrieval and browsing of news reports; the annotation of the video stream is fully automated and is based both on visual features extracted from video shots and on textual strings extracted from captions and audio tracks.
••
TL;DR: The scale space image of the distance accumulation showed that the zero crossings of distance accumulation are quite stable and analysis of its relation to planar curvature matched very well with experimental results.
••
TL;DR: This paper presents a machine-printed and hand-written text classification scheme for Bangla and Devnagari, the two most popular Indian scripts, which has an accuracy of 98.6%.
••
TL;DR: A new vector median filter suitable for colour image processing is presented, based on a new ordering of vectors in the HSV colour space, which shows promising results in terms of colour image restoration.
••
TL;DR: This paper proposes a novel method to detect scene cuts adaptively using a difference metric based on the color histograms of successive frames of a video sequence, and applies a refinement procedure to remove false detection.