Showing papers in "arXiv: Computer Vision and Pattern Recognition in 2010"

Posted Content•

Solving Inverse Problems with Piecewise Linear Estimators: From Gaussian Mixture Models to Structured Sparsity

Guoshen Yu¹, Guillermo Sapiro¹, Stéphane Mallat²•Institutions (2)

University of Minnesota¹, École Normale Supérieure²

15 Jun 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, a general framework for image inverse problems is introduced, based on Gaussian mixture models, estimated via a computationally efficient MAP-EM algorithm, which shows that the resulting piecewise linear estimate stabilizes the estimation when compared to traditional sparse inverse problem techniques.

Abstract: A general framework for solving image inverse problems is introduced in this paper. The approach is based on Gaussian mixture models, estimated via a computationally efficient MAP-EM algorithm. A dual mathematical interpretation of the proposed framework with structured sparse estimation is described, which shows that the resulting piecewise linear estimate stabilizes the estimation when compared to traditional sparse inverse problem techniques. This interpretation also suggests an effective dictionary motivated initialization for the MAP-EM algorithm. We demonstrate that in a number of image inverse problems, including inpainting, zooming, and deblurring, the same algorithm produces either equal, often significantly better, or very small margin worse results than the best published ones, at a lower computational cost.

505 citations

Posted Content•

Survey of Nearest Neighbor Techniques

Nitin Bhatia, Vandana

01 Jul 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: The nearest neighbor (NN) technique is simple, efficient, and effective in pattern recognition, text categorization, object recognition, and related fields; this survey covers structureless techniques, which address its memory limitation, and structure-based techniques, which reduce its computational complexity.

Abstract: The nearest neighbor (NN) technique is very simple, highly efficient, and effective in fields such as pattern recognition, text categorization, and object recognition. Its simplicity is its main advantage, but its disadvantages cannot be ignored: the memory requirement and computational complexity also matter. Many techniques have been developed to overcome these limitations. NN techniques are broadly classified into structureless and structure-based techniques. In this paper, we present a survey of such techniques. Weighted kNN, Model-based kNN, Condensed NN, Reduced NN, and Generalized NN are structureless techniques, whereas k-d tree, ball tree, Principal Axis Tree, Nearest Feature Line, Tunable NN, and Orthogonal Search Tree are structure-based algorithms developed on the basis of kNN. The structureless methods overcome the memory limitation, and the structure-based techniques reduce the computational complexity.
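
As a rough illustration of the weighted kNN idea named in this abstract, the sketch below classifies a query by inverse-distance-weighted voting over a brute-force neighbor search. The function name `weighted_knn_predict`, the inverse-distance weighting, and the toy data are assumptions made for illustration; the survey itself does not prescribe this exact form.

```python
import math
from collections import defaultdict


def weighted_knn_predict(train, query, k=3):
    """Classify `query` by distance-weighted voting among its k nearest points.

    `train` is a list of (feature_vector, label) pairs (hypothetical layout).
    """
    # Brute-force search: Euclidean distance to every training point.
    neighbours = sorted(
        (math.dist(features, query), label) for features, label in train
    )[:k]
    votes = defaultdict(float)
    for distance, label in neighbours:
        # Inverse-distance weight; the small constant avoids division by zero.
        votes[label] += 1.0 / (distance + 1e-12)
    return max(votes, key=votes.get)


train = [((0.0, 0.0), "a"), ((0.1, 0.2), "a"), ((1.0, 1.0), "b"), ((0.9, 1.1), "b")]
print(weighted_knn_predict(train, (0.2, 0.1), k=3))
```

The brute-force scan touches every stored point, which is the cost that structure-based methods such as k-d trees aim to reduce.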

443 citations

Posted Content•

A Comprehensive Review of Image Enhancement Techniques

Raman Maini, Himanshu Aggarwal

22 Mar 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: An overview of the underlying concepts, along with algorithms commonly used for image enhancement, is provided, with particular reference to point processing methods and histogram processing.

Abstract: The principal objective of image enhancement is to process an image so that the result is more suitable than the original image for a specific application. Digital image enhancement techniques provide a multitude of choices for improving the visual quality of images. The appropriate choice of such techniques is greatly influenced by the imaging modality, the task at hand, and the viewing conditions. This paper provides an overview of the underlying concepts, along with algorithms commonly used for image enhancement. The paper focuses on spatial domain techniques for image enhancement, with particular reference to point processing methods and histogram processing.
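
To make the two families mentioned here concrete, the following minimal sketch shows one point-processing operation (the image negative) and one histogram-processing operation (histogram equalization) on a grayscale image stored as a list of rows. The function names and the 8-bit default are illustrative assumptions, not taken from the paper.

```python
def negative(image, levels=256):
    """Point operation: map each grey level r to (levels - 1) - r."""
    return [[levels - 1 - p for p in row] for row in image]


def equalize_histogram(image, levels=256):
    """Histogram equalization for a 2-D list of integer grey levels."""
    pixels = [p for row in image for p in row]
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    # Cumulative histogram: cdf[g] counts the pixels with grey level <= g.
    cdf, total = [], 0
    for count in hist:
        total += count
        cdf.append(total)
    cdf_min = next(c for c in cdf if c > 0)
    n = len(pixels)
    if n == cdf_min:
        # Constant image: nothing to stretch.
        return [row[:] for row in image]
    # Look-up table that spreads the occupied grey levels over the full range.
    lut = [max(0, round((c - cdf_min) / (n - cdf_min) * (levels - 1))) for c in cdf]
    return [[lut[p] for p in row] for row in image]


print(equalize_histogram([[52, 55], [61, 59]]))
```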

363 citations

Posted Content•

Fast Inference in Sparse Coding Algorithms with Applications to Object Recognition

Koray Kavukcuoglu¹, Marc'Aurelio Ranzato¹, Yann LeCun¹•Institutions (1)

New York University¹

18 Oct 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work proposes a simple and efficient algorithm to learn basis functions, which provides a fast and smooth approximator to the optimal representation, achieving even better accuracy than exact sparse coding algorithms on visual object recognition tasks.

Abstract: Adaptive sparse coding methods learn a possibly overcomplete set of basis functions, such that natural image patches can be reconstructed by linearly combining a small subset of these bases. The applicability of these methods to visual object recognition tasks has been limited because of the prohibitive cost of the optimization algorithms required to compute the sparse representation. In this work we propose a simple and efficient algorithm to learn basis functions. After training, this model also provides a fast and smooth approximator to the optimal representation, achieving even better accuracy than exact sparse coding algorithms on visual object recognition tasks.

266 citations

Posted Content•

Image Segmentation by Using Threshold Techniques

Salem Saleh Al-amri, Namdeo V. Kalyankar, Santosh Khamitkar

21 May 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper studies image segmentation using five threshold methods, namely the Mean method, the P-tile method, the Histogram Dependent Technique (HDT), the Edge Maximization Technique (EMT), and the visual technique, and compares them with one another to choose the best technique for threshold-based image segmentation.

Abstract: This paper studies image segmentation using five threshold methods: the Mean method, the P-tile method, the Histogram Dependent Technique (HDT), the Edge Maximization Technique (EMT), and the visual technique. The methods are compared with one another to choose the best technique for threshold-based image segmentation. The techniques are applied to three satellite images to choose base guesses for threshold segmentation.
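
Two of the listed methods are simple enough to sketch. The snippet below is a minimal illustration, assuming a grayscale image stored as a list of rows and bright objects on a darker background; the function names and the `object_fraction` parameter are hypothetical, and the paper's own implementations may differ.

```python
def mean_threshold(image):
    """Mean method: use the average grey level as the threshold."""
    pixels = [p for row in image for p in row]
    return sum(pixels) / len(pixels)


def p_tile_threshold(image, object_fraction):
    """P-tile method: pick the threshold so that roughly `object_fraction`
    of the pixels (assumed to be the brightest ones) lie at or above it."""
    pixels = sorted(p for row in image for p in row)
    index = int(round((1.0 - object_fraction) * len(pixels)))
    index = min(max(index, 0), len(pixels) - 1)
    return pixels[index]


def segment(image, threshold):
    """Binary mask: 1 where the pixel is at or above the threshold."""
    return [[1 if p >= threshold else 0 for p in row] for row in image]


image = [[10, 20, 30], [40, 200, 220]]
print(segment(image, mean_threshold(image)))
print(segment(image, p_tile_threshold(image, 0.5)))
```

The P-tile method needs prior knowledge of the fraction of the image occupied by objects, whereas the mean method needs no parameter at all.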

258 citations

Posted Content•

TILT: Transform Invariant Low-rank Textures

Zhengdong Zhang¹, Arvind Ganesh², Xiao Liang¹, Yi Ma²•Institutions (2)

Tsinghua University¹, University of Illinois at Urbana–Champaign²

15 Dec 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, low-rank textures capture geometrically meaningful structures in an image, which encompass conventional local features such as edges and corners as well as all kinds of regular, symmetric patterns ubiquitous in urban environments and man-made objects.

Abstract: In this paper, we show how to efficiently and effectively extract a class of "low-rank textures" in a 3D scene from 2D images despite significant corruptions and warping. The low-rank textures capture geometrically meaningful structures in an image, which encompass conventional local features such as edges and corners as well as all kinds of regular, symmetric patterns ubiquitous in urban environments and man-made objects. Our approach to finding these low-rank textures leverages the recent breakthroughs in convex optimization that enable robust recovery of a high-dimensional low-rank matrix despite gross sparse errors. In the case of planar regions with significant affine or projective deformation, our method can accurately recover both the intrinsic low-rank texture and the precise domain transformation, and hence the 3D geometry and appearance of the planar regions. Extensive experimental results demonstrate that this new technique works effectively for many regular and near-regular patterns or objects that are approximately low-rank, such as symmetrical patterns, building facades, printed texts, and human faces.

203 citations

Posted Content•

Fast L1-Minimization Algorithms For Robust Face Recognition

Allen Y. Yang, Zihan Zhou, Arvind Ganesh, S. Shankar Sastry, Yi Ma

21 Jul 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: This study focuses on the numerical implementation of a sparsity-based classification framework in robust face recognition, where sparse representation is sought to recover human identities from very high-dimensional facial images that may be corrupted by illumination, facial disguise, and pose variation.

Abstract: L1-minimization refers to finding the minimum L1-norm solution to an underdetermined linear system b=Ax. Under certain conditions as described in compressive sensing theory, the minimum L1-norm solution is also the sparsest solution. In this paper, our study addresses the speed and scalability of its algorithms. In particular, we focus on the numerical implementation of a sparsity-based classification framework in robust face recognition, where sparse representation is sought to recover human identities from very high-dimensional facial images that may be corrupted by illumination, facial disguise, and pose variation. Although the underlying numerical problem is a linear program, traditional algorithms are known to suffer poor scalability for large-scale applications. We investigate a new solution based on a classical convex optimization framework, known as Augmented Lagrangian Methods (ALM). The new convex solvers provide a viable solution to real-world, time-critical applications such as face recognition. We conduct extensive experiments to validate and compare the performance of the ALM algorithms against several popular L1-minimization solvers, including interior-point method, Homotopy, FISTA, SESOP-PCD, approximate message passing (AMP) and TFOCS. To aid peer evaluation, the code for all the algorithms has been made publicly available.
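
For orientation, the sketch below shows the shrinkage (soft-thresholding) step that many L1 solvers build on, wrapped in plain iterative shrinkage-thresholding (ISTA). This is deliberately not the ALM solver studied in the paper, and it solves the penalized relaxation $\min_x \frac{1}{2}\|Ax-b\|_2^2 + \lambda\|x\|_1$ rather than the equality-constrained problem stated in the abstract; the names `soft_threshold` and `ista`, the value of `lam`, and the step-size rule are all assumptions for illustration.

```python
def soft_threshold(v, t):
    """Shrink v towards zero by t; values within [-t, t] become exactly 0."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def ista(A, b, lam, n_iter=200):
    """Minimise 0.5 * ||Ax - b||^2 + lam * ||x||_1 by iterative shrinkage."""
    m, n = len(A), len(A[0])
    # The squared Frobenius norm upper-bounds the largest eigenvalue of A^T A,
    # so its reciprocal is a safe (if conservative) step size.
    step = 1.0 / sum(a * a for row in A for a in row)
    x = [0.0] * n
    for _ in range(n_iter):
        residual = [sum(A[i][j] * x[j] for j in range(n)) - b[i] for i in range(m)]
        grad = [sum(A[i][j] * residual[i] for i in range(m)) for j in range(n)]
        x = [soft_threshold(x[j] - step * grad[j], step * lam) for j in range(n)]
    return x


# With A equal to the identity, the minimiser is soft_threshold(b, lam) per entry.
print(ista([[1.0, 0.0], [0.0, 1.0]], [3.0, 0.5], lam=1.0))
```

The small entry of `b` is driven exactly to zero, which is the sparsity-inducing behavior the L1 norm is used for.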

151 citations

Posted Content•

Performance Comparison of SVM and ANN for Handwritten Devnagari Character Recognition

Sandhya Arora, Debotosh Bhattacharjee, Mita Nasipuri, Latesh Malik, M. Kundu, Dipak Kumar Basu

30 Jun 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper discusses the characteristics of some classification methods that have been successfully applied to handwritten Devnagari character recognition, and the results of SVM and ANN classification methods applied to handwritten Devnagari characters.

Abstract: Classification methods based on learning from examples have been widely applied to character recognition since the 1990s and have brought significant improvements in recognition accuracy. This class of methods includes statistical methods, artificial neural networks, support vector machines (SVM), multiple classifier combination, etc. In this paper, we discuss the characteristics of some classification methods that have been successfully applied to handwritten Devnagari character recognition, and the results of SVM and ANN classification methods applied to handwritten Devnagari characters. After preprocessing the character image, we extract shadow features, chain code histogram features, view-based features, and longest-run features. These features are then fed to a neural classifier and a support vector machine for classification. In the neural classifier, we explore three ways of combining the decisions of four MLPs designed for four different features.

113 citations

Posted Content•

A Comparative Study of Removal Noise from Remote Sensing Image

Salem Saleh Al-amri, Namdeo V. Kalyankar, Santosh Khamitkar

05 Feb 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper studies the removal of three types of noise, Salt and Pepper (SPN), Random Variation Impulse Noise (RVIN), and Speckle (SPKN), and compares filters with one another to choose the base method for removing noise from remote sensing images.

Abstract: This paper studies three types of noise: Salt and Pepper (SPN), Random Variation Impulse Noise (RVIN), and Speckle (SPKN). Noise at densities between 10% and 60% is removed using five types of filters: Mean Filter (MF), Adaptive Wiener Filter (AWF), Gaussian Filter (GF), Standard Median Filter (SMF), and Adaptive Median Filter (AMF). The filters are applied to the Saturn remote sensing image and compared with one another. The comparative study is conducted with the help of Mean Square Error (MSE) and Peak Signal to Noise Ratio (PSNR), so as to choose the base method for removing noise from remote sensing images.
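
The evaluation loop described here can be sketched in a few lines: filter a noisy image, then score it against the clean reference with MSE and PSNR. The snippet below uses a standard 3x3 median filter as the example filter; the function names, the border handling, and the synthetic test image are assumptions for illustration and are not taken from the paper.

```python
import math
from statistics import median


def median_filter_3x3(image):
    """Standard 3x3 median filter; border pixels use only in-bounds neighbours."""
    h, w = len(image), len(image[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [
                image[j][i]
                for j in range(max(0, y - 1), min(h, y + 2))
                for i in range(max(0, x - 1), min(w, x + 2))
            ]
            out[y][x] = median(window)
    return out


def mse(a, b):
    """Mean square error between two images of equal size."""
    diffs = [(p - q) ** 2 for ra, rb in zip(a, b) for p, q in zip(ra, rb)]
    return sum(diffs) / len(diffs)


def psnr(a, b, peak=255):
    """Peak signal-to-noise ratio in dB; infinite when the images are identical."""
    error = mse(a, b)
    return math.inf if error == 0 else 10 * math.log10(peak ** 2 / error)


clean = [[100] * 5 for _ in range(5)]
noisy = [row[:] for row in clean]
noisy[2][2] = 255  # a single "salt" pixel
restored = median_filter_3x3(noisy)
print(psnr(noisy, clean), psnr(restored, clean))
```

A higher PSNR (equivalently, a lower MSE) after filtering indicates that the filter moved the image closer to the clean reference.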

98 citations

Journal Article•DOI•

Image Deblurring and Super-resolution by Adaptive Sparse Domain Selection and Adaptive Regularization

Weisheng Dong¹, Lei Zhang, Guangming Shi¹, Xiaolin Wu²•Institutions (2)

Xidian University¹, McMaster University²

06 Dec 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: The authors propose to learn various sets of bases from a pre-collected dataset of example image patches; for a given patch to be processed, one set of bases is then adaptively selected to characterize the local sparse domain.

Abstract: As a powerful statistical image modeling technique, sparse representation has been successfully used in various image restoration applications. The success of sparse representation is owed to the development of l1-norm optimization techniques and to the fact that natural images are intrinsically sparse in some domain. The image restoration quality largely depends on whether the employed sparse domain can represent the underlying image well. Considering that the contents can vary significantly across different images or different patches in a single image, we propose to learn various sets of bases from a pre-collected dataset of example image patches, and then, for a given patch to be processed, one set of bases is adaptively selected to characterize the local sparse domain. We further introduce two adaptive regularization terms into the sparse representation framework. First, a set of autoregressive (AR) models is learned from the dataset of example image patches. The best fitted AR models to a given patch are adaptively selected to regularize the image local structures. Second, the image non-local self-similarity is introduced as another regularization term. In addition, the sparsity regularization parameter is adaptively estimated for better image restoration performance. Extensive experiments on image deblurring and super-resolution validate that by using adaptive sparse domain selection and adaptive regularization, the proposed method achieves much better results than many state-of-the-art algorithms in terms of both PSNR and visual perception.

85 citations

Posted Content•

Handwritten Bangla Basic and Compound character recognition using MLP and SVM classifier

Nibaran Das, Bindaban Das, Ram Sarkar, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri

22 Feb 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, a novel approach for recognition of handwritten compound Bangla characters along with the Basic characters of Bangla alphabet is presented, which makes an attempt to identify compound character classes from most frequently to less frequently occurred ones, i.e., in order of importance.

Abstract: A novel approach for recognition of handwritten compound Bangla characters, along with the Basic characters of the Bangla alphabet, is presented here. Compared to Roman scripts such as English, one of the major stumbling blocks in Optical Character Recognition (OCR) of handwritten Bangla script is the large number of complex-shaped character classes in the Bangla alphabet. In addition to 50 basic character classes, there are nearly 160 complex-shaped compound character classes in the Bangla alphabet. Dealing with such a large variety of handwritten characters with a suitably designed feature set is a challenging problem. Uncertainty and imprecision are inherent in handwritten script. Moreover, such a large variety of complex-shaped characters, some of which closely resemble one another, makes the problem of OCR of handwritten Bangla characters more difficult. Considering the complexity of the problem, the present approach attempts to identify compound character classes from the most frequently to the less frequently occurring ones, i.e., in order of importance. This is to develop a framework for incrementally increasing the number of learned classes of compound characters, from more frequently occurring ones to less frequently occurring ones, along with Basic characters. On experimentation, the technique is observed to produce an average recognition rate of 79.25 after three-fold cross validation of the data, with future scope for improvement and extension.

Posted Content•

Hybrid Medical Image Classification Using Association Rule Mining with Decision Tree Algorithm

P. Rajendran, M. Madheswaran

20 Jan 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: Two image mining approaches are combined in a hybrid manner in this paper, and the hybrid method is reported to be more efficient than traditional image mining methods.

Abstract: The main focus of image mining in the proposed method is the classification of brain tumors in CT scan brain images. The major steps involved in the system are pre-processing, feature extraction, association rule mining, and hybrid classification. The pre-processing step is done using median filtering, and edge features are extracted using the Canny edge detection technique. Two image mining approaches are combined in a hybrid manner in this paper. The frequent patterns from the CT scan images are generated by the frequent pattern tree (FP-Tree) algorithm, which mines the association rules. The decision tree method is used to classify the medical images for diagnosis. This system makes the classification process more accurate. The hybrid method is more efficient than traditional image mining methods. The experimental results on a prediagnosed database of brain images showed 97% sensitivity and 95% accuracy. Physicians can make use of this accurate decision tree classification phase to classify brain images into normal, benign, and malignant for effective medical diagnosis.

Posted Content•

A family of statistical symmetric divergences based on Jensen's inequality

Frank Nielsen¹•Institutions (1)

Association for Computing Machinery¹

21 Sep 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: A novel parametric family of symmetric information-theoretic distances based on Jensen’s inequality for a convex functional generator unifies the celebrated Jeffreys divergence with the Jensen-Shannon divergence when the Shannon entropy generator is chosen.

Abstract: We introduce a novel parametric family of symmetric information-theoretic distances based on Jensen’s inequality for a convex functional generator. In particular, this family unifies the celebrated Jeffreys divergence with the Jensen-Shannon divergence when the Shannon entropy generator is chosen. We then design a generic algorithm to compute the unique centroid defined as the minimum average divergence. This yields a smooth family of centroids linking the Jeffreys to the Jensen-Shannon centroid. Finally, we report on our experimental results.
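
For readers unfamiliar with the two endpoints that this family links, the sketch below computes the classical Jeffreys and Jensen-Shannon divergences for discrete distributions. It does not implement the paper's parametric family or its centroid algorithm; the function names are illustrative, and the Jeffreys divergence is taken here as the unnormalized sum of the two Kullback-Leibler terms, which is one common convention.

```python
import math


def kl(p, q):
    """Kullback-Leibler divergence KL(p || q) in nats; needs q[i] > 0 where p[i] > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def jeffreys(p, q):
    """Jeffreys divergence: the symmetrised sum KL(p || q) + KL(q || p)."""
    return kl(p, q) + kl(q, p)


def jensen_shannon(p, q):
    """Jensen-Shannon divergence: average KL of p and q to their midpoint."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


p, q = [0.5, 0.5], [0.9, 0.1]
print(jeffreys(p, q), jensen_shannon(p, q))
```

Unlike the Jeffreys divergence, the Jensen-Shannon divergence stays finite (at most $\log 2$ in nats) even when the two distributions have disjoint supports.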

Posted Content•

Real-time Robust Principal Components' Pursuit

Chenlu Qiu¹, Namrata Vaswani¹•Institutions (1)

Iowa State University¹

04 Oct 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work studies an online version of principal component pursuit and proposes a solution that automatically handles correlated sparse outliers, motivated by video surveillance applications in which the background image sequence forms the low-rank part and the moving objects, persons, or abnormalities form the sparse part.

Abstract: In the recent work of Candès et al., the problem of recovering a low-rank matrix corrupted by i.i.d. sparse outliers is studied and a very elegant solution, principal component pursuit, is proposed. It is motivated as a tool for video surveillance applications, with the background image sequence forming the low-rank part and the moving objects/persons/abnormalities forming the sparse part. Each image frame is treated as a column vector of the data matrix, which is made up of a low-rank matrix and a sparse corruption matrix. Principal component pursuit solves the problem under the assumptions that the singular vectors of the low-rank matrix are spread out and the sparsity pattern of the sparse matrix is uniformly random. However, in practice, the sparsity pattern and the signal values of the sparse part (moving persons/objects) usually change in a correlated fashion over time, e.g., the object moves slowly and/or with roughly constant velocity. This will often result in a low-rank sparse matrix. For video surveillance applications, it would be much more useful to have a real-time solution. In this work, we study the online version of the above problem and propose a solution that automatically handles correlated sparse outliers. The key idea of this work is as follows. Given an initial estimate of the principal directions of the low-rank part, we causally keep estimating the sparse part at each time by solving a noisy compressive sensing type problem. The principal directions of the low-rank part are updated every so often. Between two update times, if new Principal Components' directions appear, the "noise" seen by the Compressive Sensing step may increase. This problem is solved, in part, by utilizing the time correlation model of the low-rank part. We call the proposed solution "Real-time Robust Principal Components' Pursuit".

Journal Article•DOI•

Content Based Image Retrieval Using Exact Legendre Moments and Support Vector Machine

Ch. Srinivasa Rao¹, S. Srinivas Kumar², B. Chandra Mohan³•Institutions (3)

Techno India¹, Jawaharlal Nehru Technological University, Kakinada², Bapatla Engineering College³

29 May 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: A CBIR system using Exact Legendre Moments (ELM) for gray scale images is proposed in this work, and the proposed system is observed to be superior to other moment-based methods in terms of retrieval efficiency and retrieval time.

Abstract: Content Based Image Retrieval (CBIR) systems based on shape using invariant image moments, viz., Moment Invariants (MI) and Zernike Moments (ZM), are available in the literature. MI and ZM are good at representing the shape features of an image. However, the non-orthogonality of MI and the poor reconstruction of ZM restrict their application in CBIR. Therefore, an efficient and orthogonal moment-based CBIR system is needed. Legendre Moments (LM) are orthogonal, computationally faster, and can represent image shape features compactly. A CBIR system using Exact Legendre Moments (ELM) for gray scale images is proposed in this work. The proposed CBIR system is observed to be superior to other moment-based methods, viz., MI and ZM, in terms of retrieval efficiency and retrieval time. Further, the classification efficiency is improved by employing a Support Vector Machine (SVM) classifier. Improved retrieval results are obtained over an existing CBIR algorithm based on Stacked Euler Vector (SERVE) combined with Modified Moment Invariants (MMI).

Posted Content•

Robust Low-Rank Subspace Segmentation with Semidefinite Guarantees

Yuzhao Ni¹, Ju Sun¹, Xiao-Tong Yuan¹, Shuicheng Yan¹, Loong-Fah Cheong¹•Institutions (1)

National University of Singapore¹

20 Sep 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: It is advocated to enforce the symmetric positive semi definite constraint explicitly during learning (Low-Rank Representation with Positive Semi Definite constraint, or LRR-PSD), and it is shown that factually it can be solved in an exquisite scheme efficiently instead of general-purpose SDP solvers that usually scale up poorly.

Abstract: Recently there is a line of research work proposing to employ Spectral Clustering (SC) to segment (group) high-dimensional structural data such as those (approximately) lying on subspaces or low-dimensional manifolds. (Throughout the paper, we use segmentation, clustering, and grouping, and their verb forms, interchangeably. We follow [liu2010robust] and use the term "subspace" to denote both linear subspaces and affine subspaces; there is a trivial conversion between linear subspaces and affine subspaces, as mentioned therein.) By learning the affinity matrix in the form of sparse reconstruction, techniques proposed in this vein often considerably boost the performance in subspace settings where traditional SC can fail. Despite the success, there are fundamental problems that have been left unsolved: the spectrum property of the learned affinity matrix cannot be gauged in advance, and there is often one ugly symmetrization step that post-processes the affinity for SC input. Hence we advocate to enforce the symmetric positive semidefinite constraint explicitly during learning (Low-Rank Representation with Positive SemiDefinite constraint, or LRR-PSD), and show that factually it can be solved in an exquisite scheme efficiently instead of general-purpose SDP solvers that usually scale up poorly. We provide rigorous mathematical derivations to show that, in its canonical form, LRR-PSD is equivalent to the recently proposed Low-Rank Representation (LRR) scheme [liu2010robust], and hence offer theoretic and practical insights to both LRR-PSD and LRR, inviting future research. As per the computational cost, our proposal is at most comparable to that of LRR, if not less. We validate our theoretic analysis and optimization scheme by experiments on both synthetic and real data sets.

Posted Content•

Scalable Large-Margin Mahalanobis Distance Metric Learning

Chunhua Shen¹, Junae Kim², Lei Wang²•Institutions (2)

NICTA¹, Australian National University²

02 Mar 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work proposes a fast and scalable algorithm to learn a Mahalanobis distance metric and suggests that, compared with state-of-the-art metric learning algorithms, this algorithm can achieve a comparable classification accuracy with reduced computational complexity.

Abstract: For many machine learning algorithms, such as $k$-Nearest Neighbor ($k$-NN) classifiers and $k$-means clustering, success often depends heavily on the metric used to calculate distances between different data points. An effective solution for defining such a metric is to learn it from a set of labeled training samples. In this work, we propose a fast and scalable algorithm to learn a Mahalanobis distance metric. By employing the principle of margin maximization to achieve better generalization performance, this algorithm formulates metric learning as a convex optimization problem in which a positive semidefinite (psd) matrix is the unknown variable. A specialized gradient descent method is proposed. Our algorithm is much more efficient and scales better than existing methods. Experiments on benchmark data sets suggest that, compared with state-of-the-art metric learning algorithms, our algorithm can achieve comparable classification accuracy with reduced computational complexity.
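
To show what the learned object is used for, the sketch below evaluates a squared Mahalanobis distance $(x-y)^\top M (x-y)$. It covers only the distance itself, not the paper's learning algorithm. Writing $M = L^\top L$ is an assumption made here because it guarantees that $M$ is positive semidefinite; the function name and the example matrices are likewise illustrative.

```python
def mahalanobis_sq(x, y, L):
    """Squared Mahalanobis distance (x - y)^T M (x - y) with M = L^T L."""
    d = [a - b for a, b in zip(x, y)]
    # Project the difference with L, then take the squared Euclidean norm;
    # the factorisation M = L^T L keeps M positive semidefinite by construction.
    projected = [sum(l_ij * d_j for l_ij, d_j in zip(row, d)) for row in L]
    return sum(v * v for v in projected)


identity = [[1.0, 0.0], [0.0, 1.0]]
stretched = [[2.0, 0.0], [0.0, 1.0]]  # weights the first feature more heavily
print(mahalanobis_sq((1.0, 2.0), (4.0, 6.0), identity))
print(mahalanobis_sq((1.0, 2.0), (4.0, 6.0), stretched))
```

With the identity matrix the result reduces to the squared Euclidean distance, so a learned metric can be read as a linear re-weighting of the feature space before a $k$-NN or $k$-means step.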

Posted Content•

Recognition of Non-Compound Handwritten Devnagari Characters using a Combination of MLP and Minimum Edit Distance

Sandhya Arora, Debotosh Bhattacharjee, Mita Nasipuri, Dipak Kumar Basu, Mahantapas Kundu

30 Jun 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: A new method for recognition of offline Handwritten non-compound Devnagari Characters in two stages uses two well known and established pattern recognition techniques: one using neural networks and the other one using minimum edit distance.

Abstract: This paper deals with a new method for recognition of offline handwritten non-compound Devnagari characters in two stages. It uses two well-known and established pattern recognition techniques: one using neural networks and the other using minimum edit distance. Each of these techniques is applied to a different set of characters for recognition. In the first stage, two sets of features are computed and two classifiers are applied to get higher recognition accuracy. Two MLPs are used separately to recognize the characters. For one of the MLPs the characters are represented with their shadow features, and for the other the chain code histogram feature is used. The decisions of both MLPs are combined using a weighted majority scheme. The top three results produced by the combined MLPs in the first stage are used to calculate the relative difference values. In the second stage, based on these relative differences, the character set is divided into two. The first set consists of characters with distinct shapes, and the second set consists of confused characters, which appear very similar in shape. Characters of distinct shapes in the first set are classified using the MLPs. Confused characters in the second set are classified using the minimum edit distance method. The minimum edit distance method makes use of corners detected in a character image using a modified Harris corner detection technique. The experiment on this method is carried out on a database of 7154 samples. The overall recognition rate is found to be 90.74%.
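
The second-stage matcher relies on minimum edit distance. The sketch below is the textbook dynamic-programming (Levenshtein) version with unit costs, applied to arbitrary sequences; how the paper encodes detected corners as sequences, and which costs it uses, is not stated in the abstract, so the string inputs here are only a stand-in.

```python
def edit_distance(a, b):
    """Minimum number of insertions, deletions, and substitutions turning a into b."""
    # previous[j] holds the distance between the first i-1 items of a
    # and the first j items of b; only two rows are kept in memory.
    previous = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        current = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(
                min(
                    previous[j] + 1,         # delete item_a
                    current[j - 1] + 1,      # insert item_b
                    previous[j - 1] + cost,  # substitute or match
                )
            )
        previous = current
    return previous[-1]


print(edit_distance("kitten", "sitting"))
```

A query character would then be assigned the label of the stored template with the smallest distance.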

Posted Content•

Convolutional Matching Pursuit and Dictionary Training

Arthur Szlam, Koray Kavukcuoglu, Yann LeCun

03 Oct 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: It is demonstrated that sparse coding by matching pursuit and dictionary learning via K-SVD can be used in the translation invariant setting.

Abstract: Here, $\{W, Z\}$ are the dictionary and the coefficients, respectively, and $z_k$ is the $k$th column of $Z$. $K$, $q$, and $\lambda$ are user-selected parameters controlling the power of the model. More recently, many models with additional structure have been proposed. For example, in [9, 2], the dictionary elements are arranged in groups and the sparsity is on the group level. In [3, 5, 7], the dictionaries are constructed to be translation invariant. In the former work, the dictionary is constructed via a non-negative matrix factorization. In the latter two works, the construction is a convolutional analogue of 1.2 or an $\ell_p$ variant, with $0 < p < 1$. In this short note we work with greedy algorithms for solving the convolutional analogues of 1.1. Specifically, we demonstrate that sparse coding by matching pursuit and dictionary learning via K-SVD [1] can be used in the translation invariant setting.
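
As background for the greedy approach mentioned here, the sketch below implements ordinary (non-convolutional) matching pursuit over a small dictionary of unit-norm atoms. It is a simplified stand-in, not the convolutional variant from the note: the function names, the fixed number of iterations `n_atoms`, and the toy dictionary are assumptions for illustration.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def matching_pursuit(signal, dictionary, n_atoms):
    """Greedy sparse coding; dictionary atoms are assumed to have unit norm."""
    residual = list(signal)
    coeffs = [0.0] * len(dictionary)
    for _ in range(n_atoms):
        # Pick the atom most correlated with what is still unexplained.
        correlations = [dot(atom, residual) for atom in dictionary]
        best = max(range(len(dictionary)), key=lambda k: abs(correlations[k]))
        coeffs[best] += correlations[best]
        # Remove that atom's contribution from the residual.
        residual = [
            r - correlations[best] * a for r, a in zip(residual, dictionary[best])
        ]
    return coeffs, residual


atoms = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
print(matching_pursuit([3.0, 0.0, -2.0], atoms, n_atoms=2))
```

In a translation-invariant setting, the inner products against all shifts of each atom would be computed by correlation over the whole signal rather than against a fixed list of vectors.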

Posted Content•

Color Image Compression Based On Wavelet Packet Best Tree

Gajanan K. Kharate, Varsha H. Patil

19 Apr 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: It is proposed that proper selection of the mother wavelet, based on the nature of the images, improves both the quality and the compression ratio remarkably, and an enhanced run-length encoding technique is suggested that provides better results than RLE.

Abstract: In image compression, the researchers’ aim is to reduce the number of bits required to represent an image by removing spatial and spectral redundancies. Recently the discrete wavelet transform and wavelet packets have emerged as popular techniques for image compression. The wavelet transform is one of the major processing components of image compression. The result of the compression changes with the basis and tap of the wavelet used. It is proposed that proper selection of the mother wavelet, based on the nature of the images, remarkably improves both quality and compression ratio. We suggest a novel technique based on the wavelet packet best tree, selected by threshold entropy, with enhanced run-length encoding. This method reduces the time complexity of wavelet packet decomposition because the complete tree is not decomposed. Our algorithm selects the sub-bands that include significant information based on threshold entropy. The suggested enhanced run-length encoding technique provides better results than RLE. The results, when compared with JPEG-2000, prove to be better.
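
For orientation, the baseline RLE that the enhanced scheme is compared against can be sketched as below. The abstract does not describe the enhanced variant, so this shows only plain run-length encoding; the function names are hypothetical.

```python
def rle_encode(values):
    """Plain run-length encoding: collapse each run into a (value, count) pair."""
    runs = []
    for v in values:
        if runs and runs[-1][0] == v:
            runs[-1][1] += 1      # extend the current run
        else:
            runs.append([v, 1])   # start a new run
    return [(v, n) for v, n in runs]


def rle_decode(runs):
    """Invert rle_encode by expanding each (value, count) pair."""
    out = []
    for v, n in runs:
        out.extend([v] * n)
    return out
```

Encoding `[0, 0, 0, 5, 5, 0]` gives `[(0, 3), (5, 2), (0, 1)]`, and decoding that list restores the input. Thresholded wavelet coefficients tend to contain long runs of zeros, which is the situation where this kind of encoding pays off.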

Posted Content•

Detection of Microcalcification in Mammograms Using Wavelet Transform and Fuzzy Shell Clustering

T. Balakumaran, Ila Vennila, C. Gowri Shankar

10 Feb 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: The proposed algorithm for detecting microcalcification in mammograms combines mammogram quality enhancement using multiresolution analysis based on the dyadic wavelet transform with microcalcification detection by fuzzy shell clustering; its effectiveness is confirmed by experimental results.

Abstract: Microcalcifications in mammograms have been mainly targeted as a reliable earliest sign of breast cancer, and their early detection is vital to improve its prognosis. Since they are very small and may be easily overlooked by the examining radiologist, computer-based detection output can assist the radiologist in improving diagnostic accuracy. In this paper, we propose an algorithm for detecting microcalcification in mammograms. The proposed algorithm involves mammogram quality enhancement using multiresolution analysis based on the dyadic wavelet transform and microcalcification detection by fuzzy shell clustering. It may be possible to detect nodular components such as microcalcification accurately by introducing shape information. The effectiveness of the proposed algorithm for microcalcification detection is confirmed by experimental results.

Journal Article•DOI•

An Efficient Automatic Mass Classification Method In Digitized Mammograms Using Artificial Neural Network

Mohammed J. Islam, Majid Ahmadi¹, Maher A. Sid-Ahmed¹•Institutions (1)

University of Windsor¹

29 Jul 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: An efficient computer aided mass classification method in digitized mammograms using Artificial Neural Network (ANN), which performs benign-malignant classification on region of interest (ROI) that contains mass.

Abstract: In this paper we present an efficient computer aided mass classification method in digitized mammograms using an Artificial Neural Network (ANN), which performs benign-malignant classification on a region of interest (ROI) that contains a mass. One of the major mammographic characteristics for mass classification is texture. The ANN exploits this important factor to classify the mass as benign or malignant. The statistical textural features used in characterizing the masses are mean, standard deviation, entropy, skewness, kurtosis and uniformity. The main aim of the method is to increase the effectiveness and efficiency of the classification process in an objective manner to reduce the number of false positives of malignancies. A three-layer artificial neural network with seven features was proposed for classifying the marked regions as benign or malignant, achieving 90.91% sensitivity and 83.87% specificity, which is very promising compared to the radiologist's sensitivity of 75%.
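
The six statistical texture features named above can be computed from the gray-level histogram of an ROI. The sketch below uses the common first-order histogram definitions; the abstract does not give the exact formulas, so these definitions (including non-excess kurtosis and base-2 entropy), the function name, and the 256-level assumption are illustrative choices only.

```python
import numpy as np


def texture_features(roi, levels=256):
    """First-order texture features from the gray-level histogram of an integer ROI."""
    roi = np.asarray(roi)
    hist = np.bincount(roi.ravel(), minlength=levels).astype(float)
    p = hist / hist.sum()            # normalized histogram p(z)
    z = np.arange(hist.size)         # gray levels
    mean = np.sum(z * p)
    std = np.sqrt(np.sum((z - mean) ** 2 * p))
    nonzero = p > 0
    entropy = -np.sum(p[nonzero] * np.log2(p[nonzero]))
    uniformity = np.sum(p ** 2)
    # Standardized third and fourth moments; defined as 0 for a flat region.
    skewness = np.sum((z - mean) ** 3 * p) / std ** 3 if std > 0 else 0.0
    kurtosis = np.sum((z - mean) ** 4 * p) / std ** 4 if std > 0 else 0.0
    return {
        "mean": mean,
        "std": std,
        "entropy": entropy,
        "skewness": skewness,
        "kurtosis": kurtosis,
        "uniformity": uniformity,
    }
```

The resulting dictionary could be flattened into a feature vector and passed to a classifier; how the paper arrives at seven network inputs from these features is not stated in the abstract.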

Posted Content•

An Improved Image Mining Technique For Brain Tumour Classification Using Efficient Classifier

P. Rajendran, M. Madheswaran

12 Jan 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: An improved image mining technique for brain tumor classification using pruned association rule with MARI algorithm is presented in this paper and can assist the physicians for efficient classification with multiple keywords per image to improve the accuracy.

Abstract: An improved image mining technique for brain tumor classification using pruned association rule with MARI algorithm is presented in this paper. The method proposed makes use of association rule mining technique to classify the CT scan brain images into three categories namely normal, benign and malign. It combines the low level features extracted from images and high level knowledge from specialists. The developed algorithm can assist the physicians for efficient classification with multiple keywords per image to improve the accuracy. The experimental result on prediagnosed database of brain images showed 96 percent and 93 percent sensitivity and accuracy respectively.

Posted Content•

Offline Signature Identification by Fusion of Multiple Classifiers using Statistical Learning Theory

Dakshina Ranjan Kisku, Phalguni Gupta, Jamuna Kanta Sing

30 Mar 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper uses Support Vector Machines (SVM) to fuse multiple classifiers for an offline signature system and the results are found to be promising.

Abstract: This paper uses Support Vector Machines (SVM) to fuse multiple classifiers for an offline signature system. From the signature images, global and local features are extracted, and the signatures are verified with the help of Gaussian empirical rule, Euclidean and Mahalanobis distance based classifiers. SVM is used to fuse the matching scores of these matchers. Finally, query signatures are recognized by comparing them with all signatures in the database. The proposed system is tested on a signature database containing 5400 offline signatures of 600 individuals, and the results are found to be promising.

Posted Content•

Affine-invariant diffusion geometry for the analysis of deformable 3D shapes

Dan Raviv, Alexander M. Bronstein, Michael M. Bronstein, Ron Kimmel, Nir Sochen

29 Dec 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: An (equi-)affine invariant diffusion geometry is introduced by which surfaces that go through squeeze and shear transformations can still be properly analyzed, and an invariant Laplacian is constructed from which local and global geometric structures are extracted.

Abstract: We introduce an (equi-)affine invariant diffusion geometry by which surfaces that go through squeeze and shear transformations can still be properly analyzed. The definition of an affine invariant metric enables us to construct an invariant Laplacian from which local and global geometric structures are extracted. Applications of the proposed framework demonstrate its power in generalizing and enriching the existing set of tools for shape analysis.

Posted Content•

New Clustering Algorithm for Vector Quantization using Rotation of Error Vector

H. B. Kekre, Tanuja Sarode

10 Apr 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: The proposed algorithm gives less distortion as compared to well known Linde Buzo Gray (LBG) algorithm and Kekre’s Proportionate Error (KPE) Algorithm by introducing new orientation every time to split the clusters.

Abstract: The paper presents a new clustering algorithm. The proposed algorithm gives less distortion than the well-known Linde Buzo Gray (LBG) algorithm and Kekre’s Proportionate Error (KPE) algorithm. In LBG, a constant error is added every time to split the clusters, resulting in cluster formation in one direction, which is 135° in the 2-dimensional case. For this reason clustering is inefficient, resulting in high MSE in LBG. To overcome this drawback of LBG, a proportionate error is added in KPE to change the cluster orientation. Although the cluster orientation in KPE is changed, its variation is limited to ±45° over 135°. The proposed algorithm takes care of this problem by introducing a new orientation every time the clusters are split. The proposed method reduces PSNR by 2 dB to 5 dB for codebook sizes 128 to 1024 with respect to LBG. Keywords: Vector Quantization; Codebook; Codevector; Encoding; Compression. I. Introduction: World Wide Web applications have grown extensively over the last few decades and have become a requisite tool for education, communication, industry, amusement etc. All these applications are multimedia-based applications consisting of images and videos. Images and videos require an enormous volume of data items, creating a serious problem as they need higher channel bandwidth for efficient transmission. Further, a high degree of redundancy is observed in digital images. Thus the need for image compression arises for resourceful storage and transmission. Image compression is classified into two categories, lossless image compression and lossy image compression.
Vector quantization (VQ) is one of the lossy data compression techniques [1], [2] and has been used in a number of applications, such as pattern recognition [3], speech recognition and face detection [4], [5], image segmentation [6-9], speech data compression [10], Content Based Image Retrieval (CBIR) [11], [12], face recognition [13], [14], iris recognition [15], tumor detection in mammography images [29] etc. VQ is a mapping function which maps k-dimensional vector space to a finite set CB = {C

Posted Content•

Defining and Generating Axial Lines from Street Center Lines for better Understanding of Urban Morphologies

Xintao Liu, Bin Jiang

27 Sep 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this paper, the authors define axial lines as the least number of individual straight line segments mutually intersected along natural streets that are generated from street center lines using the Gestalt principle of good continuity.

Abstract: Axial lines are defined as the longest visibility lines for representing individual linear spaces in urban environments. The least number of axial lines that cover the free space of an urban environment or the space between buildings constitute what is often called an axial map. This is a fundamental tool in space syntax, a theory developed by Bill Hillier and his colleagues for characterizing the underlying urban morphologies. For a long time, generating axial lines with help of some graphic software has been a tedious manual process that is criticized for being time consuming, subjective, or even arbitrary. In this paper, we redefine axial lines as the least number of individual straight line segments mutually intersected along natural streets that are generated from street center lines using the Gestalt principle of good continuity. Based on this new definition, we develop an automatic solution to generating the newly defined axial lines from street center lines. We apply this solution to six typical street networks (three from North America and three from Europe), and generate a new set of axial lines for analyzing the urban morphologies. Through a comparison study between the new axial lines and the conventional or old axial lines, and between the new axial lines and natural streets, we demonstrate with empirical evidence that the newly defined axial lines are a better alternative in capturing the underlying urban structure. Keywords: Space syntax, street networks, topological analysis, traffic, head/tail division rule

Posted Content•

A Two Stage Classification Approach for Handwritten Devanagari Characters

Sandhya Arora, Debotosh Bhattacharjee, Mita Nasipuri, Latesh Malik

30 Jun 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: A differential distance based technique to find a near straight line for shirorekha, vertical bar (Spine) in handwritten devnagari characters is designed.

Abstract: The paper presents a two stage classification approach for handwritten devanagari characters. The first stage uses structural properties such as the shirorekha and spine of a character, and the second stage exploits some intersection features of characters, which are fed to a feedforward neural network. A simple histogram based method does not work for finding the shirorekha and vertical bar (spine) in handwritten devnagari characters, so we designed a differential distance based technique to find a near straight line for the shirorekha and spine. This approach has been tested on 50000 samples and achieved 89.12% success.

Posted Content•

Handwritten Arabic Numeral Recognition using a Multi Layer Perceptron

Nibaran Das, Ayatullah Faruk Mollah, Sudip Saha, Syed Sahidul Haque

09 Mar 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: A feature set of 88 features, comprising 72 shadow and 16 octant features, is designed to represent samples of handwritten Arabic numerals; the approach can be extended to include OCR of handwritten characters of the Arabic alphabet.

Abstract: Handwritten numeral recognition is in general a benchmark problem of Pattern Recognition and Artificial Intelligence. Compared to the problem of printed numeral recognition, the problem of handwritten numeral recognition is compounded by variations in the shapes and sizes of handwritten characters. Considering all these, the present work addresses the problem of handwritten numeral recognition for handwritten Arabic numerals. Arabic is spoken throughout the Arab World and is the fifth most popular language in the world, slightly ahead of Portuguese and Bengali. For the present work, a feature set of 88 features is designed to represent samples of handwritten Arabic numerals. It includes 72 shadow and 16 octant features. A Multi Layer Perceptron (MLP) based classifier is used to recognize handwritten Arabic digits represented with this feature set. In experiments on a database of 3000 samples, the technique yields an average recognition rate of 94.93%, evaluated after three-fold cross validation of results. It is useful for applications related to OCR of handwritten Arabic digits and can also be extended to include OCR of handwritten characters of the Arabic alphabet.

Posted Content•

An Unsupervised Algorithm For Learning Lie Group Transformations

Jascha Sohl-Dickstein, Jimmy C. Wang, Bruno A. Olshausen

07 Jan 2010-arXiv: Computer Vision and Pattern Recognition

TL;DR: Several theoretical contributions which allow Lie groups to be fit to high dimensional datasets are presented, reducing the computational complexity of parameter estimation to that of training a linear transformation model.

Abstract: We present several theoretical contributions which allow Lie groups to be fit to high dimensional datasets. Transformation operators are represented in their eigen-basis, reducing the computational complexity of parameter estimation to that of training a linear transformation model. A transformation specific "blurring" operator is introduced that allows inference to escape local minima via a smoothing of the transformation space. A penalty on traversed manifold distance is added which encourages the discovery of sparse, minimal distance, transformations between states. Both learning and inference are demonstrated using these methods for the full set of affine transformations on natural image patches. Transformation operators are then trained on natural video sequences. It is shown that the learned video transformations provide a better description of inter-frame differences than the standard motion model based on rigid translation.
