Showing papers by "Balaji Krishnapuram published in 2006"

PDF

Open Access

Proceedings Article•

Multiple Instance Learning for Computer Aided Diagnosis

[...]

Murat Dundar¹, Balaji Krishnapuram¹, R. B. Rao¹, Glenn Fung¹•Institutions (1)

04 Dec 2006

TL;DR: Comparisons against other MIL methods on benchmark problems indicate that the proposed method is competitive with the state-of-the-art, and the proposed algorithm significantly improves diagnostic accuracy when compared to both MIL and traditional classifiers.

...read moreread less

Abstract: Many computer aided diagnosis (CAD) problems can be best modelled as a multiple-instance learning (MIL) problem with unbalanced data: i.e., the training data typically consists of a few positive bags, and a very large number of negative instances. Existing MIL algorithms are much too computationally expensive for these datasets. We describe CH, a framework for learning a Convex Hull representation of multiple instances that is significantly faster than existing MIL algorithms. Our CH framework applies to any standard hyperplane-based learning algorithm, and for some algorithms, is guaranteed to find the global optimal solution. Experimental studies on two different CAD applications further demonstrate that the proposed algorithm significantly improves diagnostic accuracy when compared to both MIL and traditional classifiers. Although not designed for standard MIL problems (which have both positive and negative bags and relatively balanced datasets), comparisons against other MIL methods on benchmark problems also indicate that the proposed method is competitive with the state-of-the-art.

...read moreread less

116 citations

Journal Article•DOI•

Variational Bayes for continuous hidden Markov models and its application to active learning

[...]

Shihao Ji¹, Balaji Krishnapuram, Lawrence Carin•Institutions (1)

Duke University¹

01 Apr 2006-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A variational Bayes (VB) framework for learning continuous hidden Markov models (CHMMs), and the VB framework within active learning is examined, demonstrating that all of these active learning methods can significantly reduce the amount of required labeling, compared to random selection of samples for labeling.

...read moreread less

Abstract: In this paper, we present a variational Bayes (VB) framework for learning continuous hidden Markov models (CHMMs), and we examine the VB framework within active learning. Unlike a maximum likelihood or maximum a posteriori training procedure, which yield a point estimate of the CHMM parameters, VB-based training yields an estimate of the full posterior of the model parameters. This is particularly important for small training sets since it gives a measure of confidence in the accuracy of the learned model. This is utilized within the context of active learning, for which we acquire labels for those feature vectors for which knowledge of the associated label would be most informative for reducing model-parameter uncertainty. Three active learning algorithms are considered in this paper: 1) query by committee (QBC), with the goal of selecting data for labeling that minimize the classification variance, 2) a maximum expected information gain method that seeks to label data with the goal of reducing the entropy of the model parameters, and 3) an error-reduction-based procedure that attempts to minimize classification error over the test data. The experimental results are presented for synthetic and measured data. We demonstrate that all of these active learning methods can significantly reduce the amount of required labeling, compared to random selection of samples for labeling.

...read moreread less

72 citations

Patent•

Method, apparatus and computer program for displaying marks in an image data set

[...]

Carl J. G. Evertsz, Anke Bödicker, Sriram Krishnan, Balaji Krishnapuram, R. Bharat Rao, Dennis O'Dell, Alok Gupta - Show less +3 more

17 Aug 2006

TL;DR: In this paper, a method and an apparatus display marks in an image data set, wherein an image dataset comprising marks is provided and wherein during a review phase not all marks within the image dataset are displayed at the same time.

...read moreread less

Abstract: A method and an apparatus display marks in an image data set, wherein an image data set comprising marks is provided and wherein during a review phase not all marks within the image data set are displayed at the same time. A list of the marks can be generated by sorting the marks depending on a predetermined sorting criterion and wherein the marks are displayed temporally one after another within the image data set in accordance with the generated list. The image data set is for example a medical image data set, wherein the marks are CAD marks and wherein the sorting criterion is the probability of marking illness, in particular the suspiciousness.

...read moreread less

23 citations

Book Chapter•DOI•

Batch classification with applications in computer aided diagnosis

[...]

Volkan Vural¹, Glenn Fung², Balaji Krishnapuram², Jennifer G. Dy¹, Bharat Rao² - Show less +1 more•Institutions (2)

Northeastern University¹, Siemens²

18 Sep 2006

TL;DR: In this article, two algorithms are developed to classify all the samples in a batch jointly, one based on a probabilistic analysis and another based on mathematical programming approach, which demonstrate that the proposed algorithms are significantly more accurate than a naive SVM which ignores the correlations among the samples.

...read moreread less

Abstract: Most classification methods assume that the samples are drawn independently and identically from an unknown data generating distribution, yet this assumption is violated in several real life problems. In order to relax this assumption, we consider the case where batches or groups of samples may have internal correlations, whereas the samples from different batches may be considered to be uncorrelated. Two algorithms are developed to classify all the samples in a batch jointly, one based on a probabilistic analysis and another based on a mathematical programming approach. Experiments on three real-life computer aided diagnosis (CAD) problems demonstrate that the proposed algorithms are significantly more accurate than a naive SVM which ignores the correlations among the samples.

...read moreread less

13 citations

Patent•

System and method for learning rankings via convex hull separation

[...]

Jinbo Bi¹, Glenn Fung, Sriram Krishnan, Balaji Krishnapuram, R. Rao, Romer Rosales - Show less +2 more•Institutions (1)

Siemens¹

01 Jun 2006

TL;DR: In this paper, a ranking function that classifies feature points in an n-dimensional space has been proposed, where a plurality of feature points are derived from tissue sample regions in a digital medical image, and the ranking function satisfies inequality constraints f (xi ) ≤ f (xj ) for all xI ∈ conv(Ai) and xj ∈ Conv(Aj), where conv represents the convex hull of the elements of set A.

...read moreread less

Abstract: A method for finding a ranking function /that classifies feature points in an n- dimensional space includes providing (61) a plurality of feature points Xk derived from tissue sample regions in a digital medical image, providing (62) training data A comprising training samples Aj where formula (I), providing (63) an ordering E = {(P,Q)| AP< AQ} of at least some training data sets where all training samples xI ∈Ap are ranked higher than any sample xj ∈ AQ , and solving (64) a mathematical optimization program to determine the ranking function f that classifies said feature points x into sets A. For any two sets Ai, Aj, Ai -< Aj , and the ranking function f satisfies inequality constraints f (xi ) ≤ f (xj ) for all xI ∈ conv(Ai) and xj ∈ conv(Aj) , where conv(A) represents the convex hull of the elements of set A.

...read moreread less

7 citations

Journal Article•

Batch classification with applications in computer aided diagnosis

[...]

Volkan Vural¹, Glenn Fung², Balaji Krishnapuram², Jennifer G. Dy¹, Bharat Rao² - Show less +1 more•Institutions (2)

Northeastern University¹, Siemens²

01 Jan 2006-Lecture Notes in Computer Science

TL;DR: Two algorithms are developed to classify all the samples in a batch jointly, one based on a probabilistic analysis and anotherbased on a mathematical programming approach, which are significantly more accurate than a naive SVM which ignores the correlations among the samples.

...read moreread less

6 citations

Book Chapter•DOI•

Addressing image variability while learning classifiers for detecting clusters of micro-calcifications

[...]

Glenn Fung¹, Balaji Krishnapuram¹, Nicolas J. Merlet¹, Eli Ratner¹, Philippe Bamberger¹, Jonathan Stoeckel¹, R. Bharat Rao¹ - Show less +3 more•Institutions (1)

Siemens¹

18 Jun 2006

TL;DR: Experimental studies on the detection of clusters of micro-calcifications indicate that the proposed method significantly outperforms a state-of-the-art general purpose method for designing classifiers (SVM), in terms of FROC curves on a hold out test set.

...read moreread less

Abstract: Computer aided detection systems for mammography typically use standard classification algorithms from machine learning for detecting lesions. However, these general purpose learning algorithms make implicit assumptions that are commonly violated in CAD problems. We propose a new ensemble algorithm that explicitly accounts for the small fraction of outlier images which tend to produce a large number of false positives. A bootstrapping procedure is used to ensure that the candidates from these outlier images do not skew the statistical properties of the training samples. Experimental studies on the detection of clusters of micro-calcifications indicate that the proposed method significantly outperforms a state-of-the-art general purpose method for designing classifiers (SVM), in terms of FROC curves on a hold out test set.

...read moreread less

3 citations