scispace - formally typeset
Open AccessProceedings ArticleDOI

An Algorithm for Classifying Incomplete Data with Selective Bayes Classifiers

Reads0
Chats0
TLDR
Experiments on twelve benchmark incomplete data sets show that this algorithm can greatly improve the accuracy of classification and sharply reduce the number of attributes and so can greatly simplify the data sets and classifiers.
Abstract
Actual data sets are often incomplete because of various kinds of reason. Although many algorithms for classification have been proposed, most of them deal with complete data. So methods of constructing classifiers for incomplete data deserve more attention. By analyzing main methods of processing incomplete data for classification, this paper presents a selective Bayes classifier for classifying incomplete data. The proposed algorithm needs no assumption about data sets that are necessary for previous methods of processing incomplete data. Experiments on twelve benchmark incomplete data sets show that this algorithm can greatly improve the accuracy of classification. Furthermore, it can also sharply reduce the number of attributes and so can greatly simplify the data sets and classifiers.

read more

Citations
More filters
Journal ArticleDOI

Towards a HPC-oriented parallel implementation of a learning algorithm for bioinformatics applications

TL;DR: In this article, a parallel implementation of the U-BRAIN algorithm is presented, which is able to minimize both the memory space and the execution time of the algorithm, and is shown to be up to 30 times faster than the serial one.
Journal ArticleDOI

Towards a HPC-oriented parallel implementation of a learning algorithm for bioinformatics applications

TL;DR: The HPC-oriented implementation of a general purpose learning algorithm, originally conceived for DNA analysis and recently extended to treat uncertainty on data (U-BRAIN), is discussed and implemented and tested on the INTEL XEON E7xxx and E5xxx family of the CRESCO structure.
Journal ArticleDOI

A selective Bayes classifier with meta-heuristics for incomplete data

TL;DR: This study combined Electromagnetism-like Mechanism algorithm with RBC for feature selection and classification tasks with incomplete data and indicated greatly improved RBC performance combined with each feature selection approach.
References
More filters
Book

Data Mining: Practical Machine Learning Tools and Techniques

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Journal ArticleDOI

Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images

TL;DR: The analogy between images and statistical mechanics systems is made and the analogous operation under the posterior distribution yields the maximum a posteriori (MAP) estimate of the image given the degraded observations, creating a highly parallel ``relaxation'' algorithm for MAP estimation.
Book

Statistical Analysis with Missing Data

TL;DR: This work states that maximum Likelihood for General Patterns of Missing Data: Introduction and Theory with Ignorable Nonresponse and large-Sample Inference Based on Maximum Likelihood Estimates is likely to be high.