Journal ArticleDOI

Medical diagnosis with C4.5 rule preceded by artificial neural network ensemble

Zhi-Hua Zhou, +1 more
Vol. 7, Iss. 1, pp. 37-42
TLDR
Case studies on diabetes, hepatitis, and breast cancer show that C4.5 Rule-PANE could generate rules with strong generalization ability, which benefits from the artificial neural network ensemble, and strong comprehensibility, which benefits from rule induction.
Abstract
Comprehensibility is very important when machine learning techniques are used in computer-aided medical diagnosis. Since an artificial neural network ensemble is composed of multiple artificial neural networks, its comprehensibility is worse than that of a single artificial neural network. In this paper, C4.5 Rule-PANE, which combines an artificial neural network ensemble with rule induction by regarding the former as a preprocess of the latter, is proposed. First, an artificial neural network ensemble is trained. Then, a new training data set is generated by feeding the feature vectors of the original training instances to the trained ensemble and replacing the expected class labels of the original training instances with the class labels output by the ensemble. Additional training data may also be appended by randomly generating feature vectors and combining them with their corresponding class labels output by the ensemble. Finally, a specific rule induction approach, i.e., C4.5 Rule, is used to learn rules from the new training data set. Case studies on diabetes, hepatitis, and breast cancer show that C4.5 Rule-PANE could generate rules with strong generalization ability, which benefits from the artificial neural network ensemble, and strong comprehensibility, which benefits from rule induction.
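The pipeline described in the abstract can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: trivial bootstrap-trained threshold classifiers ("decision stumps") stand in for the neural networks, majority voting stands in for the ensemble's combination scheme, and the final C4.5 Rule step is omitted. All names and the toy data are illustrative.

```python
import random

def train_stump(data):
    """Pick the (feature, threshold, orientation) split with best training accuracy.

    A toy stand-in for training one neural network in the ensemble.
    """
    best = (-1.0, None)
    n = len(data)
    for f in range(len(data[0][0])):
        for x, _ in data:
            t = x[f]
            acc = sum((xi[f] > t) == yi for xi, yi in data) / n
            for sign, a in ((True, acc), (False, 1 - acc)):
                if a > best[0]:
                    best = (a, (f, t, sign))
    f, t, sign = best[1]
    return lambda x: (x[f] > t) == sign

def ensemble_predict(members, x):
    """Majority vote over the ensemble members."""
    return 2 * sum(m(x) for m in members) >= len(members)

random.seed(0)
# Toy binary data: label is True iff x0 + x1 > 1 (illustrative, not a medical data set).
data = [[random.random(), random.random()] for _ in range(60)]
data = [(x, x[0] + x[1] > 1.0) for x in data]

# Step 1: train an ensemble, each member on a bootstrap sample.
members = [train_stump(random.choices(data, k=len(data))) for _ in range(7)]

# Step 2: replace the original class labels with the ensemble's outputs.
relabeled = [(x, ensemble_predict(members, x)) for x, _ in data]

# Step 3: append randomly generated feature vectors labeled by the ensemble.
extra = [[random.random(), random.random()] for _ in range(40)]
relabeled += [(x, ensemble_predict(members, x)) for x in extra]

# Step 4 (omitted): run a rule learner such as C4.5 Rule on `relabeled`,
# so the induced rules mimic the ensemble's decision function.
```

The key design point mirrors the paper: the rule learner never sees the original labels, only the ensemble's, so it approximates the ensemble's (presumably more accurate) decision function while producing comprehensible rules.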


Citations
Journal ArticleDOI

Exploratory Undersampling for Class-Imbalance Learning

TL;DR: Experiments show that the proposed algorithms, BalanceCascade and EasyEnsemble, have better AUC scores than many existing class-imbalance learning methods and have approximately the same training time as that of under-sampling, which trains significantly faster than other methods.
Journal ArticleDOI

Improve Computer-Aided Diagnosis With Machine Learning Techniques Using Undiagnosed Samples

TL;DR: Case studies on three medical data sets and a successful application to microcalcification detection for breast cancer diagnosis show that undiagnosed samples are helpful in building CAD systems, and that Co-Forest is able to enhance the performance of a hypothesis learned from only a small amount of diagnosed samples by utilizing the available undiagnosed samples.
Proceedings ArticleDOI

Exploratory Under-Sampling for Class-Imbalance Learning

TL;DR: Experiments show that the proposed algorithms, BalanceCascade and EasyEnsemble, have better AUC scores than many existing class-imbalance learning methods and have approximately the same training time as that of under-sampling, which trains significantly faster than other methods.
Journal ArticleDOI

Using Three Machine Learning Techniques for Predicting Breast Cancer Recurrence

TL;DR: The present research studied the application of data mining techniques to develop predictive models for breast cancer recurrence in patients who were followed up for two years, and found that the prediction accuracy of the DT model is the lowest of all.

Predicting breast cancer survivability using data mining techniques

TL;DR: This paper investigated three data mining techniques: the Naive Bayes, the back-propagated neural network, and the C4.5 decision tree algorithms, and found that the C4.5 algorithm has a much better performance than the other two techniques.
References
Book

An introduction to the bootstrap

TL;DR: This article presents bootstrap methods for estimation, using simple arguments, with Minitab macros for implementing these methods, as well as some examples of how these methods could be used for estimation purposes.
Journal ArticleDOI

Learning representations by back-propagating errors

TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.
Journal ArticleDOI

Bagging predictors

Leo Breiman
TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.

Programs for Machine Learning

TL;DR: In his new book, C4.5: Programs for Machine Learning, Quinlan has put together a definitive, much needed description of his complete system, including the latest developments, which will be a welcome addition to the library of many researchers and students.