scispace - formally typeset
Search or ask a question
Journal ArticleDOI

A patient-adaptable ECG beat classifier using a mixture of experts approach

TL;DR: A "mixture-of-experts" (MOE) approach to develop customized electrocardiogram (EGG) beat classifier in an effort to further improve the performance of ECG processing and to offer individualized health care.
Abstract: Presents a "mixture-of-experts" (MOE) approach to develop customized electrocardiogram (EGG) beat classifier in an effort to further improve the performance of ECG processing and to offer individualized health care. A small customized classifier is developed based on brief, patient-specific ECG data. It is then combined with a global classifier, which is tuned to a large ECG database of many patients, to form a MOE classifier structure. Tested with MIT/BIH arrhythmia database, the authors observe significant performance enhancement using this approach.

Summary (3 min read)

Introduction

  • A large in-house ECG database is developed and maintained to test each ECG processing algorithm to be incorporated into the product.
  • The result is a complicated classifier which is costly to develop, maintain, and update.
  • The authors may include the training algorithm and the database used to develop the classifier to be delivered to the users, so that the classification algorithm can be finetuned to each patient.

A. ECG Beat Classification Techniques

  • Automated ECG beat classification was traditionally performed using a decision-tree-like approach, based on various features extracted from an ECG beat [1], [4], [5], [13], [20], [22].
  • The features used include the width and height of QRS complex, RR interval, QRS complex area, etc.
  • Their abilities to learn from examples and extract the statistical properties of the examples presented during the training sessions, make them an ideal choice for an automated process that imitates human logic.
  • Several efforts have been made to apply ANN’s for the purpose of ECG beat detection and classification.
  • They have achieved an average recognition accuracy of 90% in classifying the beats into two groups; normal and abnormal.

B. Self-Organization Map (SOM) and Learning Vector Quantization (LVQ)

  • SOM and LVQ are both clustering based algorithms proposed by Kohonen [14], [15].
  • In LVQ1, for a given input vector , a code word is found such that (2) The code word is then updated as follows: (3) where if the classification is correct [i.e., and have the same class label] and , otherwise.
  • As such, the development of the code book and eventually decision boundary can be made completely transparent to the user.
  • The resulting code book then will be submitted to the LVQ PAK to facilitate fine tuning and classification.

III. MIXTURE OF EXPERTS (MOE)

  • This user adaptation problem bears certain resemblance to the incremental learning problem in that new data are to be incorporated to improve existing classifier’s performance.
  • The LE represents a specialized ECG beat classifier, trained on a small segment of annotated ECG beats taken from the specific patient.
  • In the MOE method, the combined th output vector of both the experts is given by (6) where is the input feature vector, , are the weighting vectors for each expert from a gating network and are defined by (7) where ’s are the weight vectors of the gating network.
  • Define , and , , to be the subregion in the feature space where the classifier makes correct classification of and let be defined the same way, also known as Theorem 1.
  • The authors further partition this training data set into two subsets: one for the training of the user-specific classifier , and the other for estimating and .

IV. EXPERIMENT

  • The purpose of this experiment is to demonstrate the usefulness of the proposed user-adaptation procedure.
  • In particular, the authors will show that an ECG beat classifier trained on general patient records does not perform well when presented with patient records which contain rare beat types.
  • Moreover, the authors show that the performance of the MOE classifier is able to gain significant performance enhancement with a small amount of annotated patient specific training data.

A. Data Preparation

  • The authors concentrate on the classification of ventricular ectopic beats (VEB’s).
  • According to the AAMIrecommended practice, records containing the paced beats (four records) can be excluded from the reporting requirements.
  • The first group is intended to serve as a representative sample of a variety of waveforms and artifacts which an arrhythmia detector might encounter in routine clinical use.
  • If this GE classifier were a commercial device, it will be deemed not-applicable (due to low performance) to many of these 20 test records.
  • Each of the four categories included beats of several types as shown in Table III.

B. Training and Testing Procedure

  • A GE classifier was developed with SOM and LVQ algorithms using the data from the records of the first group (100–124).
  • The objective of this paper is to classify the QRS beats into one of the four different categories.
  • This is a reasonable assumption since each code word is obtained using the SOM clustering algorithm based on the L norm distance measure.
  • The LE classifier is developed in exactly the same manner as the global classifier, except that it uses only the first two and half minutes in the tape, and is constructed separately for each particular “patient tape” (tape #200–234) in the MIT/BIH database.
  • The output of the classifier is calculated as given by (6).

C. Results

  • The classifier was tested with the selected 20 records of the second group of the MIT database.
  • The GE was left intact and is used as is for testing the 25 min of data from each 30-min testing record with first 5 min excluded as they are used to develop the LE and the “gating network.”.
  • All detection statistics are founded on the mutually exclusive categories of true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN).
  • Sensitivity, specificity, and positive predictivity are used to compare the results, also known as Three statistics.
  • These three statistics, together with the percentage classification rates, are reported for each individual testing file as required by the AAMI-recommended practice [18].

D. Discussion

  • 1) From Tables V and VI, the authors observe that the MOE approach is capable of significantly enhancing the performance of an ECG beat classifier over the global classifier.
  • 2) Comparing the LE and ME, the authors found that LE outperformed ME in terms of classification rate, mainly due to higher specificity (ability to correctly classify normal beats), but with lower sensitivity (ability to correctly classify PVC beats as PVC).
  • Hence, although a LE classifier performs well, the availability of a global classifier does help to further enhance its performance.
  • 4) A potential drawback of this proposed method is the need to develop a LE classifier for each individual patient, even with only 5 min of patient’s ECG record.
  • Since this must be performed by a physician or a ECG specialist, potentially it would be very costly.

V. CONCLUSION

  • The authors developed a novel approach to demonstrate the feasibility of having a patient-adaptable ECG beat classification algorithm.
  • The authors outlined the basic requirements of such a system, namely accuracy, cost-effectiveness and protection of the device manufactures intellectual property rights.
  • The authors presented a SOM/LVQ-based approach to illustrate that these requirements can be met.
  • The potential benefit of patient adaptation is immense and is worth pursuing further.
  • The authors believe it can be easily adapted to other automated patientmonitoring algorithms and eventually support decentralized remote patient-monitoring systems.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 44, NO. 9, SEPTEMBER 1997 891
A Patient-Adaptable ECG Beat Classifier
Using a Mixture of Experts Approach
Yu Hen Hu,* Senior Member, IEEE, Surekha Palreddy, and Willis J. Tompkins, Fellow, IEEE
AbstractWe present a “mixture-of-experts” (MOE) approach
to develop customized electrocardigram (ECG) beat classifier in
an effort to further improve the performance of ECG processing
and to offer individualized health care. A small customized
classifier is developed based on brief, patient-specific ECG data.
It is then combined with a global classifier, which is tuned to a
large ECG database of many patients, to form a MOE classifier
structure. Tested with MIT/BIH arrhythmia database, we observe
significant performance enhancement using this approach.
Index Terms ECG beat classification, MIT/BIH database,
mixture of experts, neural network, patient adaptation.
I. INTRODUCTION
C
OMPUTERIZED electrocardiography is now a well-
established practice, after several years of significant
progress. Many algorithms have been proposed over years for
electrocardiogram (ECG) beat detection and classification. In
a clinical setting, such as an intensive care unit, it is essential
for automated systems to accurately detect and classify elec-
trocardiographic signals on a real-time basis. Since several
arrhythmia are potentially dangerous and life threatening, if
not detected within a few seconds to a few minutes of its
onset, automated electrocardiographic monitoring assumes a
challenging role. Several algorithms have been proposed in
the literature for detection and classification of ECG beats and
reported results, that leave room for improvement. They in-
clude signal processing techniques; such as frequency analysis,
template matching, and other parameter extraction methods.
Artificial neural networks were also employed to exploit their
natural ability in pattern-recognition tasks for successful clas-
sification of ECG beat [2], [3], [6]–[8], [23]–[25], [28]–[31].
One major problem faced by today’s automatic ECG anal-
ysis machine is the wild variations in the morphologies of
ECG waveforms of different patients and patient groups.
An ECG beat classifier which performs well for a given
training database often fails miserably when presented with
a different patient’s ECG waveform. Such an inconsistency
in performance is a major hurdle preventing highly reliable,
fully automated ECG processing systems to be widely used
clinically.
Manuscript received September 13, 1995; revised May 5, 1997. Asterisk
indicates corresponding author.
*Y. H. Hu is with the Department of Electrical and Computer
Engineering, University of Wisconsin, Madison, WI 53706 USA (e-mail:
hu@engr.wisc.edu).
S. Palreddy and W. J. Tompkins are with the Department of Electrical and
Computer Engineering, University of Wisconsin, Madison, WI 53706 USA.
Publisher Item Identifier S 0018-9294(97)06116-8.
One obvious approach to alleviate this problem is to use as
much training data as possible to develop the ECG classifier.
This is the approach taken by all the vendors of ECG pro-
cessing devices: A large in-house ECG database is developed
and maintained to test each ECG processing algorithm to be
incorporated into the product. However, such an approach
suffers several pitfalls.
1) No matter how large this database may be, it is not
possible to cover every ECG waveform of all potential
patients. Hence, its performance is inherently limited.
2) The complexity of the classifier grows as the size of the
training database grows. When a classifier is designed
to correctly classify ECG from millions of patients
(if it ever becomes possible), it has to take numerous
exceptions into account. The result is a complicated
classifier which is costly to develop, maintain, and
update.
3) It is practically impossible to make the classifier learn to
correct errors during normal clinical use. Thus, it may be
rendered useless if it fails to recognize a specific type of
ECG beats which occurs frequently in certain patient’s
ECG records.
The answer, we believe, is to allow the classifier to be
“patient-adaptable.” That is, to let the classification algorithm
adaptable to the special characteristics of each patient’s ECG
records. For example, we may include the training algorithm
and the database used to develop the classifier to be delivered
to the users, so that the classification algorithm can be fine-
tuned to each patient. Unfortunately, this is impractical for
several reasons.
While it is possible to turn over training algorithms and
databases to the users in an academic environment, it
is unlikely that any commercial ECG machine vendor
is willing to risk revealing their proprietary information
to their competitors. Moreover, in-house database often
contains millions of ECG records which could be costly
to distribute.
Users often do not want to be bothered by implementation
details of an ECG algorithm. Thus, few users will be able
to take advantage of this patient-adaptation feature even
if it is available.
Even if a user is willing to perform the patient cus-
tomization, he or she still have to provide sufficient
number of patient-specific training data in order to per-
form patient-adaptation. Manually editing ECG record is
a time consuming, labor intensive task. Hence, the size of
0018–9294/97$10.00 1997 IEEE

892 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 44, NO. 9, SEPTEMBER 1997
patient-specific training data must be tightly controlled.
In this study, we propose a novel approach to patient-
adaptation while avoiding these difficulties: 1) We do not
require the factory-trained ECG classifier to provide training
algorithms or training databases. Instead, all we need is that
this classifier gives both its classification results, as well as
an estimate of posterior probability of the feature vector as is
drawn from each particular class. Hence, no company propri-
etary information is needed. 2) A patient-specific classifier will
be developed using an automated procedure, without human
supervision. 3) Only a brief manually edited patient ECG
record (2–5 min) is needed to achieve significant performance
improvement.
This proposed approach is based on three popular artificial
neural network (ANN)-related algorithms, namely, the self-
organizing maps (SOM), learning vector quantization (LVQ)
algorithms, along with the mixture-of-experts (MOE) method.
SOM and LVQ together are used to train the patient-specific
classifier, and MOE is a paradigm which facilitates the com-
bination of the two classifiers (original and patient-specific)
to realize patient-adaptation. In MOE, the two classifiers are
modeled as two experts on ECG beat classification. The
original classifier, called the Global expert (GE) in this work,
knows how to classify ECG beats for many other patients
whose ECG records are part of the in-house, large ECG
database. The patient-specific classifier, called the local expert
(LE) in this work, is trained specifically with the ECG record
of the patient. A gating function, based on the feature vector
presented, dynamically weights the classification results of the
GE’s and the LE’s to reach a combined decision. The process
is analogous to two human experts arriving at a consensus
based on their own expertise.
Section II reports the results of literature survey and
Section III discusses data acquisition with preprocessing.
Section IV discusses the proposed algorithms and the
development of experts. Section V reports the results of the
classifier on the database records and discusses the results.
Section VI is a summary of the findings of this paper.
II. P
RELIMINARIES
A. ECG Beat Classification Techniques
Automated ECG beat classification was traditionally per-
formed using a decision-tree-like approach, based on various
features extracted from an ECG beat [1], [4], [5], [13], [20],
[22]. The features used include the width and height of QRS
complex, RR interval, QRS complex area, etc. One of the
difficulties is that these features are susceptible to variations of
ECG beat morphology and temporal characteristics. As such,
the classification rate reported in these earlier efforts are rather
moderate.
Artificial neural networks (ANN’s) have been widely ac-
cepted for pattern recognition tasks. Their abilities to learn
from examples and extract the statistical properties of the
examples presented during the training sessions, make them
an ideal choice for an automated process that imitates human
logic. Several efforts have been made to apply ANN’s for
the purpose of ECG beat detection and classification. Previ-
ous reported efforts include [2], [3], [6]–[8], [23]–[25], and
[28]–[31].
Hu et al. [7] reported the development of an adaptive
multilayer perceptron (MLP) for classification of ECG beats.
They have achieved an average recognition accuracy of 90%
in classifying the beats into two groups; normal and abnormal.
In an attempt to classify the beats into 13 groups according to
the MIT Database annotations, they have reported an average
recognition accuracy rate of 65%. An hierarchical system of
the MLP networks which first classify the beat into normal
or abnormal, and then classify it into the specific beat type, is
developed, which improved the recognition accuracy to 84.5%.
B. Self-Organization Map (SOM) and Learning
Vector Quantization (LVQ)
SOM and LVQ are both clustering based algorithms pro-
posed by Kohonen [14], [15]. SOM is an unsupervised on-line
clustering technique. In SOM, each cluster center (prototype
or code word) is represented by the weights of a neuron which
is assigned to a coordinate in the feature map. The SOM
training algorithm forces adjacent neurons in the feature map
to respond to similar feature (input) vectors. In a way, this
feature map is analogous to the spatial organization of sensory
processing areas in the brain. Let
be denoted as the
weights (code word) or the
th neuron in SOM during the time
instant
, the weights of SOM then are updated according to
the following simple formula:
(1)
is the so-called neighborhood kernel, which determine
the size of neighborhood of the
th neuron within which all
neighboring neurons will be updated in response to the present
feature vector
. Initially, the neighborhood is large. The
size reduces as clustering converges, until no neighboring
neurons will get updated.
LVQ is a supervised, clustering-based classification tech-
nique which classifies a feature vector
according to the
label of the cluster prototype (code word) into which
is
clustered. Classification error occurs when the feature vectors
within the same cluster (hence, assigned to the same class
label) are actually drawn from different classes. To minimize
classification error, the LVQ algorithm fine tunes the clustering
boundary between clusters of different class labels by modi-
fying the position of the clustering center (prototype or code
word). This method is called “learning vector quantization
because this clustering based classification method is similar to
the vector quantization method used for signal compression
in the areas of communication and signal processing.
According to Kohonen, there are three different LVQ algo-
rithms, called LVQ1, LVQ2, and LVQ3 developed at subse-
quent stages to handle classification problems with different
natures. In this study, the optimized learning-rate LVQ1 and
LVQ3 algorithms were used for the training and fine-tuning of
the code book respectively. In LVQ1, for a given input vector
, a code word is found such that
(2)

HU et al.: PATIENT-ADAPTABLE ECG BEAT CLASSIFIER 893
The code word is then updated as follows:
(3)
where
if the classification is correct [i.e., and
have the same class label] and , otherwise.
is a time-varying learning rate. Other code words in the code
book remain unchanged. LVQ3 differs from LVQ1 in how
the code words are updated: Assuming that
falls within
a window between two adjacent clusters with corresponding
code words
and . Suppose that and belong to the
same class, and
and belong to different classes, then both
these code words will be updated in LVQ3:
(4a)
(4b)
On the other hand, if both
and belong to the same class
as
, and fall in a window centered at the cluster
boundary of these two classes, then
(5)
The optimal value of
depends on the size of the window,
being smaller for narrower windows. This algorithm is self-
stabilizing, and optimal placement of the
does not change
in continual training.
Software packages of both SOM and LVQ are available
in the public domain,
1
and the application of these packages
to the ECG beat classification problem is straight forward.
The adaptation parameters in these packages (SOM
PAK and
LVQ
PAK) were carefully fine tuned while developing the
classifiers. As such, the development of the code book and
eventually decision boundary can be made completely trans-
parent to the user. Moreover, performance obtained using these
package is very competitive compared to other approaches. In
this research work, we first apply SOM to a set of training
feature vectors. The resulting code book (prototypes) then will
be submitted to the LVQ
PAK to facilitate fine tuning and
classification.
III. M
IXTURE OF EXPERTS (MOE)
This user adaptation problem bears certain resemblance to
the incremental learning problem in that new data are to
be incorporated to improve existing classifier’s performance.
However, the black-box model of the existing classifier pre-
vents us from directly modifying the classifier structure as
incremental learning algorithms do. Instead, we propose a
different method called the MOE, to circumvent this problem.
The MOE approach was proposed by Jacobs et al. [9]–[12],
[16], [26], [27]. The basic notion is that linear combinations
of several statistical estimates can perform better than any
individual estimate. This strategy is not new. It is a well
known fact that a panel of experts often arrive at a better
diagnosis than any single expert, because each expert is able
to contribute from his/her own expertise.
1
University of Helsinki, Finland, URL: ftp://cochlea.hut.fi/pub/
The basic idea is to leave the existing black-box classifier
intact. Instead, we use the given small, user-specific training
data set to develop a LE classifier. Then we invoke a modified
MOE approach to combine these two classifiers, hoping to
achieve better performance.
To apply the MOE approach to solve the customization
problem, we employ two experts: a GE and a LE. The GE
represents the ECG beat classifier developed in factory. Thus,
it is trained to classify all types of ECG beats present in the
in-house ECG database. The LE represents a specialized ECG
beat classifier, trained on a small segment of annotated ECG
beats taken from the specific patient. As such, the GE and the
LE are endowed with complementary knowledge bases, and
can work together to reach a better decision than any one can
reach individually.
The expert network is a combination of the GE and LE
classifiers. Let
and be the output (row) vectors of
the two respective GE and LE classifiers. Each element of each
vector indicates the degree of proximity of an unknown ECG
beat to a predefined ECG beat class (category). In the MOE
method, the combined
th output vector of both the experts
is given by
(6)
where
is the input feature vector, , are the
weighting vectors for each expert from a gating network and
are defined by
(7)
where
’s are the weight vectors of the gating network. Note
that
.
Theorem 1: Define
, and
, , to be the subregion in the feature space where
the classifier
makes correct classification of and let
be defined the same way. Assume and
, then
(8)
Proof: We need only to prove that if both
and
misclassify a given feature vector , then cannot
give correct classification on
. Since the correct classifica-
tion output
, the combined output , and individual
classifier output
and are all binary vectors of the
same dimension, if both classifiers misclassify a given feature
vector
which belongs to class , we must have, for the th
elements of these binary vectors
where is the “exclusive-OR” operator in Boolean algebra.
Since from (7),
, we conclude ,if
, and ,if .
Hence,
. In other words, must also
misclassify the same feature vector
regardless the choice

894 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 44, NO. 9, SEPTEMBER 1997
of and . This is to say, if , and if
, then .
The implication of Theorem 1 is that the maximum per-
formance enhancement of a MOE approach occurs when
(empty set). An example is to
designate each classifier to be responsible for classifying
a particular class. The assumption that
is
essential in this theorem. If
(interval between
zero and one), it is possible to find a counter example. Let
, , and
. Then .If
, then which yields correct
classification.
On the other hand, whether
takes binary values or
not, if both classifiers make correct classification, so will the
combined classifier.
Theorem 2: With the same definitions as in Theorem 1, and
(9)
Proof: Assume
[class , and ,
. Then
(10)
Thus, the output
is correctly classified.
From Theorem 2, it is clear that if both classifiers #1 and #2
correctly classify a pattern
, then the combined classifier will
also correctly classify the same pattern. Hence, this pattern can
be excluded from the user-adaptation training set as it will not
affect the result.
Adaptation Algorithm: Based on the result indicated in
Theorems 1 and 2, the design objective of the MOE network in
(3) is to devise a training algorithm to estimate the parameter
vectors
. Given that and are fixed
classifiers, this problem can be solved by a gradient procedure
as follows: Let us assume
be a set
of training data used for searching the optimal gating functions
and , such that the square error at the output
is minimized.
A gradient search algorithm can be devised as follows:
(11)
The initial values of
and are set to be the centroids of
the regions
and , respectively, for in the
user-specific training data set. The gradient of
with respect
to
can be calculated as
(12)
(13)
where
. In (13), we
assumed the transfer function
is a differentiable threshold
function, and is applied to the vector, element by element.
Finally, with (13), we have as shown in (14) at the bottom of
the page. Hence, for
, we have
(15)
Note that in above derivation, the error
is accumulated
over the entire epoch (
feature vectors). The summation
over
may be removed if we use on-line update of ’s
for each sample. This yields the following expression for
:
diag
(16)
Clearly, we have
. This is not surprising
with two parameter vectors arriving at a decision hyperplane
.
Until now, we have assumed that the user-specific ECG
beat classifier
is readily available. However, in reality
it needs to be trained with the user-specific training data set.
Also, the combined classifier
needs to be trained by
the same data set in order to determine the gating network
parameters. Therefore, if
is trained to 100% accuracy
(14)

HU et al.: PATIENT-ADAPTABLE ECG BEAT CLASSIFIER 895
on the user-specific data set, then the gating network of choice
may be
and . In light of the results
of Theorems 1 and 2, we devised the following strategy to
alleviate this problem: First, we construct the user-specific
training data set to contain only those feature vectors which
the original classifier misclassified. We further partition this
training data set into two subsets: one for the training of the
user-specific classifier
, and the other for estimating
and .
IV. E
XPERIMENT
The purpose of this experiment is to demonstrate the useful-
ness of the proposed user-adaptation procedure. In particular,
we will show that an ECG beat classifier trained on general
patient records does not perform well when presented with
patient records which contain rare beat types. Moreover, we
show that the performance of the MOE classifier is able to gain
significant performance enhancement with a small amount of
annotated patient specific training data.
A. Data Preparation
In this study, we concentrate on the classification of ven-
tricular ectopic beats (VEB’s). The 48 records (tapes) from
MIT/BIH ECG arrhythmia database [17], [19] are used for the
development and evaluation of the classifier. The availability
of annotated MIT/BIH database has enabled the evaluation of
performance of the proposed beat classification algorithm. The
American Association of Medical Instrumentation (AAMI)-
recommended practice [18] has provided a protocol for a
reproducible test with realistic clinical requirements, empha-
sizing tape-by-tape presentation of results that estimate an
algorithm’s ability to detect events of clinical significance.
Accompanying each tape in the MIT/BIH database is an
annotation file in which each ECG beat has been identified
by expert cardiologist annotators. These labels are referred to
as “truth” annotations and are used in training (developing)
the classifiers and also to evaluate the performance of the
classifiers (experts) in testing phase. According to the AAMI-
recommended practice, records containing the paced beats
(four records) can be excluded from the reporting require-
ments. Since this study is to evaluate the performance of a
classifier that can identify a premature ventricular contraction
(PVC), certain records in the database with no PVC’s (11
records) were excluded from the study, leaving 33 records of
interest. These excluded records are listed in Table I. Data
from channel 1, down-sampled to 180 samples/s were used in
this study. The selected files consist of 13 records (numbered
from 100–124, inclusive, with some numbers missing) and
20 records (numbered from 200–234, inclusive, with some
numbers missing). The first group is intended to serve as a
representative sample of a variety of waveforms and artifacts
which an arrhythmia detector might encounter in routine
clinical use. Records in the second group include complex
ventricular, junctional, and supraventricular arrhythmias and
conduction abnormalities. Several of these records are ex-
pected to present significant difficulty to arrhythmia detectors
because of the features of the rhythm, QRS morphology
TABLE I
R
ECORDS OF MIT/BIH DATABASE THAT WERE EXCLUDED FROM THE STUDY
TABLE II
F
OUR CATEGORIES OF INTEREST INTO WHICH THE
ECG BEATS OF THIS STUDY ARE CLASSIFIED
variation, and signal quality. These records were reported to
have gained considerable notoriety among database users [18].
In this experiment, we use the first group of files as the
training data to develop a GE classifier which is able to
classify typical ECG beats. The second group of 20 records
is used to simulate the ECG records of 20 patients, which
are to be classified by the GE classifier. Since these records
consist of less-frequently seen beats, it is expected that the
GE classifier will not perform well. If this GE classifier were
a commercial device, it will be deemed not-applicable (due to
low performance) to many of these 20 test records. However,
with the MOE approach, we will adapt this GE classifier with
a LE classifier to gain significant performance enhancement
at low cost.
The beats in the MIT/BIH database are of several different
types. In this study, we are interested in identifying four
different categories, as indicated in Table II. Each of the
four categories included beats of several types as shown in
Table III. The AAMI convention was used to combine the
beats into four classes of interest.
B. Training and Testing Procedure
In this study, a GE classifier was developed with SOM and
LVQ algorithms using the data from the records of the first
group (100–124). Before testing the records, a LE classifier
was developed for each of the records in the second group
using the first 2.5 min of data. The rest of the record is
then tested using the mixture of global and LE’s as explained
before. Since each record in the MIT/BIH database is of
length 30 min, the 2.5 min segment account for 1/12th of total
available patient specific data and contains approximately 150
ECG beats. In practice, the attending cardiologist or any expert
in ECG beat annotation will have to annotate a brief segment
of patient-specific ECG in order to take advantage of the
MOE approach. We believe that this is a reasonably small cost
compared to the potential gain in performance enhancement.
In future, we will explore a more effective method to further
reduce the amount of required annotated patient-specific data.

Citations
More filters
01 Jan 1990
TL;DR: An overview of the self-organizing map algorithm, on which the papers in this issue are based, is presented in this article, where the authors present an overview of their work.
Abstract: An overview of the self-organizing map algorithm, on which the papers in this issue are based, is presented in this article.

2,933 citations

Journal ArticleDOI
TL;DR: A method for the automatic processing of the electrocardiogram (ECG) for the classification of heartbeats and results are an improvement on previously reported results for automated heartbeat classification systems.
Abstract: A method for the automatic processing of the electrocardiogram (ECG) for the classification of heartbeats is presented. The method allocates manually detected heartbeats to one of the five beat classes recommended by ANSI/AAMI EC57:1998 standard, i.e., normal beat, ventricular ectopic beat (VEB), supraventricular ectopic beat (SVEB), fusion of a normal and a VEB, or unknown beat type. Data was obtained from the 44 nonpacemaker recordings of the MIT-BIH arrhythmia database. The data was split into two datasets with each dataset containing approximately 50 000 beats from 22 recordings. The first dataset was used to select a classifier configuration from candidate configurations. Twelve configurations processing feature sets derived from two ECG leads were compared. Feature sets were based on ECG morphology, heartbeat intervals, and RR-intervals. All configurations adopted a statistical classifier model utilizing supervised learning. The second dataset was used to provide an independent performance assessment of the selected configuration. This assessment resulted in a sensitivity of 75.9%, a positive predictivity of 38.5%, and a false positive rate of 4.7% for the SVEB class. For the VEB class, the sensitivity was 77.7%, the positive predictivity was 81.9%, and the false positive rate was 1.2%. These results are an improvement on previously reported results for automated heartbeat classification systems.

1,449 citations


Cites background or methods or result from "A patient-adaptable ECG beat classi..."

  • ...For comparison, the classification performance of a published system [11] is shown in Table VII(b)....

    [...]

  • ...Classifiers methods employed include linear discriminants [7], back propagation neural networks [8]–[10], self-organizing maps with learning vector quantization [11], and self-organizing networks [12]....

    [...]

  • ...[11], as this study looked at the problem of distinguishing VEB from non-VEB heartbeats on the MIT-BIH arrhythmia database....

    [...]

  • ...TABLE VII (A) CLASSIFICATION PERFORMANCE OF CONFIGURATION IX ON EACH RECORDING OF DS2 USING THE AAMI RECOMMENDED PERFORMANCE MEASURES, (B) AGGREGATE CLASSIFICATION PERFORMANCE FOR THE SYSTEM IN [11], AND (C) THE CLUSTERING PERFORMANCE OF CLASSIFIER IX AND SYSTEM FROM [12]...

    [...]

  • ...The AAMI standards are adopted in this study and our results have been compared to those of [11] and [12]....

    [...]

Journal ArticleDOI
TL;DR: A fast and accurate patient-specific electrocardiogram (ECG) classification and monitoring system that achieves a superior classification performance than most of the state-of-the-art methods for the detection of ventricular ectopic beats and supraventricular ectopy beats.
Abstract: Goal: This paper presents a fast and accurate patient-specific electrocardiogram (ECG) classification and monitoring system. Methods: An adaptive implementation of 1-D convolutional neural networks (CNNs) is inherently used to fuse the two major blocks of the ECG classification into a single learning body: feature extraction and classification. Therefore, for each patient, an individual and simple CNN will be trained by using relatively small common and patient-specific training data, and thus, such patient-specific feature extraction ability can further improve the classification performance. Since this also negates the necessity to extract hand-crafted manual features, once a dedicated CNN is trained for a particular patient, it can solely be used to classify possibly long ECG data stream in a fast and accurate manner or alternatively, such a solution can conveniently be used for real-time ECG monitoring and early alert system on a light-weight wearable device. Results: The results over the MIT-BIH arrhythmia benchmark database demonstrate that the proposed solution achieves a superior classification performance than most of the state-of-the-art methods for the detection of ventricular ectopic beats and supraventricular ectopic beats. Conclusion: Besides the speed and computational efficiency achieved, once a dedicated CNN is trained for an individual patient, it can solely be used to classify his/her long ECG records such as Holter registers in a fast and accurate manner. Significance: Due to its simple and parameter invariant nature, the proposed system is highly generic, and, thus, applicable to any ECG dataset.

1,300 citations


Cites background or methods from "A patient-adaptable ECG beat classi..."

  • ...There have been several methods for generic and fully automatic ECG classification based on signal processing techniques, such as frequency analysis [2], wavelet transform [3] and filter banks [4], statistical [5] and heuristic approaches [6], hidden Markov models [7], support vector machines [8], artificial neural networks (ANNs) [9], and mixture-of-experts method [10]....

    [...]

  • ...With the limited training data as proposed in [10] and [14]–[17], we shall demonstrate that simple CNNs will suffice to achieve a superior classification performance rather than the complex ones that are commonly used for deep learning tasks....

    [...]

  • ...Among all, only few of them with a patient-specific design [10], [12], [15]–[18] have, in particular, demonstrated significant performance improvements over the automatic and generic ECG classification methods thanks to their ability to adapt or optimize the classifier body according to each patient’s ECG signal....

    [...]

  • ...However, among many methods in the literature, only few [10], [14]–[18] have in fact used the AAMI standards along with the complete data from the benchmark MIT-BIH arrhythmia database [22]....

    [...]

  • ...To perform a more extensive and accurate comparative performance evaluation, the base performance of the proposed system is compared with the three existing algorithms, [10], [15], and [16], all of which comply with the AAMI standards....

    [...]

Journal ArticleDOI
TL;DR: Five types of beat classes of arrhythmia as recommended by Association for Advancement of Medical Instrumentation (AAMI) were analyzed and dimensionality reduced features were fed to the Support Vector Machine, neural network and probabilistic neural network (PNN) classifiers for automated diagnosis.

586 citations

Journal ArticleDOI
TL;DR: An integrated method for clustering of QRS complexes is presented which includes basis function representation and self-organizing neural networks (NN's) and outperforms both a published supervised learning method as well as a conventional template cross-correlation clustering method.
Abstract: An integrated method for clustering of QRS complexes is presented which includes basis function representation and self-organizing neural networks (NN's). Each QRS complex is decomposed into Hermite basis functions and the resulting coefficients and width parameter are used to represent the complex. By means of this representation, unsupervised self-organizing NNs are employed to cluster the data into 25 groups. Using the MIT-BIH arrhythmia database, the resulting clusters are found to exhibit a very low degree of misclassification (1.5%). The integrated method outperforms, on the MIT-BIH database, both a published supervised learning method as well as a conventional template cross-correlation clustering method.

555 citations

References
More filters
Book
01 Jan 1984
TL;DR: The purpose and nature of Biological Memory, as well as some of the aspects of Memory Aspects, are explained.
Abstract: 1. Various Aspects of Memory.- 1.1 On the Purpose and Nature of Biological Memory.- 1.1.1 Some Fundamental Concepts.- 1.1.2 The Classical Laws of Association.- 1.1.3 On Different Levels of Modelling.- 1.2 Questions Concerning the Fundamental Mechanisms of Memory.- 1.2.1 Where Do the Signals Relating to Memory Act Upon?.- 1.2.2 What Kind of Encoding is Used for Neural Signals?.- 1.2.3 What are the Variable Memory Elements?.- 1.2.4 How are Neural Signals Addressed in Memory?.- 1.3 Elementary Operations Implemented by Associative Memory.- 1.3.1 Associative Recall.- 1.3.2 Production of Sequences from the Associative Memory.- 1.3.3 On the Meaning of Background and Context.- 1.4 More Abstract Aspects of Memory.- 1.4.1 The Problem of Infinite-State Memory.- 1.4.2 Invariant Representations.- 1.4.3 Symbolic Representations.- 1.4.4 Virtual Images.- 1.4.5 The Logic of Stored Knowledge.- 2. Pattern Mathematics.- 2.1 Mathematical Notations and Methods.- 2.1.1 Vector Space Concepts.- 2.1.2 Matrix Notations.- 2.1.3 Further Properties of Matrices.- 2.1.4 Matrix Equations.- 2.1.5 Projection Operators.- 2.1.6 On Matrix Differential Calculus.- 2.2 Distance Measures for Patterns.- 2.2.1 Measures of Similarity and Distance in Vector Spaces.- 2.2.2 Measures of Similarity and Distance Between Symbol Strings.- 2.2.3 More Accurate Distance Measures for Text.- 3. Classical Learning Systems.- 3.1 The Adaptive Linear Element (Adaline).- 3.1.1 Description of Adaptation by the Stochastic Approximation.- 3.2 The Perceptron.- 3.3 The Learning Matrix.- 3.4 Physical Realization of Adaptive Weights.- 3.4.1 Perceptron and Adaline.- 3.4.2 Classical Conditioning.- 3.4.3 Conjunction Learning Switches.- 3.4.4 Digital Representation of Adaptive Circuits.- 3.4.5 Biological Components.- 4. A New Approach to Adaptive Filters.- 4.1 Survey of Some Necessary Functions.- 4.2 On the "Transfer Function" of the Neuron.- 4.3 Models for Basic Adaptive Units.- 4.3.1 On the Linearization of the Basic Unit.- 4.3.2 Various Cases of Adaptation Laws.- 4.3.3 Two Limit Theorems.- 4.3.4 The Novelty Detector.- 4.4 Adaptive Feedback Networks.- 4.4.1 The Autocorrelation Matrix Memory.- 4.4.2 The Novelty Filter.- 5. Self-Organizing Feature Maps.- 5.1 On the Feature Maps of the Brain.- 5.2 Formation of Localized Responses by Lateral Feedback.- 5.3 Computational Simplification of the Process.- 5.3.1 Definition of the Topology-Preserving Mapping.- 5.3.2 A Simple Two-Dimensional Self-Organizing System.- 5.4 Demonstrations of Simple Topology-Preserving Mappings.- 5.4.1 Images of Various Distributions of Input Vectors.- 5.4.2 "The Magic TV".- 5.4.3 Mapping by a Feeler Mechanism.- 5.5 Tonotopic Map.- 5.6 Formation of Hierarchical Representations.- 5.6.1 Taxonomy Example.- 5.6.2 Phoneme Map.- 5.7 Mathematical Treatment of Self-Organization.- 5.7.1 Ordering of Weights.- 5.7.2 Convergence Phase.- 5.8 Automatic Selection of Feature Dimensions.- 6. Optimal Associative Mappings.- 6.1 Transfer Function of an Associative Network.- 6.2 Autoassociative Recall as an Orthogonal Projection.- 6.2.1 Orthogonal Projections.- 6.2.2 Error-Correcting Properties of Projections.- 6.3 The Novelty Filter.- 6.3.1 Two Examples of Novelty Filter.- 6.3.2 Novelty Filter as an Autoassociative Memory.- 6.4 Autoassociative Encoding.- 6.4.1 An Example of Autoassociative Encoding.- 6.5 Optimal Associative Mappings.- 6.5.1 The Optimal Linear Associative Mapping.- 6.5.2 Optimal Nonlinear Associative Mappings.- 6.6 Relationship Between Associative Mapping, Linear Regression, and Linear Estimation.- 6.6.1 Relationship of the Associative Mapping to Linear Regression.- 6.6.2 Relationship of the Regression Solution to the Linear Estimator.- 6.7 Recursive Computation of the Optimal Associative Mapping.- 6.7.1 Linear Corrective Algorithms.- 6.7.2 Best Exact Solution (Gradient Projection).- 6.7.3 Best Approximate Solution (Regression).- 6.7.4 Recursive Solution in the General Case.- 6.8 Special Cases.- 6.8.1 The Correlation Matrix Memory.- 6.8.2 Relationship Between Conditional Averages and Optimal Estimator.- 7. Pattern Recognition.- 7.1 Discriminant Functions.- 7.2 Statistical Formulation of Pattern Classification.- 7.3 Comparison Methods.- 7.4 The Subspace Methods of Classification.- 7.4.1 The Basic Subspace Method.- 7.4.2 The Learning Subspace Method (LSM).- 7.5 Learning Vector Quantization.- 7.6 Feature Extraction.- 7.7 Clustering.- 7.7.1 Simple Clustering (Optimization Approach).- 7.7.2 Hierarchical Clustering (Taxonomy Approach).- 7.8 Structural Pattern Recognition Methods.- 8. More About Biological Memory.- 8.1 Physiological Foundations of Memory.- 8.1.1 On the Mechanisms of Memory in Biological Systems.- 8.1.2 Structural Features of Some Neural Networks.- 8.1.3 Functional Features of Neurons.- 8.1.4 Modelling of the Synaptic Plasticity.- 8.1.5 Can the Memory Capacity Ensue from Synaptic Changes?.- 8.2 The Unified Cortical Memory Model.- 8.2.1 The Laminar Network Organization.- 8.2.2 On the Roles of Interneurons.- 8.2.3 Representation of Knowledge Over Memory Fields.- 8.2.4 Self-Controlled Operation of Memory.- 8.3 Collateral Reading.- 8.3.1 Physiological Results Relevant to Modelling.- 8.3.2 Related Modelling.- 9. Notes on Neural Computing.- 9.1 First Theoretical Views of Neural Networks.- 9.2 Motives for the Neural Computing Research.- 9.3 What Could the Purpose of the Neural Networks be?.- 9.4 Definitions of Artificial "Neural Computing" and General Notes on Neural Modelling.- 9.5 Are the Biological Neural Functions Localized or Distributed?.- 9.6 Is Nonlinearity Essential to Neural Computing?.- 9.7 Characteristic Differences Between Neural and Digital Computers.- 9.7.1 The Degree of Parallelism of the Neural Networks is Still Higher than that of any "Massively Parallel" Digital Computer.- 9.7.2 Why the Neural Signals Cannot be Approximated by Boolean Variables.- 9.7.3 The Neural Circuits do not Implement Finite Automata.- 9.7.4 Undue Views of the Logic Equivalence of the Brain and Computers on a High Level.- 9.8 "Connectionist Models".- 9.9 How can the Neural Computers be Programmed?.- 10. Optical Associative Memories.- 10.1 Nonholographic Methods.- 10.2 General Aspects of Holographic Memories.- 10.3 A Simple Principle of Holographic Associative Memory.- 10.4 Addressing in Holographic Memories.- 10.5 Recent Advances of Optical Associative Memories.- Bibliography on Pattern Recognition.- References.

8,197 citations


"A patient-adaptable ECG beat classi..." refers methods in this paper

  • ...According to Kohonen, there are three different LVQ algorithms, called LVQ1, LVQ2, and LVQ3 developed at subsequent stages to handle classification problems with different natures....

    [...]

  • ...SOM and LVQ are both clustering based algorithms proposed by Kohonen [14], [15]....

    [...]

  • ...[14] T. Kohonen, Self-Organization and Associative Memory....

    [...]

Journal ArticleDOI
01 Sep 1990
TL;DR: The self-organizing map, an architecture suggested for artificial neural networks, is explained by presenting simulation experiments and practical applications, and an algorithm which order responses spatially is reviewed, focusing on best matching cell selection and adaptation of the weight vectors.
Abstract: The self-organized map, an architecture suggested for artificial neural networks, is explained by presenting simulation experiments and practical applications. The self-organizing map has the property of effectively creating spatially organized internal representations of various features of input signals and their abstractions. One result of this is that the self-organization process can discover semantic relationships in sentences. Brain maps, semantic maps, and early work on competitive learning are reviewed. The self-organizing map algorithm (an algorithm which order responses spatially) is reviewed, focusing on best matching cell selection and adaptation of the weight vectors. Suggestions for applying the self-organizing map algorithm, demonstrations of the ordering process, and an example of hierarchical clustering of data are presented. Fine tuning the map by learning vector quantization is addressed. The use of self-organized maps in practical speech recognition and a simulation experiment on semantic mapping are discussed. >

7,883 citations

Journal ArticleDOI
TL;DR: A real-time algorithm that reliably recognizes QRS complexes based upon digital analyses of slope, amplitude, and width of ECG signals and automatically adjusts thresholds and parameters periodically to adapt to such ECG changes as QRS morphology and heart rate.
Abstract: We have developed a real-time algorithm for detection of the QRS complexes of ECG signals. It reliably recognizes QRS complexes based upon digital analyses of slope, amplitude, and width. A special digital bandpass filter reduces false detections caused by the various types of interference present in ECG signals. This filtering permits use of low thresholds, thereby increasing detection sensitivity. The algorithm automatically adjusts thresholds and parameters periodically to adapt to such ECG changes as QRS morphology and heart rate. For the standard 24 h MIT/BIH arrhythmia database, this algorithm correctly detects 99.3 percent of the QRS complexes.

6,686 citations


Additional excerpts

  • ...algorithm [21]....

    [...]

Journal ArticleDOI
TL;DR: A new supervised learning procedure for systems composed of many separate networks, each of which learns to handle a subset of the complete set of training cases, which is demonstrated to be able to be solved by a very simple expert network.
Abstract: We present a new supervised learning procedure for systems composed of many separate networks, each of which learns to handle a subset of the complete set of training cases. The new procedure can be viewed either as a modular version of a multilayer supervised network, or as an associative version of competitive learning. It therefore provides a new link between these two apparently different approaches. We demonstrate that the learning procedure divides up a vowel discrimination task into appropriate subtasks, each of which can be solved by a very simple expert network.

4,338 citations

01 Jan 1990
TL;DR: An overview of the self-organizing map algorithm, on which the papers in this issue are based, is presented in this article, where the authors present an overview of their work.
Abstract: An overview of the self-organizing map algorithm, on which the papers in this issue are based, is presented in this article.

2,933 citations

Frequently Asked Questions (12)
Q1. What have the authors contributed in "A patient-adaptable ecg beat classifier using a mixture of experts approach" ?

The authors present a “ mixture-of-experts ” ( MOE ) approach to develop customized electrocardigram ( ECG ) beat classifier in an effort to further improve the performance of ECG processing and to offer individualized health care. Tested with MIT/BIH arrhythmia database, the authors observe significant performance enhancement using this approach. 

Automated ECG beat classification was traditionally performed using a decision-tree-like approach, based on various features extracted from an ECG beat [1], [4], [5], [13], [20], [22]. 

Since this study is to evaluate the performance of a classifier that can identify a premature ventricular contraction (PVC), certain records in the database with no PVC’s (11 records) were excluded from the study, leaving 33 records of interest. 

4) A potential drawback of this proposed method is the need to develop a LE classifier for each individual patient, even with only 5 min of patient’s ECG record. 

Since each record in the MIT/BIH database is of length 30 min, the 2.5 min segment account for 1/12th of total available patient specific data and contains approximately 150 ECG beats. 

with the MOE approach, the authors will adapt this GE classifier with a LE classifier to gain significant performance enhancement at low cost. 

The information of each beat is stored as a 13-element vector, with the first nine elements representing the transformed morphological template, and the next three elements representing the temporal parameters. 

the LE is able to pick up those patient-specific beats, and therefore, provide significantly enhanced performance (from 3.65% to 98.4%). 

In practice, the attending cardiologist or any expert in ECG beat annotation will have to annotate a brief segment of patient-specific ECG in order to take advantage of the MOE approach. 

The position of annotation labels is used to identify the peak of the QRS waveform and 14 points on either side of the peak were picked up to form the template. 

To enable the “soft combination” of the classifier output, it is desired that the outputs of each classifier be an estimate of the a posterior probability of the feature vector belonging to that class. 

The second group of 20 records is used to simulate the ECG records of 20 patients, which are to be classified by the GE classifier.