What contributions have the authors mentioned in the paper "On a new class of codes for identifying vertices in graphs" ?

The authors investigate a new class of codes for the optimal covering of vertices in an undirected graphG such that any vertex in G can be uniquely identified by examining the vertices that cover it. The vertex v is then said to cover itself and every other vertex in the ball with center v: their formal problem statement is as follows: Given an undirected graph G and an integer t 1, find a ( minimal ) set C of vertices such that every vertex in G belongs to a unique set of balls of radius t centered at the vertices in C: The authors also describe the identification of sets of vertices using covering codes that uniquely identify single vertices. Finally, the authors describe an application of the theory developed in this paper to fault diagnosis of multiprocessor systems.

What is the condition for generating a graph with vertices?

The authors are interested in generating a graph with vertices in which the number of codewords is as close to as possible for the identification of single vertices and to for identification of sets of up to vertices.

how many codewords are in a ball?

The center of the ball is covered by all codewords, a vertex at distance one from the center is covered only by itself, and each vertex at distance two by a different pair of codewords.

What is the advantage of a test program?

Since the test program has to reside on the local memory of every monitor processor, this also minimizes the amount of memory required to store the test program.

how many codewords are covered by a cube?

Every codeword is covered only by itself because the Hamming distance between any two parity vectors of codewords is at least three.

Why is the vertices in a triangular mesh not distinguishable?

This is because the vertices inare at distance two from each other and any vertex is at the same distance from all the vertices in Hence the vertices in are not distinguishable ifFinally, the authors address the problem of code construction for hexagonal and triangular meshes, the former topology having received attention recently [23].

what is the simplest way to maximize the number of columns under the constraint?

obviously,it follows that(8)To maximize the number of columns under the constraint (8), the authors have to choose the weights of the columns as small as possible, starting with columns of weight , , etc., up to the point where the right-hand side of (8) is exceeded.

How many bits in a graph are the for identifying a faulty processor?

The number of bits in the syndrome equals the number of test links in the test graph; this can be extremely large in systems with thousands of processors, and can easily lead to traffic congestion system when the syndrome is communicated to the host.

what is the density of codewords for a three-dimensional cube?

Theorem 6 and Corollary 4 show that the density of codewords is only for three-dimensional cubes, and tends to zero as increases.

(Open Access) On a new class of codes for identifying vertices in graphs (1998) | Mark G. Karpovsky

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 44, NO. 2, MARCH 1998 599

On a New Class of Codes for

Identifying Vertices in Graphs

Mark G. Karpovsky, Fellow, IEEE, Krishnendu Chakrabarty, Member, IEEE, and Lev B. Levitin,

Fellow, IEEE

Abstract—We investigate a new class of codes for the optimal

covering of vertices in an undirected graph

such that any vertex

can be uniquely identiﬁed by examining the vertices that

cover it. We deﬁne a ball of radius

centered on a vertex

be the set of vertices in

that are at distance at most

from

The vertex

is then said to cover itself and every other vertex

in the ball with center

Our formal problem statement is as

follows: Given an undirected graph

and an integer



, ﬁnd

a (minimal) set

of vertices such that every vertex in

belongs

to a unique set of balls of radius

centered at the vertices in

The set of vertices thus obtained constitutes a code for vertex

identiﬁcation. We ﬁrst develop topology-independent bounds on

the size of

We then develop methods for constructing

for

several speciﬁc topologies such as binary cubes, nonbinary cubes,

and trees. We also describe the identiﬁcation of sets of vertices

using covering codes that uniquely identify single vertices. We

develop methods for constructing optimal topologies that yield

identifying codes with a minimum number of codewords. Finally,

we describe an application of the theory developed in this paper

to fault diagnosis of multiprocessor systems.

Index Terms—Code construction, coding theory, covering ra-

dius, fault diagnosis, graph theory, multiprocessor systems.

I. INTRODUCTION

RAPHS ﬁnd a wide range of applications in several

ﬁelds of engineering and information sciences. A graph

can be used to represent almost any physical situation and

the relationship between various entities. Graph models are

therefore often employed in solving a number of practical

problems [7].

In this paper, we investigate the problem of covering the

vertices of a graph

such that we can uniquely identify any

vertex in

by examining the vertices that cover it. We deﬁne

a ball of radius

centered on a vertex to be the set of

vertices of

that are at distance at most from (The

distance between vertices

and is the number of edges in

a shortest path between

and .) The vertex is then said to

cover itself and every other vertex in its ball. We are interested

in identifying the vertices of

using a minimum number of

balls of radius

This is formally stated as follows: Given an

undirected graph

and an integer , ﬁnd a (minimal) set

of vertices such that every vertex of belongs to a unique set

Manuscript received November 5, 1996; revised August 7, 1997. This work

was supported in part by the National Science Foundation under Grant MIP

9630096, by NATO under Grant 910411, and by a start-up grant from Boston

University’s College of Engineering.

The authors are with Reliable Computing Laboratory, Department of

Electrical and Computer Engineering, Boston University, Boston, MA 02215

USA.

Publisher Item Identiﬁer S 0018-9448(98)00841-4.

of balls of radius centered at the vertices in We view as

an identifying code such that all vertices in it are codewords.

An application of the theory developed in this paper lies in

fault diagnosis of multiprocessor systems. The purpose of fault

diagnosis is to test the system and locate faulty processors. A

multiprocessor system can be modeled as an undirected graph

, where is the set of processors and is the set

of links in the system. Speciﬁc software routines are executed

on certain selected processors to carry out diagnosis. The

selection of these processors is done by generating the code

that allows for unique identiﬁcation of faulty processors. Every

processor corresponding to a codeword vertex tests itself and

all its neigbhoring processors. This corresponds to the use of

balls of radius one centered at the codewords, i.e.,

Hence an optimal code (minimum number of codewords)

minimizes the amount of overhead required to implement fault

diagnosis.

The organization of the paper is as follows. In Section II,

we develop topology-independent bounds on the size of

and present methods for constructing

for practical topologies

such as meshes, binary and nonbinary cubes, and trees. Section

III addresses the problem of constructing codes that identify

not just single vertices as in the previous sections, but sets of

vertices of up to a given size. Finally, in Section IV, we discuss

the construction of optimal graphs that yield identifying codes

with a minimum number of codewords.

II. C

ODE CONSTRUCTION

Let be the minimum number of codewords required

to identify every vertex uniquely when a ball of radius

used. We ﬁrst obtain some lower bounds on

Let

be the volume of a ball of radius centered at vertex , i.e.,

the number of vertices that are at distance at most

from

Theorem 1: For a graph with vertices

2) Let Then

, where is the smallest integer such that

where is the

binary entropy function.

3) Let

be the smallest integer such that for a certain

, the following conditions are

0018–9448/98$10.00  1998 IEEE

600 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 44, NO. 2, MARCH 1998

satisﬁed:

(1)

(2)

Then

Proof: The ﬁrst lower bound follows from the fact that

there are

cases ( different vertices and the selection

of no vertex—no vertex is identiﬁed) to be distinguished.

Therefore, the information can be encoded in a minimum of

bits.

We prove the second bound as follows. We denote by

the result of the (identiﬁcation) test

performed by the

th codeword. Each is a binary random

variable:

Denote by the random variable which

is equal to

when no vertex is to be identiﬁed and

if the th vertex is to be identiﬁed. In the absence

of any a priori knowledge, we assume that all

cases are

equiprobable. Thus the entropy

Now,

denote by

the information in about

given the outcomes of Then

(3)

On the other hand,

(4)

Since

(the value of is

uniquely determined by

), we obtain

(5)

However, the probability

, hence

(6)

It follows from (4)–(6) that the number of codewords is not

smaller than the smallest

such that

To prove the third bound, we consider a binary

matrix

where

if the th codeword covers the th vertex, and

otherwise. Denote by the weight (number of

nonzero components) of the

th row and by the weight

of the

th column. Obviously,

(7)

Our goal is to ﬁnd the minimum

for a given and

provided that all columns of are

nonzero and distinct.

Consider the dual problem: for a given

, ﬁnd the max-

imum number

of distinct and nonzero columns. Since,

obviously,

it follows that

(8)

To maximize the number of columns

under the constraint

(8), we have to choose the weights of the columns

small as possible, starting with columns of weight

, , etc.,

up to the point where the right-hand side of (8) is exceeded.

Let

be an integer such that

(9)

For the maximum possible number of columns

, (8)

should turn into an equality. Taking into account (9), let

(10)

where

Obviously,

(11)

We need to consider the following three cases.

Then the largest number of columns and the equality in (8)

are achieved if we use all possible distinct columns of weights

columns of weight and

columns of weight

To maximize and to achieve the equality in (8), we should

use all columns of weights

columns of

weight

all columns of weight and

columns of weight

Then we should use all columns of weights

In all cases, the total number of columns in is

(12)

KARPOVSKY et al.: NEW CLASS OF CODES FOR IDENTIFYING VERTICES IN GRAPHS 601

In fact, because of (8), (12) gives an upper bound on the

number

of columns for a given number of rows (where

is deﬁned by (9)). It follows that for a given , the minimum

number of rows

should satisfy (1). Thus (1) and (2) together

determine a lower bound on the number of codewords for a

given number

of vertices.

In the special case of a regular graph where

for all , (9) and (12) take simpler forms

(13)

(14)

A simpler lower bound in the case of a regular graph is

given by Theorem 2.

Theorem 2: The size of an identifying code for a regular

graph with

vertices is lower-bounded by

(15)

Proof: As in the proof of Theorem 1, consider the

binary matrix , where if and only if the th

codeword covers the

th vertex, and otherwise. The

number of nonzero elements in the matrix is obviously

On the other hand, since at most columns can have weight

and the remaining columns must be of weight at

least

, the number of nonzero elements should be at least

Hence,

Therefore,

The lower bound (15) is, in general, weaker than (13) and

(14). However, both bounds coincide if

It can be shown

in Theorem 1 (part 3), that

if and only if

The latter condition is satisﬁed for a broad class of graphs if

does not grow too fast with

We next examine some speciﬁc graph topologies.

A. Binary Cubes

A binary

-cube computer is a multiprocessor system with

processors interconnected as an -dimensional binary

cube. Each processor

constitutes a node of the cube and

is a self-contained computer with its own CPU and local

memory. Each

also has direct communication paths to

other neighbor processors through the edges of the cube.

An example of a commercial binary-cube computer is the

NCUBE/ten, which is a ten-dimensional system developed by

NCUBE Corporation [8], [14].

Let

be the minimum number of codewords required

for identifying the vertices in an

-dimensional binary cube

using balls of radius

We ﬁrst consider the case The

speciﬁc topology of the

-dimensional cube imposes addi-

tional constraints which makes the lower bounds of Theorems

1 and 2 unattainable. A tighter lower bound is given by the

following theorem, a proof of which is given in the Appendix.

Theorem 3: For an

-dimensional binary cube, ,

(16)

where

is the volume of the ball of radius

two in the Hamming space

The lower bound (16) is achieved if there exists a perfect

covering of the

-dimensional cube by balls of radius two,

i.e., a perfect code

with distance ﬁve. The only such case is

for

Then all vertices of weight one and four can be

chosen as codewords, and the total number of codewords is

ten, which is given by (16).

Let

be the size of an optimal code

of length

with covering radius , i.e., every vertex is at Hamming

distance at most

from a codeword of [4], [5], [9],

[12], [17], [21]. An upper bound on

follows from the

theorem below.

Theorem 4: Let

be an optimal binary code of length

and covering radius , i.e., has codewords. Then,

for

, a code identifying vertices in the -dimensional

binary cube can be selected as

( is the Hamming distance between and ).

Proof: We show that every vertex in the cube

covered by a unique set of codewords.

Case 1:

Let

Every neighbor of belongs to and therefore

covers

We need to prove that there exists no vertex that is

covered by the same set of vertices. Let

The codewords covering are

where and Clearly, is the only

vertex that is a neighbor of all these codewords.

Case 2:

Let

and Now, is covered by itself and

every neighbor

We show that there exists such

that

but Note that because there

are no triangles in

, there does not exist a vertex such

that

Hence, we have only to prove the

existence of

for every Let be a neighbor of

Since , there exist exactly two vertices that are at

distance one from both

and One of these vertices (from

above) is

; the other is

Case 3:

Let

and Suppose is covered by codewords

Without loss of generality, let

and the codewords covering be

A binary

(

;k;

+1)

code is perfect if

An optimal covering code is one that has a minimum number of code-

words.

602 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 44, NO. 2, MARCH 1998

Clearly, there is no vertex other than that is at distance one

from all these codewords. Hence

is uniquely identiﬁed.

Next, suppose

Without loss of generality, let

and the only codeword covering be

There must exist at least one vertex in the

covering code

such that Thus

It can now be easily seen that each vertex in the above set

contributes codewords to

that cover , which contradicts the

assumption that

Finally, suppose Let the two codewords covering

be and Now,

is the only vertex other than that is at distance one from both

these codewords. If

then is uniquely identiﬁed. If

then there exists that generates

as a codeword of Once again, without loss of generality,

let

Then which

contradicts our assumption that

Hence and is

uniquely identiﬁed.

Corollary 1: The number of codewords in an optimal iden-

tifying code with

for a binary -cube is

upper-bounded by

(17)

Exact values for small

as well as bounds on are

available in the literature; see, e.g., [3]. In particular

Using this and (16), we get

For example, if , then

Therefore, for we have

The ratio of the upper bound to the lower bound

with

Another solution to the identifying code construction prob-

lem for an

-dimensional binary cube is obtained by se-

lecting codewords separately for its two constituent

-dimensional cubes. This “divide and conquer” approach,

which implies that

, often gives better

results for small

than the construction method using

(see Table I). Note that for and we achieved the

lower bound on

using ad hoc construction methods.

The construction of Theorem 4 can be extended in a

straightforward manner for

We now construct an

optimal

with covering radius ; the number of codewords

is The identifying code is generated by

selecting vertices that are at distance exactly

from the ver-

tices in

TABLE I

UMBER OF CODEWORDS REQUIRED FOR IDENTIFYING

VERTICES IN

BINARY CUBES

Theorem 5: For any given , a code for identifying

vertices in the

-dimensional binary cube can be

obtained by selecting as codewords all vertices at distance

exactly

from the codewords of an optimal code which

has covering radius

, i.e.,

Proof: We ﬁrst make the following observation: if ver-

tices

and are such that there is at least one ball centered

at a vertex in

to which belongs but does not

belong, then

and can be distinguished using codewords

from

Therefore, we only need to prove that any two vertices

can be distinguished if they belong to the same ball of radius

centered at a vertex

Without loss of generality, let

All vertices of weight now belong to and serve as

codewords for identifying a vertex. Given two vertices

and

that are in the same ball centered at , we show that we

can always ﬁnd a codeword

such that covers one of

them but not the other. We deﬁne

to be a vector

with components

. In addition, if ,

and

is the component-wise negation of Let

and , where is the weight of Assume,

without loss of generality, that

It follows therefore

that

because otherwise both and will be the same

vertex

Consider now three cases:

We choose

such that Then and

2) Note that if then ,

otherwise,

and would be identical. Assume that at least

one of two conditions is fulﬁlled:

or is even. Choose

such that

and

Then and

On the other hand,

Thus in both cases, codeword covers but not

KARPOVSKY et al.: NEW CLASS OF CODES FOR IDENTIFYING VERTICES IN GRAPHS 603

TABLE II

OUNDS ON THE NUMBER OF CODEWORDS IN AN OPTIMAL

IDENTIFYING CODE WITH

FOR AN

-DIMENSIONAL BINARY CUBE

3) and is odd. Then and at least

one of two conditions is fulﬁlled:

is even or

Choose such that and

Then

and

Thus in this case, codeword covers but not

Corollary 2: For , the number of codewords

required for identifying vertices in a binary cube is upper-

bounded by

We next estimate the ratio between the upper bound and

the lower bound on the number of codewords

when

is an integer. We know from (15) and Corollary 2 that

Since [3], it follows

that

Using the following well-known upper bound on (see

e.g. [3]):

we obtain

If then using

for and constant ,weget

The ratio of the upper bound to the lower bound

is therefore given by

For , we have as before, while for ,

For the special case of , we have the

following corollary, which follows from the fact that

Corollary 3: The number of codewords required for a bi-

nary cube with

dimensions using balls of radius

is upper-bounded by

As special cases, for , we have and

for

we have

Table II shows the upper and lower bounds on For

the lower bounds, we used (15) for

since

for these cases, and (15) coincides with the bound given by

Theorem 1 (part 3). For

, we applied Theorem 1 (part 3)

directly and obtained tighter bounds than given by (15). For

, the covering radius approach cannot be applied with

The last column of the table is based on the following

result, which we prove later (see Corollary 7):

(18)

While it may be intuitively expected that the number of

codewords required for identiﬁcation decreases as

increases,

this is not necessarily the case. For example,

but

(

)



(

)

lim

(

)

(

)=1

On a new class of codes for identifying vertices in graphs

Figures

Citations

Grid coverage for surveillance and target location in distributed sensor networks

Computer Science – Theory and Applications

Minimizing the size of an identifying or locating-dominating code in a graph is NP-hard

Robust location detection in emergency sensor networks

Robust location detection with sensor networks

References

Graph theory

The Theory of Error-Correcting Codes

Introduction to VLSI systems

On the Connection Assignment Problem of Diagnosable Systems

Fat-trees: universal networks for hardware-efficient supercomputing

Related Papers (5)

Minimizing the size of an identifying or locating-dominating code in a graph is NP-hard

Identifying and locating-dominating codes on chains and cycles

Identifying Codes with Small Radius in Some Infinite Regular Graphs

Domination and location in acyclic graphs

Exact Minimum Density of Codes Identifying Vertices in the Square Grid

Frequently Asked Questions (11)

Q1. What contributions have the authors mentioned in the paper "On a new class of codes for identifying vertices in graphs" ?

Q2. What is the condition for generating a graph with vertices?

Q3. how many codewords are in a ball?

Q4. What is the advantage of a test program?

Q5. how many codewords are covered by a cube?

Q6. What is the lower bound of the -dimensional binary cube?

Q7. What is the simplest example of a binary cube computer?

Q8. Why is the vertices in a triangular mesh not distinguishable?

Q9. what is the simplest way to maximize the number of columns under the constraint?

Q10. How many bits in a graph are the for identifying a faulty processor?

Q11. what is the density of codewords for a three-dimensional cube?