
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 37, NO. 1, JANUARY 1991
Capacity of the Gaussian Arbitrarily Varying Channel

Imre Csiszár and Prakash Narayan, Member, IEEE
Abstract — The Gaussian arbitrarily varying channel with input constraint $\Gamma$ and state constraint $\Lambda$ admits input sequences $x = (x_1, \ldots, x_n)$ of real numbers with $\sum x_i^2 \le n\Gamma$ and state sequences $s = (s_1, \ldots, s_n)$ of real numbers with $\sum s_i^2 \le n\Lambda$; the output sequence is $x + s + V$, where $V = (V_1, \ldots, V_n)$ is a sequence of independent and identically distributed Gaussian random variables with mean 0 and variance $\sigma^2$. It is proved that the capacity of this arbitrarily varying channel for deterministic codes and the average probability of error criterion equals $\frac{1}{2}\log(1 + \Gamma/(\Lambda + \sigma^2))$ if $\Lambda < \Gamma$, and is 0 otherwise.
Index Terms — Arbitrarily varying channel, Gaussian, capacity.

I. INTRODUCTION
Arbitrarily varying channels (AVC's) were introduced by Blackwell et al. [5] to model communication channels with unknown parameters that may vary with time in an arbitrary and unknown manner during the transmission of a codeword. In this paper, attention is restricted to AVC's without memory; further, it is assumed that the sequence of channel states is selected arbitrarily subject to a constraint specified later, and possibly depending on the codebook but independently of the codeword actually sent.

AVC's exhibit various mathematical complexities even in the case of discrete alphabets (cf. Csiszár-Körner [6, Section 2.6]). In particular, their capacity may depend on whether or not random codes are permitted, and whether the average or maximum probability of error criterion is used. The random coding capacity admits a simple characterization as a min-max of mutual information, a result dating back to Blackwell et al. [5]. In contrast, the problem of capacity for deterministic codes is much harder. In particular, for the maximum probability of error criterion, a single-letter capacity formula is known only under certain conditions on the structure of the AVC (cf. Ahlswede [2] and Csiszár-Körner [7]).
Unless stated otherwise, the term capacity will hereafter always refer to capacity for deterministic codes and the average probability of error criterion. In the absence of state constraints, Ahlswede [1] proved that this capacity was either equal to the random coding capacity or otherwise to zero. The necessary and sufficient condition for positive capacity, as well as capacity under a state constraint, have been determined by Csiszár-Narayan [8]; it was further shown that Ahlswede's alternatives do not necessarily obtain under a state constraint.

Manuscript received April 25, 1989; revised February 13, 1990. I. Csiszár was supported by the Hungarian National Foundation for Scientific Research Grant No. 1806. P. Narayan was supported by the Systems Research Center at the University of Maryland under NSF Grant OIR-85-00108. This work was presented at the IEEE International Symposium on Information Theory, San Diego, CA, January 14-19, 1990.
I. Csiszár is with the Mathematical Institute of the Hungarian Academy of Sciences, H-1364 Budapest, POB 127, Hungary.
P. Narayan is with the Electrical Engineering Department and the Systems Research Center, University of Maryland, College Park, MD 20742.
IEEE Log Number 9038858.
Less attention has been bestowed in the literature on the capacity of AVC's with continuous alphabets. Presumably motivated by random coding capacity, there have been game-theoretic considerations concentrating on the min-max of mutual information (cf. McEliece [11]). Hughes-Narayan [10] have used a geometric approach to determine the random coding capacity of the Gaussian AVC defined formally in the following paragraph. Blachman [4] has provided lower and upper bounds on capacity in a communication situation differing from ours in that the interference (i.e., state sequence) could depend on the actual codeword transmitted. Our incomplete understanding of his paper seems to indicate that he, too, considered random coding capacity. To our knowledge, Ahlswede's [3] is the only paper treating the capacity of a continuous alphabet AVC for deterministic codes. His AVC (a Gaussian channel with the noise variance arbitrarily varying but not exceeding a given bound) allowed a very simple approach, which may not be extendable to other cases of interest.
In this paper, we determine the capacity of the Gaussian AVC formally defined as follows. Let the input and output alphabets, and the set of states, be the real line. For any input sequence $x = (x_1, \ldots, x_n)$ and state sequence $s = (s_1, \ldots, s_n)$, let the output be $x + s + V$, where $V = (V_1, \ldots, V_n)$ is a sequence of independent and identically distributed (i.i.d.) Gaussian random variables with mean 0 and variance $\sigma^2$. We adopt an input constraint $\Gamma$ and state constraint $\Lambda$, namely the permissible input sequences of length $n$ are those satisfying
$$\|x\|^2 = \sum_{i=1}^{n} x_i^2 \le n\Gamma, \qquad (\Gamma > 0) \qquad (1.1)$$
and the permissible state sequences are those satisfying
$$\|s\|^2 = \sum_{i=1}^{n} s_i^2 \le n\Lambda, \qquad (\Lambda > 0). \qquad (1.2)$$

A code of block-length $n$ comprises a set of codewords $x_1, \ldots, x_N$, each in $\mathbb{R}^n$, and a decoder $\varphi : \mathbb{R}^n \to \{0, \ldots, N\}$. The average probability of error of this code, used on the Gaussian AVC as above when the state sequence is $s$, equals
$$\bar{e}(s) = \frac{1}{N}\sum_{i=1}^{N}\Pr\{\varphi(x_i + s + V) \ne i\}. \qquad (1.3)$$

The capacity $C$ of the Gaussian AVC with input constraint $\Gamma$ and state constraint $\Lambda$ is the largest number with the property that for every $\delta > 0$ and sufficiently large $n$, there exist codes with $N \ge \exp\{n(C - \delta)\}$ codewords, each satisfying (1.1), such that the supremum of $\bar{e}(s)$ subject to (1.2) converges to 0 as $n \to \infty$. Our main result is the following.
Theorem 1: The capacity of the Gaussian AVC with input constraint $\Gamma$ and state constraint $\Lambda$ is
$$C = \begin{cases} \dfrac{1}{2}\log\left(1 + \dfrac{\Gamma}{\Lambda + \sigma^2}\right), & \text{if } \Lambda < \Gamma,\\[4pt] 0, & \text{if } \Lambda \ge \Gamma. \end{cases}$$
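As a numerical illustration of Theorem 1, the capacity formula can be evaluated directly; the following minimal Python sketch (the function name and example values are ours, with rates taken in nats, consistent with codebook sizes $\exp(nR)$) returns 0 whenever $\Lambda \ge \Gamma$:

```python
import math

def gaussian_avc_capacity(gamma, lam, sigma2):
    """Deterministic-code capacity of the Gaussian AVC (Theorem 1), in nats."""
    if lam >= gamma:
        return 0.0  # state (jamming) power at least the input power: capacity is zero
    return 0.5 * math.log(1.0 + gamma / (lam + sigma2))

# Example: Gamma = 1.0, Lambda = 0.25, sigma^2 = 0.1
print(gaussian_avc_capacity(1.0, 0.25, 0.1))  # 0.5 * log(1 + 1/0.35), about 0.67 nats
```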
According to Hughes-Narayan [10], the random coding capacity of the Gaussian AVC equals $\frac{1}{2}\log(1 + \Gamma/(\Lambda + \sigma^2))$. Thus, in this case Ahlswede's alternatives do obtain. Yet a proof of the theorem above by the elegant "elimination technique" of Ahlswede [1] is not apparent. Rather, we shall use the straightforward but more computational method of Csiszár-Narayan [8]. Suitable approximation arguments would enable a derivation of our theorem directly from the results of [8]. Instead, we prefer to present a more transparent and direct proof, which will also serve to keep this paper self-contained.
We also determine the capacity of the noiseless additive AVC whose output is $x + s$ rather than $x + s + V$. The capacity of this AVC is defined similarly to that of the Gaussian AVC, with the exception that (1.3) is now replaced by
$$\bar{e}(s) = \frac{1}{N}\,|\{i : \varphi(x_i + s) \ne i\}|. \qquad (1.4)$$

Theorem 2: The capacity of the noiseless additive AVC with input constraint $\Gamma$ and state constraint $\Lambda$ is
$$C = \begin{cases} \dfrac{1}{2}\log\left(1 + \dfrac{\Gamma}{\Lambda}\right), & \text{if } \Lambda < \Gamma,\\[4pt] 0, & \text{if } \Gamma \le \Lambda. \end{cases}$$
Whereas this result is not a formal special case of Theorem 1, both theorems can be proved by the same method. We shall prove the simpler Theorem 2 first so that the reader may better understand the key ideas. Observe that Theorem 1 requires a separate proof only in the case $\Lambda + \sigma^2 \ge \Gamma$. In fact, since (1.2) implies for an arbitrary $\epsilon > 0$ that $\|s + V\|^2 \le n(\Lambda + \sigma^2 + \epsilon)$ with probability arbitrarily close to 1 if $n$ is sufficiently large, in the case $\Lambda + \sigma^2 < \Gamma$ the assertion of Theorem 1 follows immediately from that of Theorem 2.
Actually, we shall show that the capacity as claimed in Theorems 1 and 2 can be achieved using the minimum-distance decoder, namely
$$\varphi(y) = \begin{cases} i, & \text{if } \|y - x_i\|^2 < \|y - x_j\|^2 \text{ for all } j \ne i,\\ 0, & \text{if no such } 1 \le i \le N \text{ exists.} \end{cases} \qquad (1.5)$$
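For concreteness, the decoder (1.5) and the error criterion (1.3) can be sketched as follows (a minimal illustration of ours, not the authors' code; ties are resolved to the erasure output 0 as in (1.5)):

```python
import numpy as np

def min_distance_decode(y, codebook):
    """Decoder (1.5): index (1-based) of the unique closest codeword, else 0."""
    d2 = np.sum((codebook - y) ** 2, axis=1)   # ||y - x_j||^2 for all j
    i = int(np.argmin(d2))
    if np.sum(d2 == d2[i]) > 1:                # no strictly closest codeword
        return 0
    return i + 1

def avg_error(codebook, s, sigma2, trials, rng):
    """Monte Carlo estimate of the average probability of error (1.3) for state s."""
    N, n = codebook.shape
    errors = 0
    for i in range(N):
        V = rng.normal(scale=np.sqrt(sigma2), size=(trials, n))
        for y in codebook[i] + s + V:
            errors += (min_distance_decode(y, codebook) != i + 1)
    return errors / (N * trials)
```

Setting sigma2 = 0 and trials = 1 in this sketch recovers the noiseless error fraction (1.4).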
It is worth pointing out that the result of Theorem 2 with this decoder provides a solution to a weakened version of the unsolved sphere-packing problem. This problem seeks the exponential rate of the maximum number of nonintersecting spheres of radius $\sqrt{n\Lambda}$ in $\mathbb{R}^n$ with centers in a sphere of radius $\sqrt{n\Gamma}$. In our case, the spheres may intersect, but for any given $s$ in $\mathbb{R}^n$ of norm at most $\sqrt{n\Lambda}$, only for a vanishingly small fraction of sphere centers $x_i$ can $x_i + s$ be closer to another sphere center than to $x_i$. The number $C$ in Theorem 2 then gives the exponential rate of the maximum number of spheres satisfying this condition. A similar weakened version of the sphere-packing problem in Hamming space was solved in [8] as a special case of the coding theorem for the binary adder AVC.
II. PROOF OF THE MAIN RESULT

The proof of the converse parts of Theorems 1 and 2, being standard, is relegated to the Appendix. The essential contribution of this paper consists in the direct (coding) parts of Theorems 1 and 2.
Our goal is to show that, when $\Gamma > \Lambda$, for all sufficiently large $n$ there exist $N = \exp(nR)$ codewords $x_1, \ldots, x_N$ in $\mathbb{R}^n$ satisfying $\|x_i\|^2 \le n\Gamma$, $i = 1, \ldots, N$, with $R$ arbitrarily close to the asserted capacity value, such that for a suitable decoder $\varphi$ the average probability of error $\bar{e}(s)$ is arbitrarily small, uniformly subject to $\|s\|^2 \le n\Lambda$.
Using the minimum-distance decoder $\varphi$ of (1.5), for the noiseless AVC (1.4) becomes
$$\bar{e}(s) = \frac{1}{N}\,|\{i : \|x_i + s - x_j\|^2 \le \|s\|^2 \text{ for some } j \ne i\}|, \qquad (2.1)$$
and for the Gaussian case, (1.3) gives
$$\bar{e}(s) = \frac{1}{N}\sum_{i=1}^{N}\Pr\{\|x_i + s + V - x_j\|^2 \le \|s + V\|^2 \text{ for some } j \ne i\}. \qquad (2.2)$$
We can assume without any loss of generality that $\Gamma = 1$, $0 < \Lambda < 1$.
Further, (2.1) and (2.2) remain unchanged if all vectors are multiplied by $1/\sqrt{n}$. Hence it suffices to prove that for every sufficiently small $\delta > 0$ and sufficiently large $n$ there exist $N = \exp(nR)$ unit vectors $x_1, \ldots, x_N$ in $\mathbb{R}^n$ with $C - 2\delta < R < C - \delta$, where $C = \frac{1}{2}\log(1 + 1/\Lambda)$ for the noiseless AVC and $C = \frac{1}{2}\log(1 + 1/(\Lambda + \sigma^2))$ for the Gaussian AVC, such that $\bar{e}(s)$ is arbitrarily small, uniformly subject to $\|s\|^2 \le \Lambda$, where
$$\bar{e}(s) = \frac{1}{N}\,|\{i : \|x_i + s - x_j\|^2 \le \|s\|^2 \text{ for some } j \ne i\}| \qquad (2.3)$$
in the noiseless case, and
$$\bar{e}(s) = \frac{1}{N}\sum_{i=1}^{N}\Pr\{\|x_i + s + V - x_j\|^2 \le \|s + V\|^2 \text{ for some } j \ne i\} \qquad (2.4)$$
in the Gaussian case, where $V = (V_1, \ldots, V_n)$ is now a sequence of i.i.d. Gaussian random variables with mean 0 and variance $\sigma^2/n$.
We claim that the unit vectors $x_1, \ldots, x_N$ of the following Lemma 1 do possess the property above if $\eta$ and $\epsilon$ are sufficiently small.
Lemma 1 (Codeword Properties): For every $\epsilon > 0$, $0 < \eta < 1$, $K > 2\epsilon$, and $N = \exp(nR)$ with $2\epsilon \le R \le K$, for $n \ge n_0(\epsilon, \eta, K)$ there exist unit vectors $x_1, \ldots, x_N$ in $\mathbb{R}^n$ such that for every unit vector $u$ in $\mathbb{R}^n$ and constants $\alpha, \beta$ in $[0,1]$ we have

Here $(\cdot,\cdot)$ denotes inner product and $|\cdot|^{+}$ denotes "the positive part of." This lemma is an analog of the key Lemma 3 of [8], and can be proved similarly. The proof is in the Appendix.
Commencing with the noiseless case, in order to bound (2.3) for $\|s\|^2 \le \Lambda$, note that
$$\|x_i + s - x_j\|^2 = \|x_i\|^2 + \|s\|^2 + \|x_j\|^2 + 2(x_i, s) - 2(x_i, x_j) - 2(x_j, s).$$
Hence
$$\bar{e}(s) = \frac{1}{N}\,|\{i : (x_j, s) + (x_i, x_j) \ge 1 + (x_i, s) \text{ for some } j \ne i\}|$$
$$\le \frac{1}{N}\,|\{i : (x_i, s) \le -\eta\}| + \frac{1}{N}\,|\{i : (x_j, s) + (x_i, x_j) > 1 - \eta \text{ for some } j \ne i\}|. \qquad (2.6)$$

The first term of the sum in (2.6) can be bounded by Lemma 1(i). In fact, letting $u$ be the unit vector $s/\|s\|$, $(x_i, s) \le -\eta$ implies by the assumption $\Lambda < 1$ that $(x_i, u) \le -\eta$. Thus, by Lemma 1(i), if $R > -\frac{1}{2}\log(1 - \eta^2)$, the first term of the sum in (2.6) is exponentially small and tends to 0 as $n \to \infty$ (2.8).

The second term of the sum in (2.6) can be bounded using 2) of Lemma 1 by suitably partitioning the set of possible values of the inner product $(x_i, x_j)$. To this end, let $\alpha_1 = 1 - \eta - \sqrt{\Lambda} < \alpha_2 < \cdots < \alpha_K = 1 - 2\eta$, with $\alpha_{k+1} - \alpha_k \le \eta$, $k = 1, \ldots, K-1$. Then $(x_j, s) + (x_i, x_j) > 1 - \eta$ implies that $(x_i, x_j) \ge \alpha_1$, and if $\alpha_k \le (x_i, x_j) \le \alpha_{k+1}$ then necessarily $(x_j, s) > 1 - 2\eta - \alpha_k$. The latter, in turn, implies by (2.7) that $(x_j, u) > (1 - 2\eta - \alpha_k)/\sqrt{\Lambda}$.

To complete the proof of the direct part of Theorem 2, it suffices to check for every $(\alpha, \beta) = (1 - 2\eta, 0)$ and $(\alpha_k, (1 - 2\eta - \alpha_k)/\sqrt{\Lambda})$, $k = 1, \ldots, K$, the condition $\alpha^2 + \beta^2 > 1 + \eta - \exp(-2R)$ of 2) of Lemma 1. (The condition $\alpha \ge \eta$ is clearly satisfied provided $\eta < \min\{1/3, (1 - \sqrt{\Lambda})/2\}$.) Differentiation shows that $\alpha^2 + (1 - 2\eta - \alpha)^2/\Lambda$ is minimized by $\alpha = (1 - 2\eta)/(1 + \Lambda)$, and the minimum equals $(1 - 2\eta)^2/(1 + \Lambda)$. Thus, the condition to be satisfied is
$$\frac{(1 - 2\eta)^2}{1 + \Lambda} > 1 + \eta - \exp(-2R). \qquad (2.9)$$
Obviously, if $C - 2\delta < R < C - \delta$ for any fixed $\delta > 0$, where
$$C = \frac{1}{2}\log\left(1 + \frac{1}{\Lambda}\right) = -\frac{1}{2}\log\left(1 - \frac{1}{1 + \Lambda}\right),$$
the inequality (2.9) will be satisfied if $\eta$ is sufficiently small.
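The minimization invoked in this step is elementary calculus; as a quick symbolic check (ours, not part of the paper), one can verify the minimizer and the minimum value:

```python
import sympy as sp

a, eta, lam = sp.symbols('alpha eta Lambda', positive=True)
f = a**2 + (1 - 2*eta - a)**2 / lam

crit = sp.solve(sp.diff(f, a), a)[0]
print(sp.simplify(crit - (1 - 2*eta) / (1 + lam)))                 # 0: minimizer matches
print(sp.simplify(f.subs(a, crit) - (1 - 2*eta)**2 / (1 + lam)))   # 0: minimum value matches
```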
The proof for the Gaussian case (Theorem 1) is similar, but bounding (2.4) is not as easy. We first present two simple technical lemmas.
Lemma 2: Let the r.v. $U$ be uniformly distributed on the unit $n$-sphere. Then for every vector $u$ on this sphere and any $0 < a < 1$, we have
$$\Pr\{|(U, u)| > a\} \le 2(1 - a^2)^{(n-1)/2}, \qquad \text{if } a \ge \frac{1}{\sqrt{n}}.$$

Proof: Denote the angle between the unit vectors $U$ and $u$ by $\theta(U, u)$. Then by Shannon [12, eq. (28)], with $a = \cos\phi$, it follows that
$$\Pr\{(U, u) \ge a\} \le (1 - a^2)^{(n-1)/2}, \qquad \text{if } a \ge \frac{1}{\sqrt{n}}.$$
The proof is completed by observing that $\Pr\{(U, u) \le -a\} = \Pr\{(U, u) \ge a\}$.
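Lemma 2 is also easy to check by simulation; the sketch below (our own illustration, with arbitrary parameter choices) samples $U$ uniformly on the unit $n$-sphere by normalizing a standard Gaussian vector and compares the empirical frequency with the bound $2(1-a^2)^{(n-1)/2}$:

```python
import numpy as np

rng = np.random.default_rng(1)
n, a, trials = 50, 0.4, 200_000          # note a >= 1/sqrt(n), as Lemma 2 requires

u = np.zeros(n)
u[0] = 1.0                               # any fixed unit vector
U = rng.normal(size=(trials, n))
U /= np.linalg.norm(U, axis=1, keepdims=True)   # uniform points on the unit n-sphere

freq = np.mean(np.abs(U @ u) > a)
bound = 2 * (1 - a**2) ** ((n - 1) / 2)
print(freq, bound)                       # empirical frequency should not exceed the bound
```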
Lemma 3: Let $u$ and $v$ be unit vectors with $|(u, v)| \le \eta$. Then for any unit vector $x$, the component $x^{\perp}$ of $x$ orthogonal to $\mathrm{span}\{u, v\}$ has norm
$$\|x^{\perp}\|^2 \le 1 - (u, x)^2 - (v, x)^2 + 4\eta. \qquad (2.10)$$


Further, by Lemma 3, if $\hat{x}_j$ denotes the unit vector in the direction of $x_j^{\perp}$, the component of $x_j$ orthogonal to $\mathrm{span}\{x_i, u\}$, then for $j \in F_{kl}$ we have
$$\|x_j^{\perp}\| \le \sqrt{1 - (x_i, x_j)^2 - (x_j, u)^2 + 4\eta} \le \sqrt{1 - \alpha_k^2 - \beta_l^2 + 4\eta},$$
if $|(x_i, u)| \le \eta$. Hence the probability in (2.19), that $\|V\|^2 \le \sigma^2 + \eta$ and the inner product with $\hat{x}_j$ exceeds $1 - 5\eta - \alpha_k - \beta_l\sqrt{\Lambda}$, can be bounded in terms of a r.v. $U$ distributed uniformly on the unit $n$-sphere and a fixed unit vector $u'$ in $\mathbb{R}^n$. Together with (2.17), this implies that (2.16) is overbounded by $\sum_{(k,l) \in G} A_n^{(k,l)}$, where the terms $A_n^{(k,l)}$ are given by (2.20). Hence it suffices to show that $A_n^{(k,l)} \to 0$ as $n \to \infty$ for every $(k, l) \in G$.
Since $(k, l) \in G$, there are two cases to consider: a) $\alpha_k^2 + \beta_l^2 > 1 + \eta - \exp(-2R)$, and b) $\alpha_k \le \eta$, $\alpha_k^2 + \beta_l^2 \le 1 + \eta - \exp(-2R)$. We first observe that in both cases
$$1 - \alpha_k - \beta_l\sqrt{\Lambda} - 5\eta > 0 \qquad (2.21)$$
provided that $\eta$ is chosen sufficiently small. Indeed, in case a), the expression in (2.21) is $\ge 1 - \sqrt{\Lambda} - 6\eta$. In case b), the assumption $R < C - \delta = \frac{1}{2}\log(1 + 1/(\Lambda + \sigma^2)) - \delta$ implies that
$$\alpha_k^2 + \beta_l^2 \le 1 + \eta - \frac{\exp(2\delta)}{1 + \dfrac{1}{\Lambda + \sigma^2}} < 1 + \eta - \frac{\Lambda}{1 + \Lambda}\exp(2\delta).$$
Since $\alpha_k + \beta_l\sqrt{\Lambda} \le \sqrt{(\alpha_k^2 + \beta_l^2)(1 + \Lambda)}$ (as can be directly verified by squaring both sides), this yields
$$1 - \alpha_k - \beta_l\sqrt{\Lambda} - 5\eta > 1 - 5\eta - \sqrt{1 - \Lambda(\exp(2\delta) - 1) + \eta(1 + \Lambda)} > 0$$
if $\eta$ is sufficiently small.
Now, in case a) we obtain, using Lemma 2, that $A_n^{(k,l)} \to 0$ if $\epsilon$ and $\eta$ are chosen small enough.

In case b), we have $R + \frac{1}{2}\log(1 - \alpha_k^2 - \beta_l^2 + \eta) > 0$. Then, using Lemma 2, we obtain from (2.20) a bound on $A_n^{(k,l)}$ involving the quantities $(1 - \alpha_k - \beta_l\sqrt{\Lambda} - 5\eta)^2$ and $(\sigma^2 + \eta)(1 - \alpha_k^2 - \beta_l^2 + 4\eta)$.
Evaluating the maximum of
$$\gamma(\alpha, \beta) = 1 - \alpha^2 - \beta^2 + 4\eta - \frac{(1 - \alpha - \beta\sqrt{\Lambda} - 5\eta)^2}{\sigma^2 + \eta},$$
we obtain by differentiation that the maximum is attained at $\alpha = (1 - 5\eta)/(1 + \Lambda + \sigma^2 + \eta)$, $\beta = \sqrt{\Lambda}\,\alpha$, and the value of the maximum is
$$1 + 4\eta - \frac{(1 - 5\eta)^2}{1 + \Lambda + \sigma^2 + \eta}.$$

References

I. Csiszár and J. Körner, Information Theory: Coding Theorems for Discrete Memoryless Systems.

C. E. Shannon, "Probability of error for optimal codes in a Gaussian channel."

R. Ahlswede, "Elimination of correlation in random codes for arbitrarily varying channels."

I. Csiszár and P. Narayan, "The capacity of the arbitrarily varying channel revisited: positivity, constraints."