What is the way to classify images?

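Since images are nonstationary, image lines can be classified in two ways: (a) lines for which neither $\rho_M$ (redundancy due to memory) nor $\rho_D$ (redundancy due to a nonuniform marginal distribution) is dominant, in which case no mismatch occurs; and (b) lines having $\rho_M \gg \rho_D$, which are likely to result in mismatch.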

What is the way to detect the binary sequence?

More specifically, if $Y^n = y^n = (y_1, y_2, \ldots, y_n)$ denotes the received binary sequence at the output of the channel, the MAP detector "guesses" the transmitted sequence as the one that maximizes the a posteriori probability given $y^n$. General Markov random field (MRF) models [8] are not used here, since MAP estimation for these models would require computationally intensive algorithms such as simulated annealing.

What is the channel transition and marginal probability?

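The channel transition and marginal probabilities $Q(z_n \mid z_{n-1}) \triangleq \Pr\{Z_n = z_n \mid Z_{n-1} = z_{n-1}\}$ and $Q(z_n) \triangleq \Pr\{Z_n = z_n\}$ are given by

$$\begin{bmatrix} Q(0 \mid 0) & Q(1 \mid 0) \\ Q(0 \mid 1) & Q(1 \mid 1) \end{bmatrix} = \frac{1}{1+\delta} \begin{bmatrix} 1-\epsilon+\delta & \epsilon \\ 1-\epsilon & \epsilon+\delta \end{bmatrix}$$

and $Q(1) = \epsilon = 1 - Q(0)$.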

How is the source coding rate controlled?

Source coding rate control should be carried out by modifying the original quantization matrix and accordingly determining the optimal bit rate allocation for each coefficient.

What is the method to achieve higher compression rates?

The authors use instead a suboptimal scheme to achieve higher compression rates: Varying rates are obtained by modifying the size of the zonal mask and discarding additional high-frequency DCT coefficients.

How many digits are in the mRdlog2?

If $l$ denotes the number of accuracy digits for each source parameter, then the percentage of overhead information is equal to

$$\%\,\text{Overhead} = \frac{m R \lceil \log_2(10^l - 1) \rceil}{K}$$

where $K$ is the image width and $m$ is the number of source statistics per line ($m = 4$ for the second-order Markov model, $m = 2$ for the first-order model, and $m = 1$ for the i.i.d. model).

What is the way to decode the image?

Note that this representation is amenable to progressive and scalable decoding of the image whereby the DCT coefficients for the full image are transmitted and decoded in order of increasing spatial frequency.

What is the difference between MAP and UEP?

Since MAP methods almost consistently yield a performance superior to that obtained by their ML counterparts on interleaved channels ($\delta = 0$), the use of the prior distribution clearly translates into an appreciable performance gain.

What is the performance of the MAP-UEP schemes?

Significant performance improvements are obtained by introducing even limited UEP, especially at low BER, often at the cost of only moderate increases in overall rate (compare MAP-UNC at an overall rate of $R = 1.19$ with MAP-UEP-I at $R = 1.31$, both at a channel parameter value of 0.01).

(Open Access) An error resilient scheme for image transmission over noisy channels with memory (1998) | Philippe Burlina

Q: What have the authors contributed in "An error resilient scheme for image transmission over noisy channels with memory" (IEEE Transactions on Image Processing)?

The authors first consider MAP channel decoding of uncompressed two-tone and bit-plane encoded grey-level images. Next, the authors propose a scheme relying on unequal error protection and MAP detection for transmitting grey-level images compressed using discrete cosine transform (DCT), zonal coding, and quantization.

Q: What are the future works in "An error resilient scheme for image transmission over noisy channels with memory" (IEEE Transactions on Image Processing)?

Future work will address the use of soft decision information in conjunction with trellis coded modulation (TCM) for the MAP channel decoding of compressed images over noisy channels.

Q: What is the way to reduce the error in compressed images?

Standard visual compression methods such as Joint Photographic Experts Group (JPEG) and Moving Picture Experts Group (MPEG) are fragile to channel errors.

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 7, NO. 4, APRIL 1998 593


An Error Resilient Scheme for Image Transmission over Noisy Channels with Memory

Philippe Burlina and Fady Alajaji

Abstract: This correspondence addresses the use of a joint source-channel coding strategy for enhancing the error resilience of images transmitted over a binary channel with additive Markov noise. In this scheme, inherent or residual (after source coding) image redundancy is exploited at the receiver via a maximum a posteriori (MAP) channel detector. This detector, which is optimal in terms of minimizing the probability of error, also exploits the larger capacity of the channel with memory as opposed to the interleaved (memoryless) channel. We first consider MAP channel decoding of uncompressed two-tone and bit-plane encoded grey-level images. Next, we propose a scheme relying on unequal error protection and MAP detection for transmitting grey-level images compressed using discrete cosine transform (DCT), zonal coding, and quantization. Experimental results demonstrate that for various overall (source and channel) operational rates, significant performance improvements can be achieved over interleaved systems that do not incorporate image redundancy.

Index Terms: Channels with memory, DCT coding, error resilience, joint source/channel coding, MAP decoding, unequal error protection.

I. INTRODUCTION

We address the problem of the reliable communication of images over bursty channels. Traditional approaches to the design of visual communication systems over noisy channels rely on Shannon's source-channel coding separation principle [9], resulting in what is known as tandem source-channel coding schemes. The optimality of this design principle holds only asymptotically; i.e., when no constraints exist on coding/decoding complexity and delay [9]. An alternate approach lies in joint source-channel coding (JSCC): this strategy includes techniques such as maximum a posteriori (MAP) detection, channel optimized vector quantization, or adaptive source-channel rate allocation. JSCC has recently received increased attention (e.g., [5], [7], [11]), and has been shown to outperform tandem schemes when delay and complexity are constrained. Most of the work on joint source-channel coding of images [5], [7], [11] has dealt with memoryless channels, disregarding the fact that real-world communication channels, in particular mobile radio or satellite channels, often have memory.

In this work, we investigate the problem of MAP detection of images transmitted over a binary Markov channel. The MAP detector fully exploits the statistical image characteristics in order to efficiently combat channel noise. It also exploits the larger capacity of the channel with memory as opposed to the interleaved (memoryless) channel. We first describe MAP detection schemes that directly utilize the inherent image redundancy in uncompressed binary images and bit-plane encoded grey-level images. The amount of needed overhead information and the performance degradation when the decoder has imperfect knowledge of the channel parameters are considered.

The MAP detection approach is then validated for systems employing image compression. The residual redundancy of quantized low-frequency discrete cosine transform (DCT) coefficients is exploited via unequal error protection (UEP) and MAP decoding. Experimental results show that the proposed schemes exhibit very good performance, in spite of their low complexity (which primarily resides in the MAP decoder). Specifically, significant gains over systems not exploiting image redundancy can be achieved, at relatively low overall transmission rates.

Manuscript received January 24, 1996; revised May 16, 1997. This work was supported in part by the Natural Sciences and Engineering Research Council (NSERC) of Canada. Parts of this work were presented at the 1995 International Symposium on Information Theory and the 1996 International Conference on Image Processing. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Christine Podilchuk.

P. Burlina is with the Institute for Advanced Computer Studies and the Electrical Engineering Department, University of Maryland, College Park, MD 20742 USA (e-mail: burlina@cfar.umd.edu).

F. Alajaji is with the Department of Mathematics and Statistics and the Department of Electrical and Computer Engineering, Queen's University, Kingston, Ont. K7L 3N6, Canada.

Publisher Item Identifier S 1057-7149(98)02465-8.

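II. CHANNEL MODEL

Consider a binary channel with memory described by $Y_n = X_n \oplus Z_n$ for $n = 1, 2, \ldots$, where $X_n$, $Z_n$, and $Y_n$ represent, respectively, the input, noise, and output of the channel. The input and noise sequences are assumed to be independent of each other. The noise process $\{Z_n\}$ is a stationary ergodic Markov process described in [2], with channel bit error rate (BER) denoted by $\epsilon$, where $\epsilon < 1/2$, and correlation parameter denoted by $\delta \geq 0$ (the noise correlation coefficient is given by $\delta/(1+\delta)$). When $\delta = 0$, the channel reduces to the memoryless binary symmetric channel (BSC). The channel transition and marginal probabilities $Q(z_n \mid z_{n-1}) \triangleq \Pr\{Z_n = z_n \mid Z_{n-1} = z_{n-1}\}$ and $Q(z_n) \triangleq \Pr\{Z_n = z_n\}$ are given by

$$\begin{bmatrix} Q(0 \mid 0) & Q(1 \mid 0) \\ Q(0 \mid 1) & Q(1 \mid 1) \end{bmatrix} = \frac{1}{1+\delta} \begin{bmatrix} 1-\epsilon+\delta & \epsilon \\ 1-\epsilon & \epsilon+\delta \end{bmatrix}$$

and $Q(1) = \epsilon = 1 - Q(0)$. Note that this Markov model is general; it can represent any irreducible first-order two-state Markov chain. The channel capacity is given [2] by

$$C = 1 - (1-\epsilon)\, h_b\!\left(\frac{\epsilon}{1+\delta}\right) - \epsilon\, h_b\!\left(\frac{\epsilon+\delta}{1+\delta}\right)$$

where $h_b(\cdot)$ is the binary entropy function. The capacity is monotonically increasing with $\delta$ (for fixed $\epsilon$) and monotonically decreasing with $\epsilon$ (for fixed $\delta$). Note that for fixed $\epsilon$, as $\delta \to \infty$, $C \to 1$.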
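The channel of this section can be simulated with a few lines of code. The following is a minimal sketch, not the authors' implementation: it assumes the contagion-based noise model of [2] with $Q(1 \mid 0) = \epsilon/(1+\delta)$ and $Q(1 \mid 1) = (\epsilon+\delta)/(1+\delta)$, and computes the capacity as one minus the entropy rate of the noise. The function names are hypothetical.

```python
import math
import random


def transition_probs(eps, delta):
    """Q[prev][cur] for the binary Markov noise (assumed model of [2])."""
    return [
        [(1 - eps + delta) / (1 + delta), eps / (1 + delta)],
        [(1 - eps) / (1 + delta), (eps + delta) / (1 + delta)],
    ]


def binary_entropy(p):
    """Binary entropy in bits, with h_b(0) = h_b(1) = 0."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)


def capacity(eps, delta):
    """Capacity = 1 - entropy rate of the stationary Markov noise."""
    q = transition_probs(eps, delta)
    return 1.0 - (1 - eps) * binary_entropy(q[0][1]) - eps * binary_entropy(q[1][1])


def markov_noise(n, eps, delta, rng):
    """Draw n noise bits; the first bit follows the stationary law Q(1) = eps."""
    q = transition_probs(eps, delta)
    z = [1 if rng.random() < eps else 0]
    while len(z) < n:
        z.append(1 if rng.random() < q[z[-1]][1] else 0)
    return z[:n]


def transmit(bits, eps, delta, rng):
    """Channel output Y_n = X_n XOR Z_n."""
    noise = markov_noise(len(bits), eps, delta, rng)
    return [x ^ z for x, z in zip(bits, noise)]
```

Calling `capacity(0.1, 0.0)` gives the BSC value $1 - h_b(0.1)$, while `capacity(0.1, 10.0)` is larger, which illustrates why interleaving the channel gives up capacity.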




Fig. 1. MAP detection of two-tone Lena over the Markov channel. (a) Binary Lena. (b) Lena received. (c) Decoded Lena, second-order model. (d) Received uncoded Lena, $\delta = 10$. (e) Decoded Lena: adaptive scheme.

III. UNCOMPRESSED IMAGES

A. Image Models, MAP Detection, and Image Redundancy

Consider a two-tone image $[x_{i,j}]$ of height $J$ and width $K$, where $x_{i,j} \in \{0, 1\}$, $i = 1, \ldots, J$, $j = 1, \ldots, K$. We assume that the image satisfies a causal second-order Markov property such that any pixel at location $(i, j)$ depends on the pixels at locations $(i-1, j)$ and $(i, j-1)$. When the image is explored lexicographically, it can be represented as a second-order Markov process $\{X_n\}$ where

$$\Pr\{X_n = x_n \mid X_{n-1} = x_{n-1}, \ldots, X_1 = x_1\} = \Pr\{X_n = x_n \mid X_{n-1} = x_{n-1}, X_{n-K} = x_{n-K}\}$$

for $n > K$. Note that this model is completely specified by four transitional distributions. We also consider the following special cases: the first-order Markov chain and nonuniform independent and identically distributed (i.i.d.) models [4].

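Consider the problem of transmitting the binary second-order Markov source $\{X_n\}$ over the Markov channel. The optimal detection technique that minimizes the sequence probability of decoding error is the sequence MAP method [3]. More specifically, if $Y^n = y^n = (y_1, y_2, \ldots, y_n)$ denotes the received binary sequence at the output of the channel, the MAP detector "guesses" the transmitted sequence¹ $\hat{x}^n$ according to

$$\hat{x}^n = \arg\max_{x^n \in \{0,1\}^n} \Pr\{X^n = x^n \mid Y^n = y^n\}. \qquad (1)$$

It can be shown [4] that (1) is equivalent to

$$\hat{x}^n = \arg\max_{x^n} \Big\{ \log\big(Q(x_1 \oplus y_1)\, P(x_1)\big) + \sum_{k=2}^{K} \log\big(Q(x_k \oplus y_k \mid x_{k-1} \oplus y_{k-1})\, P(x_k \mid x_{k-1})\big) + \sum_{k=K+1}^{n} \log\big(Q(x_k \oplus y_k \mid x_{k-1} \oplus y_{k-1})\, P(x_k \mid x_{k-1}, x_{k-K})\big) \Big\} \qquad (2)$$

where $P(\cdot)$ denotes the source distributions and $Q(\cdot)$ the channel noise distributions.

¹ General Markov random field (MRF) models [8] are not used here, since MAP estimation for these models would require computationally intensive algorithms such as simulated annealing. We therefore restrict ourselves to causal models that are easily implemented via sequential decoding algorithms.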

The sequence MAP detector described in (2) can be implemented using the Viterbi algorithm. Here, $x_k$ denotes the state at time $k$; the trellis will hence have two states, with two branches leaving and entering each state. For a branch leaving state $x_{k-1}$ at time $k-1$ and entering state $x_k$ at time $k$, the path metric is

$$-\log\big(Q(x_k \oplus y_k \mid x_{k-1} \oplus y_{k-1})\, P(x_k \mid x_{k-1})\big), \quad \text{for } k \leq K$$

and

$$-\log\big(Q(x_k \oplus y_k \mid x_{k-1} \oplus y_{k-1})\, P(x_k \mid x_{k-1}, x_{k-K})\big), \quad \text{for } k > K.$$

The surviving path for each state is the path with the smallest cumulative metric up to that state. The sequence MAP decoder observes the entire received sequence $y^n$ in order to estimate $x_k$, $k = 1, \ldots, n$.
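The trellis search can be illustrated for the first-order special case, where the two-state structure is immediate. The sketch below is an assumption-laden illustration rather than the authors' decoder: it takes a first-order Markov source (initial law `p_init`, transitions `p_trans[prev][cur]`) and Markov noise (`q_init`, `q_trans[prev][cur]`), uses the negative log of the product of noise and source probabilities as the branch metric, and keeps the survivor with the smallest cumulative metric. All names are hypothetical.

```python
import math


def neg_log(p):
    """Branch metric -log(p); zero-probability branches get infinite cost."""
    return -math.log(p) if p > 0 else math.inf


def map_decode(y, p_init, p_trans, q_init, q_trans):
    """Sequence MAP estimate of a first-order binary Markov source.

    y       : received bits
    p_init  : [P(x=0), P(x=1)] for the first source bit
    p_trans : p_trans[prev][cur] source transition probabilities
    q_init  : [Q(z=0), Q(z=1)] for the first noise bit
    q_trans : q_trans[prev][cur] noise transition probabilities
    """
    if not y:
        return []
    # Cumulative metric of the best path ending in state x at time 0.
    cost = [neg_log(q_init[x ^ y[0]] * p_init[x]) for x in (0, 1)]
    back = []  # back[k-1][x] = best predecessor of state x at time k
    for k in range(1, len(y)):
        new_cost, choice = [], []
        for x in (0, 1):
            cands = [
                cost[xp] + neg_log(q_trans[xp ^ y[k - 1]][x ^ y[k]] * p_trans[xp][x])
                for xp in (0, 1)
            ]
            best = 0 if cands[0] <= cands[1] else 1
            new_cost.append(cands[best])
            choice.append(best)
        cost = new_cost
        back.append(choice)
    # Trace back from the best final state.
    x = 0 if cost[0] <= cost[1] else 1
    path = [x]
    for choice in reversed(back):
        x = choice[x]
        path.append(x)
    path.reverse()
    return path
```

For example, with a strongly persistent source (transition probability 0.01) sent over a memoryless channel with BER 0.1, an isolated received 1 inside a run of zeros is decoded as a channel error, because flipping one noise bit is far more likely than two source transitions.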

In this scheme, channel protection is achieved by utilizing the natural source redundancy as well as the channel noise correlation. The total redundancy contained in the source is $\rho_T = 1 - H_\infty(X)$, where $H_\infty(X)$ is the source entropy rate. This total redundancy can be written as $\rho_T = \rho_D + \rho_M$ [3], where $\rho_D = 1 - H(X_1)$ denotes the redundancy due to the nonuniformity of the marginal distribution, and $\rho_M = H(X_1) - H_\infty(X)$ denotes the redundancy due to the memory of the process. The type and amount of redundancy exhibited by an image is important since it dictates the behavior of the MAP detector. If $\rho_M \gg \rho_D$, the process tends to behave like a symmetric Markov source. This results in a mismatch situation (cf. [3, Sec. V]) that prevents the decoder from fully exploiting the channel noise correlation: when the channel capacity increases, the performance of the MAP detector deteriorates. If the redundancy due to the nonuniformity of a process is high relative to its redundancy in the form of memory ($\rho_D \gg \rho_M$), then the process behaves like a nonuniform i.i.d. source and no such mismatch occurs [3, Sec. IV]. Images and facsimile documents exhibit very different types and degrees of redundancy.² Furthermore, redundancy varies within images themselves since images are hardly stationary sources. This observation suggests the use of an adaptive scheme, as will be proposed next.
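The two kinds of redundancy can be computed directly for a stationary first-order binary Markov chain. The sketch below assumes the decomposition in which the nonuniformity redundancy is one minus the entropy of the marginal, and the memory redundancy is the marginal entropy minus the entropy rate; the function name and the parameterization by `p01` and `p10` are illustrative choices, not notation from the paper.

```python
import math


def binary_entropy(p):
    """Binary entropy in bits, with h_b(0) = h_b(1) = 0."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)


def markov_redundancy(p01, p10):
    """Return (rho_D, rho_M) for a stationary binary first-order Markov chain.

    p01 = Pr{X_n = 1 | X_{n-1} = 0}, p10 = Pr{X_n = 0 | X_{n-1} = 1};
    requires p01 + p10 > 0.
    """
    pi1 = p01 / (p01 + p10)  # stationary probability of a 1
    h_marginal = binary_entropy(pi1)
    # Entropy rate of the chain: average conditional entropy per state.
    h_rate = (1 - pi1) * binary_entropy(p01) + pi1 * binary_entropy(p10)
    rho_d = 1.0 - h_marginal      # redundancy from a nonuniform marginal
    rho_m = h_marginal - h_rate   # redundancy from memory
    return rho_d, rho_m
```

A symmetric chain such as `markov_redundancy(0.1, 0.1)` has all of its redundancy in the form of memory, while `markov_redundancy(0.2, 0.8)` describes an i.i.d. source with only nonuniformity redundancy.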

B. Two-Tone Image Detection

We start by modeling the two-tone images according to the second-order causal Markov chain. Image lines are each represented as a Markov chain with transitional probabilities computed empirically, and transmitted uncompressed in a lexicographic fashion over the Markov channel. At the receiver, the sequence MAP decoder is implemented according to (2). While the 2-D Markov model is appealing, since it closely captures the spatial dependency specific to image sources, simulation results suggest that the use of this model often results in a mismatch between the source and the channel [4]. This leads us to conclude that when images are modeled by a second-order Markov chain and sent over the binary Markov channel, the best performance is obtained when $\delta = 0$; i.e., when the channel is fully interleaved and transformed into a memoryless channel (BSC). Fig. 1(a)–(c) show the binary Lena image transmitted over the interleaved channel with BER $\epsilon$. The resulting average decoding bit error probability is 0.039.

We next consider MAP decoding when the image is modeled as a first-order Markov chain. Since images are nonstationary, image lines can be classified in two ways: (a) lines for which neither $\rho_M$ nor $\rho_D$ is dominant, in which case no mismatch occurs; (b) lines having $\rho_M \gg \rho_D$, which are likely to result in mismatch. We hence employ an adaptive encoding system on the image lines that takes into consideration the line redundancy. Each image line, modeled as a first-order Markov chain, is processed as follows. The empirical distributions for the line are computed. If $\rho_M$ is not dominant relative to $\rho_D$, as judged against some threshold $T$, we transmit the image line over the channel and MAP decode it using the line statistics and first-order Markov assumptions. Otherwise, if $\rho_M$ dominates, we first convert the redundancy in the symmetric Markov source $\{X_n\}$ from the form of memory into redundancy in the form of a nonuniform distribution via the transformation,³ as follows [4]:

$$W_n = X_n \oplus X_{n-1}, \quad n = 1, 2, \ldots$$

We then transmit $\{W_n\}$ directly over the Markov channel, and MAP decode it as $\{\hat{W}_n\}$ using i.i.d. source assumptions. The decoded binary image stream is reconstructed using

$$\hat{X}_n = \hat{W}_n \oplus \hat{X}_{n-1}, \quad n = 1, 2, \ldots$$

with a fixed initial value $\hat{X}_0 = X_0$. To prevent error propagation, packetization is used by grouping source samples into blocks. An example of adaptive MAP decoding ($T = 10$) of Lena over a very noisy channel with high noise correlation ($\delta = 10$) is shown in Fig. 1(d) (received as if it were not protected) and (e) (MAP decoded). A 4.68 dB peak signal-to-noise ratio (PSNR) gain is achieved by the adaptive MAP decoder over the case when no MAP decoding is done. Detailed performance evaluation of this scheme for various images is reported in [4].

² Computational studies that quantify natural redundancy inherent in two-tone images are reported in [4].

³ This is essentially equivalent to differential encoding for binary sources.

TABLE I
PERCENTAGE OF OVERHEAD FOR BINARY LENA ($K = 512$)

TABLE II
BINARY LENA: ROBUSTNESS RESULTS FOR ADAPTIVE MAP DECODING SCHEME, PSNR (dB), $T = 10$. PARAMETERS: DESIGN BER, ACTUAL BER, DESIGN CORRELATION PARAMETER, ACTUAL CORRELATION PARAMETER. (a) ROBUSTNESS WITH BER ($\delta = 10$). (b) ROBUSTNESS WITH CORRELATION PARAMETER
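The differential transformation and its inverse can be sketched as follows. This is a minimal illustration of differential encoding for binary sequences; the function names and the default initial value `x0 = 0` are assumptions made for the example.

```python
def to_differential(x, x0=0):
    """Map source bits to their XOR with the previous bit (x0 seeds the chain)."""
    prev, out = x0, []
    for bit in x:
        out.append(bit ^ prev)
        prev = bit
    return out


def from_differential(w, x0=0):
    """Invert to_differential by accumulating XORs from the same seed."""
    prev, out = x0, []
    for diff in w:
        prev ^= diff
        out.append(prev)
    return out
```

A run-heavy line such as `[0, 0, 1, 1, 1, 0]` maps to the mostly-zero sequence `[0, 0, 1, 0, 0, 1]`, so its redundancy moves from memory into a nonuniform marginal. The inverse also shows why packetization matters: a single wrong bit in the decoded differential stream complements every later reconstructed bit until the chain is reseeded.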

C. Overhead Information

As in all joint source-channel coding schemes, it is assumed that the image statistics are available at the decoder. This can be achieved by transmitting them along with the image using a forward error-correcting code.⁴ We assume that a rate-$1/R$ convolutional encoder is used to protect the source statistics. If the channel is very noisy, we might need to use a more powerful convolutional code. This can be achieved by increasing the number of states of the code or increasing $R$. If $l$ denotes the number of accuracy digits for each source parameter, then the percentage of overhead information is equal to

$$\%\,\text{Overhead} = \frac{m R \lceil \log_2(10^l - 1) \rceil}{K}$$

where $K$ is the image width and $m$ is the number of source statistics per line ($m = 4$ for the second-order Markov model, $m = 2$ for the first-order model, and $m = 1$ for the i.i.d. model). The amount of overhead needed for the Lena two-tone image is presented in Table I for $K = 512$.

⁴ Note that we can avoid transmitting overhead information about the source statistics by using training images to estimate the statistics of the source. This approach is justifiable in applications where the images belong to a particular class, e.g., in the transmission of medical magnetic resonance images (MRI's).
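As a worked illustration of the overhead computation, the sketch below evaluates the ratio of protected side-information bits per line to the line width. It rests on assumptions that should be checked against the original typeset formula: each statistic is stored with $\lceil \log_2(10^l - 1) \rceil$ bits, the channel code expands the bits by a factor $R$ (a rate-$1/R$ code), and the result is returned as a fraction of $K$ rather than multiplied by 100. The function name is hypothetical.

```python
import math


def overhead_ratio(m, R, l, K):
    """Side-information bits per line divided by the line width K.

    m : number of source statistics per line (assumed 4, 2 or 1 by model)
    R : expansion factor of the channel code (assumed rate 1/R)
    l : number of decimal accuracy digits per statistic
    K : image width in pixels
    """
    bits_per_statistic = math.ceil(math.log2(10 ** l - 1))
    return m * R * bits_per_statistic / K
```

With two-digit accuracy each statistic needs 7 bits, so a first-order model (`m = 2`) protected by a rate-1/2 code (`R = 2`) on a 512-pixel line costs `overhead_ratio(2, 2, 2, 512)`, which is 28/512 or about 5.5% of the line.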


Fig. 2. Transmission of grey Lena using MAP detection of bit-plane encoded images. (a) Original Lena. (b) Received uncoded; PSNR = 14.45 dB. (c) Decoded Lena; PSNR = 19.53 dB.

D. Robustness Under Imperfectly Known Channel Statistics

Until now we have assumed that the channel statistics ($\epsilon$ and $\delta$) were known a priori at the receiver. We investigate here the robustness of the MAP decoding system when these parameters are not known perfectly. This may occur due to inadequate estimation of the channel parameters, particularly when the channel is time-varying (e.g., mobile radio channels). Simulation results using the adaptive MAP decoding scheme for the transmission of Lena are displayed in Table II. In Table II(a), we present PSNR results when the receiver misestimates the BER $\epsilon$ with the correlation parameter $\delta = 10$. In Table II(b), we provide PSNR results when the receiver misestimates the correlation parameter $\delta$ with the channel BER $\epsilon$ held fixed. We can conclude that the MAP scheme is not very sensitive to errors in estimating the channel parameters, provided that we do not design a parameter to be zero when the actual parameter is nonzero.

E. Bit-Plane Encoded Grey-Level Images

For illustrative purposes, we herein consider the application of the MAP decoding method to bit-plane encoded images. In bit-plane coding, each plane is traditionally compressed using binary image coding techniques [6]. This method is very sensitive to channel errors and typically yields low compression ratios, leaving little room for protection against channel noise. Consider instead the problem of directly sending the uncompressed bit-planes modeled as Markov sources over the Markov channel. As in the case of two-tone images, we use an adaptive MAP detection scheme taking into account the source and the channel statistics applied on each bit-plane image explored in a lexicographic fashion. Experimental results are shown in Fig. 2 for the Lena grey-level image. Significant improvements over the received images are achieved. For the channel parameters $\epsilon$ and $\delta$ considered, gains in excess of 5 dB are achieved.

TABLE III
MAP-UNC VERSUS UNC: AVERAGE PSNR (IN dB) OF DECODED LENA OVER MARKOV CHANNEL WITH BER $\epsilon$ AND CORRELATION PARAMETER $\delta$. RESULTS AVERAGED OVER 30 EXPERIMENTS. $R$ IS THE OVERALL RATE IN B/PIXEL

TABLE IV
MAP-UEP I VERSUS ML-IL-UEP I: AVERAGE PSNR (IN dB) OF DECODED LENA OVER MARKOV CHANNEL WITH BER $\epsilon$ AND CORRELATION PARAMETER $\delta$. RESULTS AVERAGED OVER 30 EXPERIMENTS. $R$ IS THE OVERALL RATE IN B/PIXEL

IV. COMPRESSED IMAGES

MAP decoding of uncompressed images relies on the significant intrinsic source redundancy to help combat channel noise. Since source coding schemes are not ideal, they always leave some residual redundancy in their output bitstream that can similarly be exploited at the receiver. A challenging issue lies in the use of the limited redundancy residing in compressed images for channel protection.

A. Image Compression Scheme

Standard visual compression methods such as Joint Photographic Experts Group (JPEG) and Moving Picture Experts Group (MPEG) are fragile to channel errors. Errors corrupting the compressed data contribute unequally to the final distortion of the reconstructed image or video stream. This observation justifies the use of unequal error protection. We propose to improve the error resilience of compressed images by designing several schemes that combine UEP and MAP detection. Our objective is to characterize the effectiveness of these methods for various levels of image compression.

Consider the case of JPEG encoded images, or that of MPEG1/2 or H.261/3 encoding of intraframes. These schemes incorporate DCT coding, quantization, and entropy coding. Clearly, the most fragile module lies in the variable-length (VL) coding (either Huffman or arithmetic), for which the occurrence of an error produces catastrophic error propagation and total loss of the packet until the next synchronization occurs. Error resilience in this case consists in the reliable reception of synchronization messages or the packetization of VL codes.

TABLE V
MAP-UEP II VERSUS ML-IL-UEP II: AVERAGE PSNR (IN dB) OF DECODED LENA OVER MARKOV CHANNEL WITH BER $\epsilon$ AND CORRELATION PARAMETER $\delta$. RESULTS AVERAGED OVER 30 EXPERIMENTS. $R$ IS THE OVERALL RATE IN B/PIXEL

Since the synchronization issue is outside the scope of this work, we consider instead a compression scheme similar in spirit to the above cited standards with the exclusion of entropy coding. More specifically, our image compression scheme is as follows. The image is first subdivided into 8 × 8 blocks, and for each of these blocks the DCT is computed. The resulting 64 DCT coefficients are uniformly quantized using one of the quantization matrices proposed in [10], derived from psychovisual thresholds. The coefficients are then ordered in a zig-zag fashion. While the basic JPEG scheme would Huffman encode the resulting stream on the basis of the coefficients' amplitude and leading run-lengths of zeros, we proceed with zonal coding and conversion to a binary bitstream. For zonal coding, we use the first 15 zig-zag scanned coefficients. The retained quantized coefficients are then converted to binary using a folded binary code (FBC) representation. The bit rates used for converting each quantized coefficient are those proposed for zonal coding in [10].
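The block pipeline described above (8 × 8 DCT, uniform quantization, zig-zag scan, zonal mask of 15 coefficients) can be sketched in plain Python. This is an illustrative sketch only: it substitutes a single uniform step size `step` for the psychovisual quantization matrices of [10], which are not reproduced here, and it stops before the folded binary code conversion. All function names are hypothetical.

```python
import math

N = 8  # block size


def dct_2d(block):
    """Orthonormal 2-D DCT-II of an N x N block (direct O(N^4) evaluation)."""
    def scale(u):
        return math.sqrt(1.0 / N) if u == 0 else math.sqrt(2.0 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            total = 0.0
            for i in range(N):
                for j in range(N):
                    total += (block[i][j]
                              * math.cos((2 * i + 1) * u * math.pi / (2 * N))
                              * math.cos((2 * j + 1) * v * math.pi / (2 * N)))
            out[u][v] = scale(u) * scale(v) * total
    return out


def zigzag_order(n=N):
    """Zig-zag scan positions: by anti-diagonal, alternating direction."""
    return sorted(
        ((i, j) for i in range(n) for j in range(n)),
        key=lambda p: (p[0] + p[1], p[0] if (p[0] + p[1]) % 2 else p[1]),
    )


def zonal_encode(block, step=16.0, keep=15):
    """Quantize the DCT coefficients and keep the first `keep` in zig-zag order.

    `step` is a stand-in uniform quantizer step (an assumption of this sketch).
    """
    coeffs = dct_2d(block)
    return [round(coeffs[i][j] / step) for i, j in zigzag_order()[:keep]]
```

For a flat block of value 128 the DC coefficient is 1024, so `zonal_encode` returns 64 followed by fourteen zeros. The first 15 zig-zag positions are exactly those with row index plus column index at most 4, which is the triangular low-frequency zone.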

B. Channel Coding Schemes

Error resilience is provided by combining UEP and MAP detection. Because of the high-energy compaction property of the DCT for highly correlated sources [6], most of the signal information is concentrated in the lower spatial frequencies. The DC coefficient is the most important DCT coefficient since it measures the average value of each block. An error in the DC coefficient typically results in blocking artifacts. These artifacts are often resolved through additional channel protection or postprocessing techniques that employ edge-preserving smoothing operators on the decoded image. However, traditional channel protection or error-concealment operations disregard the source characteristics. We propose instead to use MAP detection of channel encoded DC

⁵ This issue is given much attention in current standardization efforts of MPEG4.
