# TU/e EINDHOVEN UNIVERSITY OF TECHNOLOGY 

## A Viterbi decoder

## Citation for published version (APA):

Meeberg, van de, L. (1974). A Viterbi decoder. (EUT report. E, Fac. of Electrical Engineering; Vol. 74-E-49). Technische Hogeschool Eindhoven.

## Document status and date:

## Published: 01/01/1974

## Document Version:

Publisher's PDF, also known as Version of Record (includes final page, issue and volume numbers)

## Please check the document version of this publication:

- A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website.
- The final author version and the galley proof are versions of the publication after peer review.
- The final published version features the final layout of the paper including the volume, issue and page numbers.
Link to publication


## General rights

Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.

- Users may download and print one copy of any publication from the public portal for the purpose of private study or research.
- You may not further distribute the material or use it for any profit-making activity or commercial gain
- You may freely distribute the URL identifying the publication in the public portal.

If the publication is distributed under the terms of Article 25 fa of the Dutch Copyright Act, indicated by the "Taverne" license above, please follow below link for the End User Agreement:
www.tue.nl/taverne

## Take down policy

If you believe that this document breaches copyright please contact us at:
openaccess@tue.nl
providing details and we will investigate your claim.

## ひ

A VITERBI DECODER
by
L. van de Meeberg

| TECHNISCHE HOGESCHOOL EINDHOVEN |  | EINDHOVEN UNIVERSITY OF TECHNOLOGY |
| :--- | :--- | :--- |
| NEDERLAND |  | THE NETHERLANDS |
| AFDELING DER ELEKTROTECHNIEK |  | DEPARTMENT OF ELECTRICAL ENGINEERING |
| VAKGROEP TELECOMMUNICATIE |  |  |

A Viterbi decoder by
L. van de Meeberg

TH-Report 74-E-49
October 1974

## Contents

Summary ..... 3
1 INTRODUCTION ..... 4
2 THE ENCODER ..... 5
3 THE VITERBI ALGORITHM ..... 6
3.1 The algorithm in terms of the trellis diagram ..... 6
3.2 Analytical representation of the algorithm ..... 7
3.3 Some comments on the metric and path registers ..... 8
4 THE GENERATING FUNCTIONS ..... 9
5 ERROR PROBABILITIES ..... 11
5.1 The error event probability ..... 11
5.2 The bit error probability ..... 12
5.3 Generally valid upper bounds for $P_{E}$ and $P_{B}$ ..... 12
6 REALIZATION OF THE DECODER ..... 15
6.1 The metric registers ..... 15
6.2 The path registers ..... 16
6.3 The control unit ..... 17
6.4 The error detector ..... 17
7 MEASUREMENTS ..... 21
8 SUGGESTIONS FOR SIMPLIFYING THE CIRCUIT ..... 23
8.1 Simplification of the metric ..... 23
8.2 Large-scale integration ..... 23
Appendix 1 ..... 24
Appendix 2 ..... 25
Appendix 3 ..... 26
Bibliography ..... 28
Acknowledgement ..... 28

## Summary

After an introduction to the structure of convolutional codes, this report discusses a Viterbi decoder for the simplest non-systematic convolutional code (constraint length $\mathrm{K}=3$ ).
In Section 1 it is shown how this code is generated. The consecutive data bits, which are to be encoded, are shifted into a 3-bit shift register. To this register two mod-2 adders are connected; one is linked to all three stages and the other to the first and the last stage of the shift register. The sums presented by the mod- 2 adders appear alternately at the output terminal of the encoder. In this way a binary code with rate $1 / 2$ is obtained. It is shown that all possible code sequences can be represented as paths in a so-called trellis diagram.

Section 2 deals with the Viterbi algorithm, an optimal algorithm for maximum-likelihood decoding of convolutional codes. When a sequence of code digits is mutilated in a transmission channel, the particular path through the trellis diagram is searched which has the smallest Hamming distance with respect to the received sequence. In this case maximum-likelihood decoding is equivalent to minimum-distance decoding.

In Section 3 the generating function is derived. Analysis of the function yields all distance information of the code.
In Section 4 it is shown that the error correction obtained with this algorithm is quite considerable. The bit error probability can be calculated by computing the probability of a wrong path being followed and ascertaining the number of bit errors caused thereby. The generating function appears to be a most useful tool for formulating this error probability. It is derived that in the case of the encoder and decoder being linked by a binary symmetric channel, the bit error probability remains below approximately $50 p^{3}$, where $p$ denotes the cross-over probability of this channel.
The circuit of a decoder making use of TTL is discussed in Section 5.
Measurements dealt with in Section 6 show that the actual bit error rate is in very close agreement with the calculations of Section 4.
In Section 7 some possibilities are suggested for simplify. ing the circuit by means of large-scale integration.

## 1 INTRODUCTION

Since Claude Elwood Shannon published his "Mathematical Theory of Communications" in the Bell System Technical Journal in 1948, an immense amount of work has been done with the object of improving the reliability of communications. Information theory has had great influence on the development of digital modulation systems, space and satellite communications and such fields as semantics, psychology and genetics. However, the most important activities in the discipline of information theory concern source coding and channel coding, the latter of which is used for reducing the effects of noise introduced in a communication channel. In general, a channel encoder adds redundancy to an information source, and it is this redundancy which enables the decoder to improve the signal-to-noise ratio or to lower the error rate.

The class of error-correcting channel codes known as convolutional codes offers promising practical possibilities; the encoding and decoding techniques are considert ably simpler than is the case with block codes.
Several methods have been devised for decoding convolutional codes, such as sequential decoding (Wozencraft [1] , [2]), threshold decoding (Massey [3]), the Fano algorithm [4] and recently the Viterbi algorithm ([5] to [9]). The present report discusses the design of a decoder based on the last of these for the simplest convolutional code. By way of introduction, in this introductory section the general form of a convolutional encoder will first be discussed, and then some subclasses of convolutional codes will be dealt with in brief.
A convolutional encoder is a linear "finite-state machine" consisting of a $K$-stage shift register and $n$ mod- 2 adders. The data sequence, which is usually binary, is shifted into the register $b$ bits at a time ( $b<n$ ). In general, such an encoder will thus assume the form depicted in Fig.1.1, which has been drawn from right to left to show the bits in their correct sequence (first bit at the left). The rate $R$ of such a code is $b / n$, all $b$ data bits being converted into $n$ code digits. This reduction in rate is the price that has
to be paid for the error-correcting feature. We shall confine ourselves to codes of which $b=1$, thus with rate $1 / n$. The systematic convolutional codes form a subclass of these codes. In a systematic encoder $n-1 \bmod -2$ adders are connected to the stages of the shift register, whilst the $n$th adder is replaced by the direct connection to the first stage.
Bucher and Heller [10] showed that for high values of $K$ the behaviour of a systematic encoder of constraint length $K$ is substantially the same as that of a nonsystematic encoder of constraint length $K(1-R)$. For this reason we shall confine ourselves to non-systematic codes. A problem peculiar to non-systematic codes is that of catastrophic error propagation: with certain connection patterns between the mod- 2 adders and the shift register, it is possible for a finite number of errors in transmission to give rise to an infinite number of errors in the decoded data sequence. Massey and Sain [11] showed that a rate $1 / n$ convolutional code is subject to catastrophic error propagation if and only if the subgenerator polynomials contain a common factor. Applying this criterion, Rosenberg [12] showed that only a small fraction of the non-systematic codes, viz. $1 /\left(2^{n}-1\right)$, is in fact catastrophic. Therefore, the question of catastrophic error propagation will not be further dwelt upon here.

In designing convolutional encoders the main problem consists in finding the optimal connections between the mod- 2 adders and the shift register. The main criterion to be kept in mind is the minimum free distance, which should be as large as possible. In Section 5 the meaning of this will be explained. Optimal connections have been ascertained up to a constraint length $K=9$, i.a. by Oldenwalder [13]. Here, we shall confine our attention to a code with $K=3, n=2$ and a minimum free distance 5 . This code belongs to the class of complementary convolutional codes, which again form a subclass of the non-systematic codes. This implies that both mod- 2 adders are connected with the first as well as with the last stage of the shift register.


Fig.1.1 General representation of a convolutional encoder.

## 2 THE ENCODER

The simplest non-systematic convolutional code is generated in the following way, as illustrated by Fig.2.1. Two mod-2 adders are connected to a three-stage shift register. The outputs $Q_{1}$ and $Q_{2}$ of these adders are alternately connected to the output terminal of the encoder (first $Q_{1}$, then $Q_{2}$ ).
Denoting the three positions of the shift register by $S_{0}$, $S_{1}$ and $S_{2}$ respectively, yields the truth table below (Table 1).

The transitions of the states will be investigated on the basis of the table. The four states which the positions $S_{1}$ and $S_{0}$ can assume are denoted by $a, b, c$ and $d$. This provides the transitions given in Table 2.

This may be illustrated by the following example. Assume the data sequence to be 11010 and the shift register to contain initially three zeros (state $a$ ). When the first bit, a 1, is shifted in, the contents of the shift register will become 001 (state $b$ ), so that $Q_{1}=0 \oplus 0 \oplus 1=1$ and $Q_{2}=0 \oplus 1=1$, hence 11 appears at the output. Now the next bit is shifted in, again a 1 . The shift register then contains 011 (state $d$ ). Therefore, $Q_{1}$ becomes 0 and $Q_{2}$ becomes 1 . At the next step the register contains 110 , whence $Q_{1}=0$ and $Q_{2}=1$. Proceeding in this way we find for the data sequence 11010 the code 1101010010 .

Each data bit is converted into two code digits which are fed into the channel, so the rate is $1 / 2$.
The linearity of this system can easily be demonstrated by comparing the response to two different data sequences with the response to their mod 2 sum:
$100000 \ldots \rightarrow 111011000000 \ldots$
$001000 \ldots \rightarrow 000011101100 \ldots$
$101000 \ldots \rightarrow 111000101100 \ldots$

We shall now investigate the transitions of the states (see Table 2) more closely. To this end the states $a, b, c$ and $d$ are represented as 4 levels in a trellis diagram; an entered bit is represented by a solid line if it is a 0 or by a dashed line if it is a 1 . The relevant two code bits are indicated along these lines, which thus represent the transitions. Moreover, we shall assume the shift register to contain initially three zeros, i.e. to be in state $a$. The trellis diagram will then be as shown in Fig.2.2. Each

Table 1

| $S_{2}$ | $S_{1}$ | $S_{0}$ | state | $Q_{1}$ | $Q_{2}$ |
| :--- | :--- | :--- | :--- | :--- | :--- |
| 0 | 0 | 0 | $a$ | 0 | 0 |
| 0 | 0 | 1 | $b$ | 1 | 1 |
| 0 | 1 | 0 | $c$ | 1 | 0 |
| 0 | 1 | 1 | $d$ | 0 | 1 |
| 1 | 0 | 0 | $a$ | 1 | 1 |
| 1 | 0 | 1 | $b$ | 0 | 0 |
| 1 | 1 | 0 | $c$ | 0 | 1 |
| 1 | 1 | 1 | $d$ | 1 | 0 |

Table 2

| if $S_{1}$ and $S_{0}$ are in state: | $a$ | $a$ | $b$ | $b$ | $c$ | $c$ | $d$ | $d$ |
| :--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| and the shifted-in bit is: | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
| then $S_{1}$ and $S_{0}$ assume state: | $a$ | $b$ | $c$ | $d$ | $a$ | $b$ | $c$ | $d$ |
| whilst $Q_{1}$ and $Q_{2}$ become: | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 1 |

## 3 THE VITERBI ALGORITHM

### 3.1 The algorithm in terms of the trellis diagram

The Viterbi algorithm is based on the principle of maximumlikelihood decoding, which in the present case is equivalent to minimum distance decoding. Upon reception of a sequence of bits, the particular path through this diagram will be searched which is closest to this sequence in the sense of Hamming distance; i.e., the path which differs from the received sequence by the minimum number of symbols. An example will make the meaning of this clear.
It was explained in the previous section that the data sequence 11010 is converted into the code
1101010010 . Assume this code sequence to be mutilated in the transmission channel so that 1001110010 is received, in other words that the second and fifth bits are erroneous. The paramount question is how this received sequence will be decoded.
Let us first consider the first pair of bits, 10 . Starting at the left top of the trellis diagram, we see that only two paths start from this point, viz. the path $a-a(00)$ and the path $a-b(11)$. Both paths are at a Hamming distance 1 from the first pair of received bits ( 10 ). We keep these distances and both paths in mind. Path $a-a$ corresponds to a 0 and path $a-b$ to a 1 in the relevant data sequence.
Now we consider the second pair of received bits, 01 . From the reached point $a$ there is one path ( 00 ), again to $a$, thus at a Hamming distance 1 from 01 . Since the first step already invoived a Hamming distance 1 , the path $a-a-a$ ( 0000 ) is at a total Hamming distance 2 from the received bits (1001).
From point $a$ reached after the first step, there is also a path (11) to $b$, likewise at a distance 1 from the second pair of received bits, thus this path $a-a-b(0011)$ is also at a total distance 2 from the first four received bits.
From $b$ there is a path ( 10 ) to $c$ at a distance 2 to 01 , totalling $1+2=3$. In other words, the path $a-b-c$ $(1110)$ is at a distance 3 from 1001 .
Finally, there is a path $b-d(01)$ at distance 0 from 01 , yielding a total distance of $1+0=1$. The path $a-b-d$ ( 1101 ) is thus at a distance 1 from 1001 .

## Summarizing, we have

- a path terminating in $a$ at a total distance 2 (metric 2)
- a path terminating in $b$ with metric 2
- a path terminating in $c$ with metric 3
- a path terminating in $d$ with metric 1 .

These four paths correspond to the data sequences 00,01 , 10 and 11 respectively.
The situation becomes slightly more complex at the next step because each of the nodes $a, b, c$ and $d$ is now the terminus of two paths. We need only store the one with the smallest metric. The other path can be disregarded because it has a larger metric and is thus less probable. If the metric values of both paths are identical, we make an arbitrary choice by flipping a coin; in this example, let us suppose that this means disregarding the lower of the two paths.


Fig.3.1 The third step.

The third pair of bits received is 11 . We now consider Fig.3.1 in which the metric values after the second step are indicated at the left.
The Hamming distance of path $a-a$ is 2 (metric $2+2=4$ ), that of path $c-a$ is 0 (metric $3+0=3$ ). Path $a-a$ can thus be disregarded and path $c-a$ with metric 3 should be stored.
The Hamming distance of path $a-b$ is 0 (metric $2+0=2$ ), that of path $c-b$ is 2 (metric $3+2=5$ ). Only path $a-b$ with metric 2 need be stored.
The Hamming distance of path $b-c$ is 1 (metric $2+1=3$ ), so is that of path $d-c$ (metric $1+1=2$ ). Only path $d-c$ with metric 2 is stored.
Finally, the Hamming distance of path $b-d$ is 1 (metric $2+1=3$ ), and so is that of path $d-d$ (metric $1+1=2$ ). Only path $d-d$ with metric 2 is stored.
Recapitulating, the new metric values at points $a, b, c$ and $d$ are now $3,2,2$ and 2 , respectively, and the paths stored correspond to the data sequences $100,001,110$ and 111 , respectively.
For the next two steps we proceed in the same way. After the fourth step we find the metric values $3,2,3,3$ and the paths corresponding to the data sequences 1000,1101 , 0010 and 0011 , respectively. The fifth and last step yields the metric values $4,4,2,3$ and the paths corresponding to the data sequences $10000,10001,11010$ and 00111 , respectively.
Fig.3.2 gives the resulting trellis diagram, omitting the disregarded paths and showing the metric values after each step. It is seen that after reception of the sequence 1001110010 the path terminating in $a$ (data sequence 10000 ) has a metric 4 , and so has the path terminating in $b$ (data sequence 10001 ). The path terminating in $c$ (data sequence 11010 ) has a metric 2 and that terminating in $d$ (data sequence 00111 ) has a metric 3 .
The path with the smallest metric value (2) with respect to the received bit sequence appears to be $a-b-d-c-b-c$, corresponding to the data sequence 11010 and the code sequence 1101010010 . It is seen to be identical to the code sequence generated by the encoder, two bits of which were mutilated in the transmission channel. The errors have
thus been corrected. It should, however, be realized that all errors will not necessarily be corrected by this decoding algorithm. If the error rate is higher, or if the error distribution is different, there is a chance of the wrong path being chosen. This will be explained by the following example.


Fig.3.2 Trellis diagram after reception of 1001110010 .

Assume that the last bit of the mentioned sequence is also wrongly received, so that the received sequence becomes 1001110011 , thus containing a third error. The paths through the trellis disgram then become as shown in Fig.3.3. After the last step the metrics will then be $3,3,3,3$ and the corresponding data sequences 00100,10001 , 11010 and 11011 respectively. Since the metric values of all paths are 3 , the choice is again arbitrary, the four paths being equally likely. If the first is chosen, there will be 4 bit errors; the second path results in 3 bit errors, the third path yields the correct data sequence, and if the fourth is chosen there will be 1 bit error.


Fig.3.3 Trellis diagram after reception of 1001110011 .

The probability of a wrong path being chosen and of this leading to bit errors will be calculated in Section 5. These probability calculations are confined to the case of a binary symmetric transmission channel being used.

### 3.2 Analytical representation of the algorithm

For realizing the algorithm in hardware it is convenient to express the decoding system in terms of formulae. To this end the following notation will be introduced.

For the metric values we shall use the symbols $M_{a} n, M_{b} n$, $M_{C}{ }^{n}$ and $M_{d} n$, where the $n$ stands for the order of the step (or the instant). The symbol $x_{n}$ denotes the $n$th pair of bits, and the symbols $x_{n 1}$ and $x_{n 2}$ the corresponding individual bits. For the decoded sequence we shall use the symbol $P$, defined by the same indices as $M$. Finally, the Hamming distance between $x_{n}$ and, for example 00 , will be written $D\left(x_{n}-00\right)$.
The first step is thus expressed by:
$M_{a} 1=D\left(x_{1}-00\right), P_{a} 1=0$
$M_{b} 1=D\left(x_{1}-11\right), P_{b} 1=1$
and the second step by:

$$
\begin{aligned}
& M_{a}^{2}=M_{a}^{1}+D\left(x_{2}-00\right), P_{a}^{2}=P_{a} 1,0=00 \\
& M_{b}^{2}=M_{a}^{1}+D\left(x_{2}-11\right), P_{b}^{2}=P_{a}^{1}, 1=01 \\
& M_{c}^{2}=M_{b} 1+D\left(x_{2}-10\right), P_{c}^{2}=P_{b} 1,0=10 \\
& M_{d}^{2}=M_{b} 1+D\left(x_{2}-01\right), P_{d}^{2}=P_{b} 1,1=11
\end{aligned}
$$

whilst the third step becomes:
$M_{a}{ }^{3}=\min \left\{M_{a}^{2}+D\left(x_{3}-00\right), M_{c}^{2}+D\left(x_{3}-11\right)\right\}$
$M_{b}{ }^{3}=\min \left\{M_{a}^{2}+D\left(x_{3}-11\right), M_{c}^{2}+D\left(x_{3}-00\right)\right\}$
$M_{c}{ }^{3}=\min \left\{M_{b}^{2}+D\left(x_{3}-10\right), M_{d}^{2}+D\left(x_{3}-01\right)\right\}$
$M_{d}{ }^{3}=\min \left\{M_{b}{ }^{2}+D\left(x_{3}-01\right), M_{d}^{2}+D\left(x_{3}-10\right)\right\}$
if $M_{a} 3=M_{a}^{2}+D\left(x_{3}-00\right)$, then $P_{a}^{3}=P_{a}^{2}, \quad 0=000$
if $M_{a}^{3}=M_{c}^{3}+D\left(x_{3}-11\right)$, then $P_{a}^{3}=P_{c}{ }^{3}, 0=100$
if $M_{b}^{3}=M_{a}^{2}+D\left(x_{3}-11\right)$, then $P_{b}{ }^{3}=P_{a}^{2}, \quad 1=001$
if $M_{b}^{3}=M_{c}^{2}+D\left(x_{3}-00\right)$, then $P_{b}{ }^{3}=P_{c}{ }^{2}, \quad 1=101$
if $M_{C}{ }^{3}=M_{b}^{2}+D\left(x_{3}-10\right)$, then $P_{c}{ }^{3}=P_{b}{ }^{2}, 0=010$
if $M_{c}{ }^{3}=M_{d}{ }^{2}+D\left(x_{3}-01\right)$, then $P_{c}{ }^{3}=P_{d}{ }^{2}, 0=110$
if $M_{d}{ }^{3}=M_{b}{ }^{2}+D\left(x_{3}-01\right)$, then $P_{d}{ }^{3}=P_{b}{ }^{2}, 1=011$
if $M_{d}{ }^{3}=M_{d}{ }^{2}+D\left(x_{3}-10\right)$, then $P_{d}{ }^{3}=P_{d}{ }^{2}, \quad 1=111$
The procedure for the following steps is identical to that for the third step, so that, in general, for $n \geqslant 3$ the following scheme applies:

Table 3

$$
\begin{aligned}
& M_{a}^{n}=\min \left\{\begin{array}{l}
M_{a} n \cdot 1+D\left(x_{n}-00\right) \longrightarrow P_{a} n=P_{a} n-1,0 \\
M_{c} n-1+D\left(x_{n}-11\right) \longrightarrow P_{a} n=P_{c} n-1,0
\end{array}\right. \\
& M_{b} n=\min \left\{\begin{array}{l}
M_{a} n-1+D\left(x_{n}-11\right) \longrightarrow P_{b} n=P_{a} n \cdot 1,1 \\
M_{c} n \cdot 1+D\left(x_{n}-00\right) \longrightarrow P_{b} n=P_{c} n \cdot 1,1
\end{array}\right. \\
& M_{c} n=\min \left\{\begin{array}{l}
M_{b} n-1+D\left(x_{n}-10\right) \longrightarrow P_{b} n-1,0 \\
M_{d} n-1+D\left(x_{n}-01\right) \longrightarrow P_{c} n=P_{d} n \cdot 1,0
\end{array}\right. \\
& M_{d} n=\min \left\{\begin{array}{l}
M_{b} n \cdot 1+D\left(x_{n}-01\right) \longrightarrow P_{d} n=P_{d} n-1,1 \\
M_{d} n \cdot 1+D\left(x_{n}-10\right) \longrightarrow \longrightarrow P_{d} n-1,1
\end{array}\right.
\end{aligned}
$$

The validity of these relationships is general (they also apply to $n=1$ and $n=2$ ) if we put $M_{a} 0=0, M_{b} 0=R$, $M_{c} 0=S$ and $M_{d} 0=T$, where $R>3, S>2$ and $T>3$.
This can be demonstrated by carrying out the first three steps in the algorithm with these initial values for all possible received bit sequences. As a result of the minimizing procedure, $R, S$ and $T$ will disappear from the $M^{3}$ values after the third step.

### 3.3 Some comments on the metric and path registers

Up to now it has been assumed that the initial state was $a$; in other words, that the shift register initially contained three zeros. In general, this will not be the case. If we drop this assumption and start at an arbitrary node in the trellis diagram, then all four initial metric values are taken to be zero. It is further obvious that if an arbitrary codeword has been correctly received up to a certain instant $i$, only four combinations of metric values are possible, viz.

| $M_{a}{ }^{i}=0$ | $M_{a}{ }^{i}=2$ | $M_{a}{ }^{i}=3$ | $M_{a}{ }^{i}=3$ |
| :---: | :---: | :---: | :---: |
| $M_{b}{ }^{i}=2$ | $M_{b}{ }^{i}=0$ | $M_{b}{ }^{i}=3$ | $M_{b}{ }^{i}=3$ |
| $M_{C}{ }^{i}=3$ | $M_{C}{ }^{i}=3$ | $M_{C}{ }^{i}=0$ | $M_{c}{ }^{i}=2$ |
| $M_{d}{ }^{i}=3$ | $M_{d}{ }^{i}=3$ | $M_{d^{i}}=3$ | $M_{d}{ }^{i}=0$ |

It should also be recognized that for each step only the mutual differences between the metrics are of importance. In designing a decoder, advantage may be taken of this feature by always deducting from all metric values the smallest one, $M_{m}$. In this way we prevent the metric values from becoming excessive (the bit sequences may be very long). By this procedure the total number of possible metric combinations is limited to 31 , as can be ascertained by taking 4 arbitrary metric values as a starting point and the comparing all possible sequences and the corresponding metrics. These are entered in Table 4.

This table shows that the highest occuring metric value is 3 . Four two-bit memories are therefore sufficient for storing the metric values. The metric calculations may be performed by a simple combinatorial network or even a PROM or a PLA.

As far as the path registers are concerned, it should be noted that $P_{a}{ }^{i}$ always terminates with $00, P_{b}{ }^{i}$ with $01, P_{c}{ }^{i}$ with 10 and $P_{d}{ }^{i}$ with 11 , as clearly shown by the trellis diagram. For example, to get from an arbitrary node to $d$ in two
steps, it is always necessary to travel along two dashed lines, which means that $P_{d^{i}}$ always ends in two ones.
It is also of interest to know how long the shift registers must be to store the four paths. If we consider that messages may consist of many thousands of bits, it will become obvious that it would be unpractical to wait until an entire message has been received before starting to read out the path with the smáltest metric value. Fortunately, after a number of steps, the first bits of the four stored paths will coincide. These bits can then be read out, as will become clear from the following example.
Reverting to the trellis diagram shown in Fig.3.2, let us assume that the next bits received are 0101 . Omitting the paths which come to a dead end, extension of the diagram by the following two steps then gives the diagram shown in Fig.3.4. The diagram can now be simplified to that of Fig. 3.5 by again omitting the paths which can be deleted or come to a dead end. This diagram shows that all four paths have the first bit (1) in common. Therefore, this bit can safety be read out.
However, there is no certainty that after the next step all paths will again have a bit in common. Therefore, if for each path we were to store only six bits in a six-bit shift register and, after each step, read out that overflow bit which had the smallest metric value of the four, we would risk introducing additional bit errors due to premature truncation of the stored paths.
It is difficult to evaluate this path memory truncation error. Jacobs and Heller [6] ascertained that a path register with a length of 4 to 5 times the constraint length of the coder would as a rule be sufficient to avoid such errors. In the present case this would amount to a path register length of 12 to 15 bits. The influence of this length on the bit error probability will be discussed in Section 7 for several values of the error probability in a binary symmetric channel.

Table 4

| $M_{a}^{i}$ | $:$ | 0 | 1000 | 10 | 0111 | 0012 | 0211 | 0122 | 0222 | 0233 |  |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- |
| $M_{b}^{i}$ | $:$ | 0 | 01 | 0 | 10 | 1011 | 0021 | 2011 | 1022 | 2022 | 2033 |
| $M_{c}^{i}$ | $:$ | 0 | 0010 | 01 | 1101 | 1200 | 11102 | 2201 | 2202 | 3302 |  |
| $M_{d^{i}}$ | $:$ | 0 | 0001 | 01 | 1110 | 2100 | 1120 | 2210 | 2220 | 3320 |  |



Fig.3.4 Trellis diagram after reception of 4 more bits.


Fig.3.5 Four paths with common first bit.

## 4 THE GENERATING FUNCTIONS

To calculate error probabilities it is first necessary to investigate the distance properties of the code. Convolutional codes are group codes, which implies that the set of distances of the all-zeros codeword to all other codewords is equal to the set of distances of any specific codeword to all others. It will therefore be useful to ascertain how many paths deviate from the all-zeros path, at what distance each of these paths is located, and how many bit errors each path represents.
All this information can be expressed in terms of the socalled generating function, which we shall now derive. As a starting point we shall reconsider the state diagram shown in Fig.2.3. Since we are interested in the paths which deviate from the all zeros path and merge with it again later, we cut this diagram open at node $a$, so that it assumes the form shown in Fig.4.1. The distance of each branch to the all-zeros codeword will be denoted by an exponent of a formal variable $D$, so that the branch $a-b$ for example will be labeled $D^{2}$, the Hamming distance between 11 and 00 being 2. Fig.4.1 will thus become as shown in Fig.4.2.
We shall now investigate how many ways there are to pass through the diagram. Let us first consider the upper part, redrawn in Fig.4.3. This can be passed through in $D+D^{2}+D^{3}+\ldots=D /(1-D)$ ways. (Since $D$ is defined in the neighbourhood of 0 , this summation is permissible.) The diagram of Fig.4.2 may therefore be simplified to that of Fig.4.4.

We thus have the following possibilities of travelling through the whole diagram:
$\frac{D^{5}}{1-D}+\frac{D^{6}}{(1-D)^{2}}+\frac{D^{7}}{(1-D)^{3}}+\ldots=\frac{D^{5} /(1-D)}{1-D /(1-D)}=\frac{D^{5}}{1-2 D}$
This expression is called the generating function $\mathrm{T}(D)$, being:
$\mathrm{T}(D) \triangleq \frac{D^{5}}{1-2 D}=D^{5}+2 D^{6}+4 D^{7}+\ldots+2 k D^{k+5}$,
in which $k=0,1,2, \ldots$. This expression thus simply indicates that there is one path at distance 5 from the all zeros path, two at distance 6 , and so forth, In general there are $2^{k}$ paths at distance $k+5$.
We shall now also express in the generating function the lengths of the paths and the number of ones in each path (hence the number of bit errors if the zero code word is transmitted). To this end we label each branch in the diagram of Fig.4.1 with an $L$ and add an additional label $N$ to the branches which indicate a data-one (i.e. the branches in dashed line). The diagram of Fig. 4.5 thus obtained then contains all information.


Fig.4.1 State diagram cut open at node $a$.


Fig.4.2 As Fig.4.1, but with the formal variable $D$ introduced.


Fig. 4.3 The four upper branches of Fig.4.2.


Fig.4.4 Simplified representation of Fig.4.2.


Fig.4.5 Cut-open state diagram with the formal variables $D, L$ and $N$ introduced.

Analysing this diagram in the same way as Fig.4.2 yields:
$\mathrm{T}(D, L, N) \triangleq \frac{D^{5} L^{3} N}{1-D L(1+L) N}=$

$$
\begin{align*}
& D^{5} L^{3} N+D^{6} L^{4}(1+L) N^{2}+ \\
& \ldots+D^{5+k} L(1+L)^{k} N^{1}+k+\ldots \tag{4.2}
\end{align*}
$$

where $k$ is again $0,1,2,3, \ldots$. This expression has the following meaning.
There is one path at distance 5 of length 3 in which 1 dataone occurs; there are two paths at distance $6, \mathrm{viz}$. one of length 4 and one of length 5, two data-ones occurring in both paths, and so forth.
If, say, only $D$ and $N$ are of interest, $L$ is put equal to unity in eq.(4.2), which gives:
$\mathrm{T}(D, N) \triangleq \frac{D^{5} N}{1-2 D N}=$

$$
=D^{5} N+2 D^{6} N^{2}+
$$

$$
\ldots+2^{k} D^{k+5} N^{k+1}+\ldots
$$

$$
\begin{equation*}
(k=0,1,2, \ldots) \tag{4.3}
\end{equation*}
$$

These generating functions will be required in the next section for determining the several error probabilities.
The generating function $T(D)$ can also be derived in a more general way by means of the distance matrix, which indicates the weight necessary to change over from one state to another (or to the same state) in $n$ steps. The one-step matrix for the middle part of Fig.3.2 thus becomes:

whilst $\Delta^{0}+\Delta^{1}+\Delta^{2}+\ldots=\mathrm{I}+\Delta+\Delta^{2}+\ldots=(\mathrm{I}-\Delta)^{-1}$. We are interested only in $b \ldots c$, i.e. in the first row and the second column of the matrix $(I-\Delta)^{-1}$ :
$(\mathrm{I}-D)_{12}^{-1}=\frac{D}{1-2 D}$.
For the whole diagram of Fig. 4.2 we then get
$\mathrm{T}(D)=\frac{D^{5}}{1-2 D}$.

Both $\mathrm{T}(D, N)$ and $\mathrm{T}(D, L, N)$ can be calculated in an analogous way. For more complex convolutional codes this method is in fact preferable to the previous one which is apt to become very time-consuming.

## 5 ERROR PROBABILITIES

### 5.1 The error event probability

The error event probability is understood to be the probability of, at a certain node in the trellis diagram, an erroneous path being chosen which merges again with the correct path for the first time at that node.
It follows clearly from the trellis diagram that the shortest path which deviates from the all-zeros path is the path $a-b-c-a(1111011)$, corresponding to the data sequence 100 . It is situated at distance 5 (termed the minimum free distance) from the all-zeros path.
Let us introduce the symbol $p$ to denote the probability of a 1 being received when a 0 has been transmitted via a binary symmetric channel (BSC) or vice versa. The probability of a bit being correctly received thus amounts to $1-p=q$ (see Fig.5.1). We shall now ascertain the probability of this path at distance 5 being chosen if the all-zeros codeword has been transmitted.


Fig. 5.1 Representation of a binary symmetric channel.

There are $\binom{5}{3}$ possible combinations of 2 zeros and 3 ones in the positions $1,2,3,5$ and 6 of a sequence of 6 bits. The 4th bit can be disregarded because it is a 0 in both sequences and therefore does not contribute to the probability of an erroneous path being chosen. These $\binom{5}{3}$ sequences are all at distance 2 from the path 111011 and at distance 3 from the all-zeros path. The probability of the path 111011 being chosen instead of the correct 000000 if 3 of the 5 bits are not correctly received thus amounts to $\binom{5}{3} p^{3} q^{2}$.
An analogous argument applies to the $\binom{5}{4}$ possible combinations of 1 zero and 4 ones and to the sequence of 5 ones. The total probability of the erroneous path being chosen is thus
$P_{5}=\sum_{e=3}^{5}\binom{5}{e} p^{e} q^{5-e}$.
A similar equation can be derived for any path at an odd distance, so that, in general,
$P_{k}=\sum_{e=(k+1) / 2}^{\sum_{e}^{k}}\left(_{k}^{k}\right) p^{e} q^{k-e}, \quad(k$ is odd $)$.

There is also a path in the trellis diagram at distance 6 from the all-zeros path, e.g. $a-b-d-c-a(11010111)$; this corresponds to the data sequence 1100 , hence of length 4 .

This path will be chosen if 4 or more of the bits in the positions $1,2,4,6,7$ and 8 of the sequence 00000000 are not correctly received. If exactly 3 of these 6 bits are erroneous, the correct and the erroneous path will both be at distance 3 from the received sequence. The probability of the erroneous path then equals the probability of the correct path being chosen and thus amounts to $1 / 2$. The total probability of a path at distance 6 being chosen is therefore
$P_{6}=\frac{1}{2}\binom{6}{3} p^{3} q^{3}+\sum_{e=4}^{6}\binom{6}{e^{6}} p^{e} q^{6-e}$.

In general, for a path at an even distance,
$P_{k}=\frac{1}{2}\binom{k}{k / 2} p^{k / 2} q^{k / 2}+\sum_{e=k / 2+1}^{k}\binom{k}{e} p^{e} q^{k-e}$
( $k$ is even)

According to the generating function $\mathrm{T}(D)$, there is one path at distance 5 , two paths at distance 6 , and in general $2^{k}$ paths at distance $k+5$ (cf. Eq.(4.1)). It is hardly feasible to calculate the probability of one of the many erroneous paths being chosen at any given node; we can say, however, that this probability is in any case smaller than the sum of the probabilities for any possible path, as given by

$$
\begin{aligned}
& P_{E}<P_{5}+2 P_{6}+4 P_{7}+\ldots+2^{k} P_{k+5}+\ldots \\
& \\
& \quad(k=0,1,2, \ldots)
\end{aligned}
$$

In Appendix 1 it is demonstrated that $P_{5}=P_{6}, P_{7}=P_{8} \ldots$ and, in general, that $P_{k}=P_{k-1}$ for even values of $k$. In Appendix 2 it is derived that
$P_{k}<\frac{5}{32}(2 \sqrt{ } p)^{k}$,
whence we may write:

$$
\begin{align*}
P_{E} & <3 P_{6}+12 P_{8}+\ldots+3 \times 4 k P_{2 k+6}+\ldots,(k=0,1,2, \ldots) \\
& <\frac{5}{32} \times 3\left\{(2 \sqrt{ } p)^{6}+4(2 \sqrt{ } p)^{8}+4^{k}(2 \sqrt{ } p)^{2 k+6}+\ldots\right) \\
& =\frac{15}{32} \cdot \frac{(2 \sqrt{ } p)^{6}}{1-4(2 \sqrt{ } p)^{2}}=\frac{15}{32} \cdot \frac{64 p^{3}}{1-16 p}=\frac{30 p^{3}}{1-16 p}, \tag{5.3}
\end{align*}
$$

provided that $p<1 / 16$.

### 5.2 The bit error probability

The bit error probability $P_{B}$ is defined as the ratio of the expected number of bit errors in the decoded data sequence to the total number of bits transmitted. From the generating function

$$
\begin{array}{r}
\mathrm{T}(D, N)=D^{5} N+2 D^{6} N^{2}+\ldots+2^{k} D^{k+5} N^{k+1}+\ldots, \\
(k=0,1,2, \ldots) \tag{4.3}
\end{array}
$$

it follows that there are $2^{k}$ paths at distance $k+5$, each of which corresponds to $(k+1)$ ones in the original data sequence. The exponents of $N$ thus determine the number of bit errors per path. To obtain these exponents as weighting factors before each term, the function $\mathrm{T}(D, N)$ should be differentiated with respect to $N$. Subsequently $N$ can be eliminated again from the derivative by putting $N=1$; thus,

$$
\begin{array}{r}
\left.\frac{\mathrm{dT}(D, N)}{\mathrm{d} N}\right|_{N=1}=D^{5}+2 \times 2 D^{6}+\ldots+(k+1) 2^{k} D^{k+5}+\ldots \\
(k=0,1,2, \ldots)
\end{array}
$$

In a similar way as for $P_{E}$ we find for the bit error probability:
$P_{B}<P_{5}+2 \times 2 P_{6}+\ldots+(k+1) 2^{k} P_{k+5}+\ldots,(k=0,1,2, \ldots)$ and with $P_{k}=P_{k-1}$ for even values of $k$ and $P_{k}<\frac{5}{32}(2 \sqrt{ } p)^{k}$ :
$P_{B}<5 P_{6}+4 \times 11 P_{8}+4 k(6 k+5) P_{2 k+6}+\ldots, \quad(k=0,1,2, \ldots)$

$$
<\frac{5}{32}\left\{5(2 \sqrt{ } p)^{6}+44(2 \sqrt{ } p)^{8}+\ldots+4^{k}(6 k+5)(2 \sqrt{ } p)^{2 k+6}+\ldots\right\}
$$

$$
=\frac{5}{32}\left\{24(2 \sqrt{ } p)^{8}+192(2 \sqrt{ } p)^{10}+\ldots+4^{k} \cdot 6 k(2 \sqrt{ } p)^{2 k+6}+\ldots\right\}+\frac{5}{32}\left\{5(2 \sqrt{ } p)^{6}+20(2 \sqrt{ } p)^{8}+\ldots+4 k \cdot 5(2 \sqrt{ } p)^{2 k+6}+\ldots\right\}
$$

$$
\begin{equation*}
=\frac{960 p^{4}}{(1-16 p)^{2}}+\frac{50 p^{3}}{1-16 p}=50 p^{3} \frac{1+3,2 p}{(1-16 p)^{2}}, \tag{5.4}
\end{equation*}
$$

provided that $p<1 / 16$.

### 5.3 Generally valid upper bounds for PE and $P_{B}$

Viterbi [5] calculated different upper bounds by demonstrating that
$P_{k}<\{2 \sqrt{p(1-p)}\}^{k}$,
$P_{B}<\left.\frac{\mathrm{dT}(D, N)}{\mathrm{d} N}\right|_{N=1, D=2 \sqrt{p(1-p)}}=\frac{\{2 \sqrt{p(1-p)}\}^{5}}{\{1-4 \sqrt{p(1-p)}\}^{2}}$

By making use of the relations $P_{k}=P_{k-1}$ for even $k$ and $P_{k}<\Gamma(2 \sqrt{ } p)^{k}$ where $\Gamma$ is a constant which is determined by the minimum free distance of the used convolutional code (in the case under consideration $\Gamma=5 / 32$ ), it is possible to derive tighter bounds, as shown in Appendix 3 and previously by van de Meeberg [14]:

$$
\begin{align*}
& P_{E}<\Gamma\left\{\frac{\mathrm{T}(D)+\mathrm{T}(-D)}{2}+D \frac{\mathrm{~T}(D)-\mathrm{T}(-D)}{2}\right\}_{D=2 \sqrt{ } p}  \tag{5.7}\\
& P_{B}<\Gamma\left\{\frac{\frac{\mathrm{dT}(D, N)}{\mathrm{d} N}+\frac{\mathrm{dT}(-D, N)}{\mathrm{d} N}}{2}+D \frac{\frac{\mathrm{dT}(D, N)}{\mathrm{d} N}-\frac{\mathrm{dT}(-D, N)}{\mathrm{d} N}}{2}\right\} \begin{array}{l}
N=1, \\
D=2 \sqrt{ } p
\end{array} \tag{5.8}
\end{align*}
$$

It is true that these expressions are less compact than those derived by Viterbi, but the upper bounds given by eqs (5.7) and (5.8) are considerably tighter. For the sake of comparison the upper bounds given by the eqs (5.6) and (5.8) have been plotted in Fig.5.2 as functions of the channel crossover error probability $p$. For small values of $p$ the bound according to eq.(5.8) asymptotically approaches
$50 p^{3}$, whereas the bound according to eq.(5.6), as derived by Viterbi, approaches $32 p^{21 / 2}$. Hence the smaller the value of $p$, the greater will be the difference between the two asymptotes.
In the graph of Fig.5.2 the measured error rate curve has also been plotted; how the measurements were made is discussed in Section 6.


Fig.6.1 Circuit of the metric registers.

## 6 REALIZATION OF THE DECODER

The design of the decoder was based on the use of TTL MSI-circuits (Fairchild 9300 and 9000 series), complemented where necessary by Texas Instruments circuits (7483 and 74164).

### 6.1 The metric registers

Fig.6.1 (opposite) shows the circuit diagram of the metric registers. We shall discuss its operation with reference to Table 3.

Assume the values $M^{n-1}$ to be present at the $Q$ outputs of the four shift registers $9300(A, B, C$ and $D)$. (The $Q_{1}$ outputs are followed by exclusive ORs connected as buffers to cope with the low input resistance of the $A_{3}$ inputs of the full-adders.) In the eight full-adders 7483 these values are now added to the Hamming distances of the $n$th pair of received bits $x_{n}$ to $00,11,10$ and 01 , grouped according to the formulae.

By way of example, the sum of $M_{d}{ }^{n-1}$ and the Hamming distance from $x_{n}$ to 00 appears at the output of the fulladder A 00 . The Hamming distances are determined by the circuit of Fig.6.2; the truth table and switching functions are given in Tables 5 and 6 respectively.

The outputs of the full-adders are subsequently compared two by two and multiplexed. The $A<B$ outputs of the comparators 9324 ( $A, B, C$ and $D$ ) are linked to the select inputs of the multiplexers $9322(A, B, C$ and $D)$ so that at their outputs the minima of the two presented full-adder outputs appear. Expressed in terms of the formulae in Table 3, this amounts to $M_{a}{ }^{n}, M_{b}{ }^{n}, M_{c}{ }^{n}$ and $M_{d} n$ appearing at the outputs of the multiplexers.
Subsequently the minimum of $M_{a}{ }^{n}$ and $M_{b}{ }^{n}$ and that of $M_{C}{ }^{n}$ and $M_{d} n$, and finally the minimum of these two minima are determined in the same way. We thus see that $M_{m}{ }^{n}$ is the minimum of ( $M_{a} n, M_{b}{ }^{n}, M_{c} n, M_{d} n$ ). As pointed out in Section 3.3, this value of $M_{m}{ }^{n}$ should be deducted from all four values of $M^{n}$.

Table 5

|  | $d-11$ |  |  |  |  |  |  |  | $d-00$ |  | $d-01$ |  | $d-10$ |  |
| :--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $x_{n 1}$ | $x_{n 2}$ | $i$ | $j$ | $i$ | $j$ | $i$ | $j$ | $i$ | $j$ |  |  |  |  |  |
| 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |  |  |  |  |  |
| 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 |  |  |  |  |  |
| 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 |  |  |  |  |  |
| 1 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 |  |  |  |  |  |

Table 6

| output | $i$ | $j$ |
| :---: | :---: | :---: |
| $\bar{d} \overline{1} 1$ | $\bar{x}_{n 1}$ | $\bar{x}_{n 2}$ |
| $d-00$ | $x_{n 1}$ | $x_{n 2}$ |
| $d-01$ | $x_{n 1}$ | $\bar{x}_{n 2} \theta x_{n 2}$ |
| $d-10$ | $\bar{x}_{n 1} \theta x_{n 2}$ | $x_{n 2}$ |
| $\overline{x_{n 1} \theta x_{n 2}}$ |  |  |
| $x_{n 1} \theta x_{n 2}$ |  |  |



Fig.6.2 Logic circuit for determining the Hamming distances.

This is achieved by the circuit connected to the output of multiplexer 9322 (M); it determines the twos complement of $M_{m}{ }^{n}$. The truth table and switching functions are given in Table 7.

Table 7

| $9322(\mathrm{M})$ |  |  | 7483 |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
| $Z_{c}$ | $Z_{b}$ | $Z_{a}$ | $B_{4}$ | $B_{3}$ | $B_{2}$ |
| 0 | 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 1 | 1 | 1 | 1 |
| 0 | 1 | 0 | 1 | 1 | 0 |
| 0 | 1 | 1 | 1 | 0 | 1 |
| 1 | 0 | 0 | 1 | 0 | 0 |
| 1 | 0 | 1 | 0 | 1 | 1 |
| 1 | 1 | 0 | 0 | 1 | 0 |
| 1 | 1 | 1 | 0 | 0 | 1 |

$$
\begin{aligned}
B_{2} & =Z_{a} \\
B_{3} & =Z_{a} \theta Z_{b} \\
\bar{B}_{4} & =\bar{Z}_{a} \bar{Z}_{b} Z_{c}+Z_{a} \bar{Z}_{c}+Z_{b} \bar{Z}_{c} \\
& =\bar{Z}_{c} \theta\left(\bar{Z}_{a} \bar{Z}_{b}\right)
\end{aligned}
$$

The additional exclusive-OR has been provided to act as a buffer to cope with the four $B 3$-inputs of the full-adders.
The twos complement of $M_{m}{ }^{n}$ is added to $M_{a}^{n}, M_{b}{ }^{n}, M_{c} n$ and $M_{d}{ }^{n}$ by means of the full-adders $7483(A, B, C$ and $D)$. The result is fed back to the $P$ inputs of the corresponding shift registers $9300(A, B, C$ and $D)$, so that at the next clock puise the new values $M^{n}-M_{m}{ }^{n}$ appear at the $Q$ outputs. The next two bits $x_{n+1}$ can now be shifted in and processed.
The initial values $M^{0}$ can be set to zero at the start of the decoding process by the master reset inputs of the shift registers. This implies that we start at an arbitrary level of the trellis diagram, thus taking the initial state of the encoder to be unknown.

The $A<B$ outputs $a, b, c, d$ of the relevant four comparators can be used for loading the path registers, whilst the $A<B$ outputs $p, q, r$ of the three other comparators can serve for reading out these registers, as discussed in the next subsection.

### 6.2 The path registers

Let us first consider the outputs $a, b, c$ and $d$ of the metric circuit, to which the following relationships apply:
if $M_{a} n-1+D\left(x_{n}-00\right)<M_{c} n-1+D\left(x_{n}-11\right)$, then $a=\mathrm{H}$
if $M_{a} n-1+D\left(x_{n}-00\right) \geqslant M_{C} n-1+D\left(x_{n}-11\right)$, then $a=\mathrm{L}$
if $M_{a} n-1+D\left(x_{n}-11\right)<M_{C} n-1+D\left(x_{n}-00\right)$, then $b=\mathrm{H}$
if $M_{a} n-1+D\left(x_{n}-11\right) \geqslant M_{C} n-1+D\left(x_{n}-00\right)$, then $b=\mathrm{L}$
if $M_{b} n-1+D\left(x_{n}-10\right)<M_{d} n-1+D\left(x_{n}-01\right)$, then $c=\mathrm{H}$
if $M_{b} n-1+D\left(x_{n}-10\right) \geqslant M_{d} n-1+D\left(x_{n}-01\right)$, then $c=\mathrm{L}$
if $M_{b}^{n-1}+D\left(x_{n}-01\right)<M_{d}^{n-1}+D\left(x_{n}-10\right)$, then $d=\mathrm{H}$
if $M_{b} n-1+D\left(x_{n}-01\right) \geqslant M_{d} n-1+D\left(x_{n}-10\right)$, then $d=\mathrm{L}$

This may be expressed by Table 8, which shows how the path registers must be filled.

Table 8

| if | then | if | then |
| :--- | :---: | :---: | :---: |
| $a=\mathrm{H}$ | $P_{a}^{n}:=P_{a} n \cdot 1,0$ | $a=\mathrm{L}$ | $P_{a} n:=P_{c} n \cdot 1,0$ |
| $b=\mathrm{H}$ | $P_{b} n:=P_{a} n \cdot 1,1$ | $b=\mathrm{L}$ | $P_{b} n:=P_{c} n \cdot 1,1$ |
| $c=\mathrm{H}$ | $P_{c}{ }^{n}:=P_{b} n \cdot 1,0$ | $c=\mathrm{L}$ | $P_{c} n:=P_{d} n \cdot 1,0$ |
| $d=\mathrm{H}$ | $P_{d}{ }^{n}:=P_{b} n-1,1$ | $d=\mathrm{L}$ | $P_{d} n:=P_{d} n \cdot 1,1$ |

Table 9 holds for the outputs $p, q$ and $r$.

Table 9

| $p$ | $q$ | $r$ | $\dot{p}$ | $r$ |  | read | $S_{0}$ | $S_{1}$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 0 | 0 | 0 | $M_{b} \leqslant M_{a}$ | $M_{d} \leqslant M_{c}$ | $M_{d} \leqslant M_{b}$ | $P_{d}$ | 1 | 1 |
| 0 | 0 | 1 | $M_{b} \leqslant M_{a}$ | $M_{c}<M_{d}$ | $M_{c} \leqslant M_{b}$ | $P_{c}$ | 0 | 1 |
| 0 | 1 | 0 | $M_{b} \leqslant M_{a}$ | $M_{d} \leqslant M_{c}$ | $M_{b}<M_{d}$ | $P_{b}$ | 1 | 0 |
| 0 | 1 | 1 | $M_{b} \leqslant M_{a}$ | $M_{c}<M_{d}$ | $M_{b}<M_{c}$ | $P_{b}$ | 1 | 0 |
| 1 | 0 | 0 | $M_{a}<M_{b}$ | $M_{d} \leqslant M_{c}$ | $M_{d} \leqslant M_{a}$ | $P_{d}$ | 1 | 1 |
| 1 | 0 | 1 | $M_{a}<M_{b}$ | $M_{c}<M_{d}$ | $M_{c} \leqslant M_{a}$ | $P_{c}$ | 0 | 1 |
| 1 | 1 | 0 | $M_{a}<M_{b}$ | $M_{d} \leqslant M_{c}$ | $M_{a}<M_{d}$ | $P_{a}$ | 0 | 0 |
| 1 | 1 | 1 | $M_{a}<M_{b}$ | $M_{c}<M_{d}$ | $M_{a}<M_{c}$ | $P_{a}$ | 0 | 0 |



Fig.6.3 Logic circuit for selecting the path registers.
$S_{0}$ and $S_{1}$ are intended to serve as the select inputs of a 4 -input multiplexer 9309, which reads out the path registers. The switching functions performed by the circuit shown in Fig.6.3 are
$S_{0}=\overline{p q+\bar{q} r} \quad$ and $\quad S_{1}=\bar{q}$.
The algorithm offers some arbitrary choices which we shall deal with before describing the path register. It will therefore be useful to discuss Tables 8 and 9 in some greater detail.
(1) When determining a new metric value it is necessary to choose the minimum of two previous metric values, each of which is augmented by a Hamming distance. If the two sums are equal, an arbitrary choice should be made between the two paths. In the circuit the lower of the two paths is then always chosen since in
the metric circuit only the $A<B$ outputs of the comparators $9324(A, B, C$ and $D)$ are used (cf. Table 8). This has been done for the sake of simplicity. Besides, for carrying out measurements with random signals it is immaterial which choice is made. However, this is not the case when the all-zeros codeword is transmitted, for in that case the upper of the two paths would be the best choice, whereas the lower path would be the best when the all-ones codeword is transmitted, because we then travel along the lower line of the trellis diagram. When the all-zeros codeword is transmitted the measured bit error rate is therefore likely to exceed the bit error rate measured with a random input signal; with the all-ones codeword the opposite is likely to be the case.
(2) A similar arbitrary choice is possible when reading out the path registers. If 2,3 or all 4 metric values are identical, a choice must be made between them. For the same reasons the circuit will then, too, choose the lower path (see Table 9).
For the sake of completeness some of the error probability measurements have been carried out not only with a random signal, but also with the all-zeros and the all-ones codewords.

Fig. 6.4 shows the circuit diagram of the first of the four identical sections of the 16 -bit path registers.
The $16 Q$ outputs $P_{a}$ of the registers 9300 denoted by AI, $\mathrm{A} 2, \mathrm{~A} 3, \mathrm{~A} 4$ and the $16 Q$ outputs $P_{C}$ of the registers 9300 denoted by $\mathrm{C} 1, \mathrm{C} 2, \mathrm{C} 3$ and C 4 are linked to the inputs of the multiplexers 9322 (A1, A2, A3, A4). The 16 outputs are connected to the $16 P$ inputs of the registers 9300 denoted by A1, A2, A3, A4. Depending on the select input, at the next clock pulse either $P_{a}$ or $P_{c}$ is read into $P_{a}$ in parallel (provided that $\overline{\mathrm{PE}}=\mathrm{L}$ ). If $a=\mathrm{H}$, then $P_{a} n_{:}=P_{a} n \cdot 1$, whereas in the event of $a=\mathrm{L}$, then $P_{a} n:=P_{c} n-1$.
In order to comply with the first line of Table 8, a 0 must still be shifted into the register. To this end $\overline{P E}$ is required to be H , so that at the next clock pulse a 0 is indeed shifted in; it should be recognized that the $\mathrm{J} \overline{\mathrm{K}}$ input of register 9300 (A1) is grounded.

A similar argument applies to $P_{b}, P_{c}$ and $P_{d}$ : however, a 1 is shifted into $P_{b}$ and $P_{d}$ at the second clock pulse.
The four NAND-gates 9009 (AD and BC) preceding the PEand CP -inputs of the shift registers again serve as buffers.
The 4-input multiplexer 9309 connected to the last stage of each path register selects the sixteenth bit of the path register corresponding to the lowest metric value. The readout information $S_{0}$ and $S_{1}$ is delayed by the latch 9314 (S); the delay must be equal to the duration of one code bit because the four new paths are present in the shift registers after the second clock pulse has been produced. The second of the two bits which consecutively appear at the $Z_{a}$-output of the multiplexer 9309 is always the decoded data bit. Since the bit rate of the code is twice that of the data, the duration of the decoded bit must be doubled. This is achieved by means of a latch incorporated in the circuit of the control unit discussed below.

### 6.3 The control unit

Fig. 6.5 shows the circuit diagram of the control unit and Fig.6.6 its timing diagram.
It is necessary for each pair of code bits to be fed from the code input of the decoder to the metric circuit. For this reason the code sequence is delayed for one bit-period by feeding $x_{n 1}$ and $x_{n 2}$ to the circuit of Fig. 6.2 via the latch 9314 (X).
The latch command coincides with the clock pulse for the metric registers. It is thus possible for the calculation of the new metric values and the shift-in and read-out information $a, b, c, d$ and $S_{0}, S_{1}$ to start at this instant. Once this calculation has been completed, the path registers can be filled. "PE path" must then obviously be low, and "CP path" is transferred from L to H . Subsequently "PE path" should become high; at the next positive flank of "CP path" the four new paths are thus stored in the shift register. At that instant the read-out information should be available. After reading-out, the decoded bit is available at $Z_{a}$ of multiplexer 9309 . This bit is stored by the latch 9314 (X) until, after two clock pulses, a new bit is available at $Z_{a}$ of multiplexer 9309.
It will be clear that the speed of the decoder is limited by the time the metric circuit needs for calculating the read-in information of the path. Theoretically (according to the specification of the integrated circuits used), this time is about 100 ns . The maximum repetition frequency of the input clock pulse is determined mainly by this time and the width of the $\overline{\mathrm{CP}}$ metric, which results in a maximum of about 2,5 megabits per second for the data signal.
To minimize the influence of mains and similar interferences on the decoder, all flip-flops and one-shots are controlled by a clock with the largest practicable duty cycle. In addition, various bypass capacitors are incorporated in the circuit. The circuit is nevertheless still affected to some extent by interference. This should be taken into account when carrying out measurements with very small error rates.

### 6.4 The error detector

By extending the 3-bit shift register of the encoder from 3 to 17 bits, the input signal can be brought roughly into phase with the decoded output signal. The latch 9314 (X) will then bring it completely into phase. If the input and output data sequences are then applied to an exclusive OR-gate it will register a 1 for each error in the decoded sequence. Feeding this output directly to a counter would result in two consecutive errors being recorded as only one, since there would be only one positive flank to actuate the counter. This is avoided by feeding the output signal of the XOR-gate, together with a clock signal produced by a oneshot, to a NAND-gate, which is in turn followed by a NAND-buffer. This results in a separate positive-going pulse for each individual error.




Fig.6.6 Tining of the control unit.

## 7 MEASUREMENTS

In order to measure the bit error rate as a function of the crossover error rate of a binary symmetric channel, the coder and decoder can be linked by a simulated channel with a variable error rate. The measuring set-up is then as shown in Fig.7.1.
J. Alma [15] devised a binary symmetric channel simulator in which the crossover probability can be adjusted stepwise in steps of $2^{-1}$ from $2^{-2}$ to $2^{-12}$. To generate the errors, a random data generator is incorporated in the circuit. The only random data generator available at the time of the investigations had to be controlled by a clock frequency not exceeding 50 kHz ; the maximum permissible code bit rate was thus limited to $4 \mathrm{kbit} / \mathrm{s}$, so the maximum data bit rate to only $2 \mathrm{kbit} / \mathrm{s}$. Since, according to eq.(5.4), the calculated upper bound of $P_{B}$ is $50 p^{3}(1+3,2 p) /(1-16 p)^{2}$, the expected bit error rate for $p=2^{-12}$ is at the most $10^{-9}$. At the maximum data bit rate of $2 \mathrm{kbit} / \mathrm{s}$ this amounts to 1 error every 6 days. It will be clear that carrying out error probability measurements in this range would be quite impracticable, the more so because the sensistivity to manmade interference would render the results unreliable. For this reason the measurements were-limited to $p=2^{-9}$.

The curves $P_{B}(p)$ shown in Fig. 7.2 were plotted by feeding the outputs "counter" and "data clock" to a programmable counter-timer. The latter directly computed $P_{B}$, i.e. the ratio of the number of bit errors in the decoded data sequence to the total number of data bits. The results were checked by measuring the crossover error rate $p$ in the same way.

As mentioned in Section 6.2, measurements were carried out not only with a pseudo-random data signal fed to the input of the encoder, but also with shorted input (all-zeros codeword) and with open-circuited input (all-ones codeword). The resulting curves are also plotted in Fig.7.2.

For small values of $p$ the measured bit error rate appears to be in excellent agreement with the bound given by eq.(5.4) and closely follows the asymptote $50 p^{3}$ (cf. Fig.5.2).
The bit error probability was also measured as a function of the path register length for three values of $p$, viz. $2^{-3}, 2^{-5}$ and $2^{-7}$. These measurements, which are plotted in Fig.7.3, showed that increasing the path register lengths beyond 12 bits scarely reduced $P_{B}$ any further. This confirms the conclusion of Heller and Jacobs [6], Viterbi [5], Odenwalder [13] and others, according to which a path register with a length of 4 to 5 times the constraint length is sufficient to justify disregarding path memory truncation errors.
The maximum data bit rate at which encoder and decoder still operate reliably without a binary symmetric channel being used, proved to be $2,46 \mathrm{Mbits} / \mathrm{s}$, almost as calculated in Section 6.3.
The total power consumption of the installation was 16 W (3,2 A at 5 V ).
It should be recognized that availability of a faster random data generator would have allowed the measurements to be carried out in less time and with greater accuracy.


Fig.7.1 Block diagram of the measuring circuit.


Fig.7.2 Measured bit error rate $P_{B}$ as a function of $p$. The dash-dot curve applies to a random input signal, the solid curve to the allzeros codeword and the dashed curve to the all-ones codeword.


Fig.7.3 Measured bit error rate as a function of the path register length for three values of $p$.

## 8 SUGGESTIONS FOR SIMPLIFYING THE CIRCUIT

### 8.1 Simplification of the metric circuit

At the time the circuit of Fig.6.1 was built, it was not realized that the four metric values would never exceed 3 (cf. Section 3.3); this explains why the design of the whole circuit was based on 3 bits. However, it can be proved that the metric values, even before subtracting $M_{\min }$, never will exceed 3, so that the part of the circuit which follows the four multiplexers 9322 ( $A, B . C$ and $D$ ) need only be capable of handling 2 bits.
For this same reason the four shift registers 9300 (A, B, C and D) could be replaced by one 8 -bit latch (e.g. 9308); moreover, the greater fan-out of the latter would then render the four buffers 9014 (B) superfluous.
It can also be proved that $M_{\min }$ can only be either 0 or 1 , i.e. 1 bit, so that the entire circuitry for the twos complement could be omitted. This will become clear by considering that the twos complement of 0 is 00 (for 2 bits), and that of 1 is 11 . It would therefore have been permissible to link both $B_{2}$ and $B_{3}$ of the 7483 s to $Z_{a}$ of the 9322 (M) via a buffer.

### 8.2 Large-scale integration

As explained in Section 3.3 (cf. Table 4), there are only 31 possible combinations of metric values. After every possible combination $M^{n-1}, x_{n}$ will be either $00,01,10$ or 11 . $M^{n}=M^{n}-M_{m}{ }^{n}$ will then again be one of the 31 combinations, as illustrated by the following example.
Assume $M^{n-1}$ to be $0,1,1,1$ (8th column of Table 4), then
if $x_{n}=00$
(cf. Table 3) $\left\{\begin{array}{l}M_{a} n=\min (0+0,1+2)=0 \\ M_{b} n=\min (0+2,1+0)=1 \\ M_{C} n=\min (1+1,1-1)=2 \\ M_{d} n=\min (1+1,1+1)=2\end{array}\right.$

| (1) | $+V_{c c}$ | (5) | $a$ | (9) |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
| (2) | GND | (6) | $b$ | (10) |  |
| (3) | $S_{0}$ | (7) | $c$ | (11) | latch command |
| (4) | $S_{1}$ | (8) | $d$ | (12) | reset (if desired) |

The smallness of the number of external connections required is attributable to the fact that the values of $M^{n}$ one of no further interest for the circuit, and hence no provision need be made for bringing them out. A cheaper solution to the metric circuitry might be a ROM or a PLA (programmable logic array).
It might also be practicable to integrate the path register circuit on a single chip, in which case 12 external connections would again suffice (cf. Fig.6.4), viz.:
(1) $+V_{c c}$
(5) $a$
(9) $Z_{a}$
(2) GND
(6) $b$
(10) CP
(3) $S_{0}$
(7) $c$
(11) $\overline{\mathrm{PE}}$
(4) $S_{1}$
(8) $d$
(12) $\overrightarrow{\mathrm{E}}$ select.

Large-scale integration of the path register circuit might be facilitated by reducing the bit length from 16 to 12 ; this would not noticeably affect the error-correcting properties of the decoder, as is evidenced by the graph of Fig.7.3.
Finally, it might be possible to apply large-scale integration to the control unit.

In this way it would be possible for the entire decoder to be composed of a few integrated circuits. The main objection against using a Viterbi decoder for digital transmission or in computer applications - its price - might thus be overcome. However, the manufacture of customized integrated circuits required for large-scale integration involves considerable initial expense, to justify which there must obviously be sufficient applications for this type of decoder.
in accordance with the 20th column of Table 4;

$$
\begin{aligned}
& \text { if } x_{n}=01 \\
& \text { (cf. Table 3) }
\end{aligned}\left\{\begin{array}{l}
M_{a} n^{\prime}=\min (0+1,1+1)=1 \\
M_{b} n^{\prime}=\min (0+1,1+1)=1 \\
M_{c} n^{\prime}=\min (1+2,1+0)=1 \\
M_{d} n^{\prime}=\min (1+0,1+2)=1
\end{array}\right\} M_{m}{ }^{n=1}=\left\{\begin{array}{l}
M_{a} n=0 \\
M_{b} n=0 \\
M_{c} n=0 \\
M_{d} n=0
\end{array}\right.
$$

in accordance with the first column of Table 4.
Similarly, if $x_{n}=10, M^{n}=0,0,0,0$ (first column)
if $x_{n}=11, M^{n}=1,0,1,1$ ( 9 th column)
These operations can be performed by a fairly simple combinatorial network to the inputs of which $x_{n}, M_{a}{ }^{n-1}$, $M_{b} n-1, M_{c}{ }^{n-1}$ and $M_{d}{ }^{n-1}$ are applied (for each, 2 bits are required), whilst at the outputs $M_{a}^{n}, M_{b}{ }^{n}, M_{c}{ }^{n}, M_{d}{ }^{n}$ ( 2 bits each) and $a, b, c, d, S_{0}, S_{1}$ ( 1 bit each) become available. It might be feasible to realize such a circuit on a single LSI chip, and if an 8-bit latch could also be accommodated on the same chip, only 12 connections would have to be made to it (cf. the metric circuit of Fig.6.1 and Figs 6.2 and 6.3), viz.

## Appendix 1

Given, for odd values of $k$ :
$P_{k}=\sum_{e=(k+1) / 2}^{k}\left(e_{e}^{k}\right) p^{e} q^{k-e}$,
and for even values of $k$ :
$P_{k}=\frac{1}{2}\binom{k}{k / 2} p^{k / 2} q^{k / 2}+\sum_{e=k / 2+1}^{k}\binom{k}{e} p^{e} q^{k} \cdot e$,
or, with $k=2 n-1 \quad$ and $\quad k=2 n$ respectively:
$P_{2 n-1}=\sum_{e=n}^{2 n-1}\binom{2 n-1}{e} p^{c} q^{2 n-1-e}$,
$P_{2 n}=\frac{1}{2}\binom{2 n}{n} p^{n} q^{n}+\sum_{e=n+1}^{2 n}\binom{2 n}{e} p^{e} q^{2 n-e}$.

To be proved: $P_{k}=P_{k-1}$ for even values of $k$, or $P_{2 n}=P_{2 n-1}$.
Proof:
$P_{2 n}=\frac{1}{2}\binom{2 n}{n} p^{n} q^{n}+\underset{e=n}{2 n-1}\binom{2 n}{e} p^{e} q^{2 n-e}+p^{2 n}-\binom{2 n}{n} p^{n} q^{n}$,
whilst
$\binom{2 n}{e}=\frac{(2 n)!}{(2 n-e)!e!}=\frac{(2 n-e+e) \cdot(2 n-1)!}{(2 n-e)!e!}$

$$
\begin{align*}
& =\frac{(2 n-e)(2 n-1)!}{(2 n-e)(2 n-e-1)!e!}+\frac{e(2 n-1)!}{(2 n-e)!e(e-1)!} \\
& =\frac{(2 n-1)!}{(2 n-e-1)!e!}+\frac{(2 n-1)!}{(2 n-e)!(e-1)!}=\binom{2 n-1}{e}+\binom{2 n-1}{e-1} . \tag{A1.2}
\end{align*}
$$

Substitution of eq.(A1.2) in eq.(A1.1) yields:
$P_{2 n}=-\frac{1}{2}\binom{2 n}{n} p^{n} q^{n}+\sum_{e=n}^{2 n-1}\binom{2 n-1}{e} p^{e} q^{2 n-e}+\sum_{e=n}^{2 n-1}\binom{2 n-1}{e-1} p^{e} q^{2 n-e}+p^{2 n}$.
Substituting $i$ for $e-1$ in the third term gives:

$$
\begin{align*}
P_{2 n} & =-\frac{1}{2}\binom{2 n}{n} p^{n} q^{n}+q P_{2 n-1}+\sum_{i=n-1}^{2 n-2}\binom{2 n-1}{i} p^{i+1} q^{2 n-1-i}+p^{2 n} \\
& =-\frac{1}{2}\binom{2 n}{n} p^{n} q^{n}+q P_{2 n-1}+\sum_{i=n}^{2 n-1}\binom{2 n-1}{i} p^{i+1} q^{2 n-1-i}+\binom{2 n-1}{n-1} p^{n} q^{n}-p^{2 n+p^{2 n}} \tag{A1.4}
\end{align*}
$$

moreover,

$$
\begin{equation*}
\binom{2 n-1}{n-1}=\frac{(2 n-1)!}{(n-1)!n!}=\frac{(2 n)!n}{2 n \cdot n!n!}=\frac{1}{2} \frac{(2 n)!}{n!n!}=\frac{1}{2}\binom{2 n}{n} \tag{A1.5}
\end{equation*}
$$

From eqs (A1.4) and (A1.5):
$P_{2 n}=q P_{2 n-1}+p P_{2 n-1}=P_{2 n-1}$
q.e.d.

## Appendix 2

As proved in Appendix 1, $P_{k}=P_{k-1}$ for even values of $k$; in that case

$$
\begin{equation*}
P_{k}=\sum_{e=k / 2}^{k-1}\binom{k-1}{e} p^{e} q^{k-1-e \leqslant}\binom{k-1}{k / 2} q^{k-1} \sum_{e=k / 2}^{k-1}\left(\frac{p}{q}\right)^{e} \tag{A2.1}
\end{equation*}
$$

since

$$
\begin{equation*}
\binom{k-1}{k / 2}>\binom{k-1}{k / 2+i},(i=1,2,3, \ldots, k / 2-1) \tag{A2.2}
\end{equation*}
$$

## Considering that

$$
\begin{equation*}
\frac{\binom{k-1}{k / 2}}{\binom{k-2-1}{(k-2) / 2}}=\frac{(k-1)!\left(\frac{k}{2}-1\right)!\left(\frac{k}{2}-2\right)!}{\frac{k}{2}!\left(\frac{k}{2}-1\right)!(k-3)!}=\frac{(k-1)(k-2)}{\frac{k}{2}\left(\frac{k}{2}-1\right)}=\frac{k-1}{\bar{k} / 4}=4 \frac{k-1}{k}<4, \tag{A2.3}
\end{equation*}
$$

eq.(A2.1) may be written:
$P_{k}<\binom{k^{\prime}-1}{k^{\prime} / 2} 2^{k-k^{\prime}} q^{k-1}\left(\frac{p}{q}\right)^{k / 2} \frac{1-(p / q)^{k / 2}}{1-p / q}$,
in which $k^{\prime}$ is the minimum even free distance of the
convolutional code concerned; in the case at issue $\dot{k}$ ' $=6$.
Hence, in eq.(A2.4), $k^{\prime}<k$.
Rearrangement of eq.(A2.4) yields
$P_{k}<\Gamma(2 \sqrt{ } p)^{k} \frac{q^{k / 2}-p^{k / 2}}{q-p}$,
where
$\Gamma \triangleq\binom{k-1}{k^{\prime} / 2} 2-k^{\prime}$.

In the case at issue,
$\Gamma=\binom{5}{3}_{2} \cdot 6=\frac{5}{32}$.

In the last term of eq.(A2.5) we now make the substitution $q=1 / 2+x$; and, from considerations of symmetry we can, without departing from generality, assume that $1 / 2<q \leqslant 1$, whence $0<x \leqslant 1 / 2$. The last term then becomes
$\frac{(1 / 2+x)^{k / 2}-(1 / 2-x)^{k / 2}}{2 x}=A(k)+B(k) x^{2}+C(k) x^{4}+\ldots$,
where $A, B, C$ are non-negative constants which depend exclusively on $k$. Eq.(A2.6) increases monotonically for $x>0$ and hence reaches a maximum at the maximum value of $x$. Substituting $1 / 2$ for $x$ yields 1 , so that
$\frac{q^{k / 2}-p^{k / 2}}{q-p} \leqslant 1$.
Then
$P_{k}<\Gamma(2 \sqrt{ } p)^{\mathrm{k}}$
for even values of $k$.

## Appendix 3

In general (cf. Viterbi [5]), if $P_{k}<\mathrm{F}(p) k$ :

$$
\begin{align*}
& P_{E}<\mathrm{T}(D){ }_{D=\mathrm{F}(p)}  \tag{A3.1}\\
& P_{B}<\left.\frac{\mathrm{dT}(D, N)}{\mathrm{d} N}\right|_{N=1, D=\mathrm{F}(p)}
\end{align*}
$$

In general $T(D)$ may be expressed by

$$
\begin{equation*}
\mathrm{T}(D)=a_{1} D+a_{2} D^{2}+a_{3} D^{3}+\ldots \tag{A3.3}
\end{equation*}
$$

and then in general:

$$
\begin{equation*}
P_{E}<a_{1} P_{1}+a_{2} P_{2}+a_{3} P_{3}+\ldots \tag{A3.4}
\end{equation*}
$$

or, making use of the fact that $P_{1}=P_{2}, P_{3}=P_{4}, \ldots$, as proved in Appendix 1,

$$
\begin{equation*}
=\quad P_{E}<\left(a_{1}+a_{2}\right) P_{2}+\left(a_{3}+a_{4}\right) P_{4}+\ldots \tag{A3.5}
\end{equation*}
$$

Making use of eq.(A2.8) and the definitions of $\Gamma$ and $k^{\prime}$ given in Appendix 2, we may write

$$
\begin{equation*}
P_{E}<\Gamma\left\{\left(a_{1}+a_{2}\right)(2 \sqrt{ } p)^{2}+\left(a_{3}+a_{4}\right)(2 \sqrt{ } p)^{4}+\ldots\right\} \tag{A3.6}
\end{equation*}
$$

To analyse this expression, we shall try to find a function of $\mathrm{T}(D)$ which assumes the general form

$$
\begin{equation*}
\mathrm{G}\{\mathrm{~T}(D)\}=\left(a_{1}+a_{2}\right) D^{2}+\left(a_{3}+a_{4}\right) D^{4}+\ldots, \tag{A3.7}
\end{equation*}
$$

which is satisfied by

$$
\begin{equation*}
\frac{\mathrm{T}(D)+\mathrm{T}(-D)}{2}+D \frac{\mathrm{~T}(D)-\mathrm{T}(-D)}{2} \tag{A3.8}
\end{equation*}
$$

Eq.(A3.6) may thus be expressed by

$$
\begin{equation*}
P_{E}<\Gamma\left\{\frac{\mathrm{T}(D)+\mathrm{T}(-D)}{2}+D \frac{\mathrm{~T}(D)-\mathrm{T}(-D)}{2}\right\}_{D=2 \sqrt{ } p} \tag{A3.9}
\end{equation*}
$$

In an analogous way it can be shown that

$$
\begin{equation*}
P_{B}<\Gamma\left\{\frac{\frac{\mathrm{dT}(D, N)}{\mathrm{d} N}+\frac{\mathrm{dT}(-D, N)}{\mathrm{d} N}}{2}+D \frac{\frac{\mathrm{dT}(D, N)}{\mathrm{d} N}-\frac{\mathrm{dT}(-D, N)}{\mathrm{d} N}}{2}\right\}_{N=1, D=2 \sqrt{ } p} \tag{A3.10}
\end{equation*}
$$

These upper bounds are thus generally valid for any convolutional code. We shall now show that eqs.(5.3) and (5.4) are indeed obtained by substituting in eqs.(A3.9) and (A3.10):

$$
\mathrm{T}(D)=\frac{D^{5}}{1-2 D} . \quad \mathrm{T}(D, N)=\frac{D^{5} N}{1-2 D} \quad \text { and } \quad \Gamma=\frac{5}{32}
$$

Eq.(A3.9) then yields

$$
\begin{align*}
P_{E} & \left.<\frac{5}{32}\left\{\frac{\frac{D^{5}}{1-2 D}-\frac{D^{5}}{1+2 D}}{2}+\frac{\frac{D^{6}}{1-2 D}+\frac{D^{6}}{1+2 D}}{2}\right\}_{D=2 \sqrt{ } p}\right\}_{D=2 \sqrt{ } p}^{2\left(1-4 D^{2}\right)} \\
& =\frac{5}{32}\left\{\frac{D^{5}+2 D^{6}-D^{5}+2 D^{6}+D^{6}+2 D^{7}+D^{6}-2 D^{7}}{}\right\}_{D=\frac{5}{32} \cdot \frac{3 \times 64 p^{3}}{1-4 \times 4 p}=\frac{30 p^{3}}{1-16 p},}
\end{align*}
$$

and eq.(A3.10) becomes:

$$
\begin{align*}
& \left.\frac{\mathrm{dT}(D, N)}{\mathrm{d} N}\right|_{N=1}=\frac{D^{5}}{(1-2 D)^{2}},\left.\quad \frac{\mathrm{dT}(-D, N)}{\mathrm{d} N}\right|_{N=1}=\frac{-D^{5}}{(1+2 D)^{2}} \\
& P_{B}<\frac{5}{32}\left\{\frac{\frac{D^{5}}{(1-2 D)^{2}}-\frac{D^{5}}{(1+2 D)^{2}}}{2}+\frac{\frac{D^{6}}{(1-2 D)^{2}}+\frac{D^{6}}{(1+2 D)^{2}}}{2}\right\} \begin{array}{l}
D=2 \sqrt{ } p
\end{array} \\
& =\frac{5}{32}\left\{\frac{D^{5}(1+2 D)^{2}-D^{5}(1-2 D)^{2}+D^{6}(1+2 D)^{2}+D^{6}(1-2 D)^{2}}{2\left(1-4 D^{2}\right)^{2}}\right\}_{D=2 \sqrt{ } p} \\
& =\frac{5}{32}\left\{\frac{5 D^{6}+4 D^{8}}{\left(1-4 D^{2}\right)^{2}}\right\}_{D=2 \sqrt{ } p}=50 p^{3} \frac{1+3,2 p}{(1-16 p)^{2}} . \tag{5.4}
\end{align*}
$$

## Bibliography

(1] J.M. Wozencraft, "Sequential decoding for reliable communication", IRE Nat. Conv. Record, vol.5, pt.2, pp. 11-25, 1957.
[2] J.M. Wozencraft and B. Reiffen, Sequential Decoding, Cambridge, Mass.: M.I.T. Press, and New York: Wiley, 1961.
[3] J.L. Massey, Threshold Decoding, Cambridge, Mass.: M.I.T. Press. 1963.
[4] R.M. Fano, "A heuristic discussion of probabilistic decoding", IEEE Trans. Inform. Theory, vol. IT-9, pp. 64-74, Apr. 1963.
[5] A.J. Viterbi, "Convolutional codes and their performance in communication systems", IEEE Trans. Commun. Technol., vol. COM:19, pp. 751-772, Oct. 1971.
[6] J.A. Heller and I.M. Jacobs, "Viterbi decoding for satellite and space communication", IEEE Trans. Commun. Technol., vol. COM-19, pp. 835-848, Oct. 1971.
[7] G.D. Forney, Jr., "The Viterbi algorithm", Proc. of the IEEE, vol. 61, pp. 268-278, Mar. 1973.
[8] J.K. Omura, "On the Viterbi algorithm", IEEE Trans. Inform. Theory, vol. IT-15, pp. 177-179, Jan. 1969.
[9] A.J. Viterbi, "Error bounds for convolutional codes and an asymptotically optimum decoding algorithm", IEEE Trans. Inform. Theory, vol. IT-13, pp. 260-269, Apr. 1967.
[10] E.A. Bucher and J.A. Heller, "Error probability bounds for systematic convolutional codes", IEEE Trans. Inform. Theory, vol. IT-16, pp. 219-224, Mar. 1970.
[11] J.L. Massey and M.K. Sain, 'Inverses of linear sequential circuits", IEEE Trans. Comput., vol. C-17, pp. 330-337. Apr. 1968.
[12] W.J. Rosenberg, "Stractural properties of convolutional codes", Ph. D. dissertation, Dept. Syst. Sci., Sch. Eng. Appl. Sci., Univ. California, Los Angeles, 1971.
[13] J.P. Odenwalder, "Optimal decoding of convolutional codes", Ph. D. dissertation, Dep. Syst. Sci., Sch. Eng. Appl. Sci., Univ. California, Los Angeles, 1970.
[14] L. van de Meeberg, "A tightened upper bound on the error probability of binary convolutional codes with Viterbi decoding", IEEE Trans. Inform. Theory, vol. IT-20, pp. 389-3917, May 1974.
[15] J.G.H.- Alma, "Een binair symmetrisch kanaal met instelbare foutenkans", Stageverslag Eindhoven University of Technology, Group ECA, Eindhoven, Apr. 1973.

## Acknowledgement

The author is greatly indebted to his thesis advisors Prof. J.P.M. Schalkwijk and A.P. Verlijsdonk of the Department of Electrical Engineering at the Eindhoven University of Technology, and to K.A. Post of the Mathematics Department, for helpful discussions and suggestions in preparing this report.

```
EINDHOVEN UNIVERSITY OF TECHNOLOGY
THE NETHERLANDS
DEPARTMENT OF ELECTRICAL ENGINEERING
```


## Reports:

1) Dijk, J., M. Jeuken and E.J. Maanders

AN ANTENNA FOR A SATELLITE COMMUNICATION GROUND STATION (PROVISIONAL ELECTRICAL DESIGN) . TH-report 68-E-01. March 1968. ISBN 9061440017
2) Veefkind, A., J.H. Blom and L.Th. Rietjens

THEORETICAL AND EXPERIMENTAL INVESTIGATION OF A NON-EQUILIBRIUM PLASMA IN A MHD CHANNEL. TH-report 68-E-2. March 1968. Submitted
to the Symposium on a Magnetohydrodynamic Electrical Power Generation, Warsaw, Poland, 24-30 July, 1968. ISBN 9061440025
3) Boom, A.J.W. van den and J.H.A.M. Melis

A COMPARISON OF SOME PROCESS PARAMETER ESTIMATING SCHEMES. TH-report 68-E-03. September 1968. ISBN 9061440033
4) Eykhoff, P., P.J.M. Ophey, J. Severs and J.O.M. Oome

AN ELECTROLYTIC TANK FOR INSTRUCTIONAL PURPOSES REPRESENTING THE COMPLEX-FREQUENCY PLANE. TH-report 68-E-04. September 1968. ISBN 9061440041
5) Vermij, L. and J.E. Daalder ENERGY BALANCE OF FUSING SILVER WIRES SURROUNDED BY AIR. TH-report 68-E-05. November 1968. ISBN 906144005 X
6) Houben, J.W.M.A. and P. Massee

MHD POWER CONVERSION EMPLOYING LIQUID METALS. TH-report 69-E-06. February 1969. ISBN 9061440068
7) Heuvel, W.M.C. van den and W.F.J. Kersten

VOLTAGE MEASUREMENT IN CURRENT ZERO INVESTIGATIONS. TH-report 69-E-07. September 1969. ISBN 9061440076
8) Vermij, I.

SELECTED BIBLIOGRAPHY OF FUSES. TH-report 69-E-08. September 1969. ISBN 9061440084
9) Westenberg, J.z.

SOME IDENTIFICATION SCHEMES FOR NON-LINEAR NOISY PROCESSES. TH-report 69-E-09. December 1969. ISBN 9061440092
10) Koop, H.E.M., J. Dijk and E.J. Maanders

ON CONICAL HORN ANTENNAS. TH-report 70-E-10. February 1970.
ISBN 9061440106
11) Veefkind, A.

NON-EQUILIBRIUM PHENOMENA IN A DISC-SHAPED MAGNETOHYDRODYNAMIC GENERATOR. TH-report 70-E-11. March 1970. ISBN 9061440114
12) Jansen, J.K.M., M.E.J. Jeuken and C.W. Lambrechtse THE SCALAR FEED. TH-report 70-E-12. December 1969. ISBN 9061440122
13) Teuling, D.J.A.

ELECTRONIC IMAGE MOTION COMPENSATION IN A PORTABLE TELEVISION CAMERA. TH-report 70-E-13. 1970. ISBN 9061440130
14) Lorencin, M.

AUTOMATIC METEOR REFLECTIONS RECORDING EQUIPMENT. TH-report 70-E-14. November 1970. ISBN 9061440149
15) Smets, A.J.

THE INSTRUMENTAL VARIABLE METHOD AND RELATED IDENTIFICATION SCHEMES. TH-report 70-E-15. Nọvember 1970. ISBN 9061440157
16) White, Jr., R.C.

A SURVEY OF RANDOM METHODS FOR PARAMETER OPTIMIZATION. TH-report 70- E-16. February 1971. ISBN 9061440165
17) Talmon, J.L.

APPROXIMATED GAUSS-MARKOV ESTIMATIONS AND RELATED SCHEMES. TH-report 71-E-17. February 1971. ISBN 9061440173
18) Kalásek, V.

MEASUREMENT OF TIME CONSTANTS ON CASCADE D.C. ARC IN NITROGEN. TH-report 71-E-18. February 1971. ISBN 9061440181
19) Hosselet, L.M.L.F.

OZONBILDUNG MITTELS ELEKTRISCHER ENTLADUNGEN. TH-report 71-E-19. March 1971. ISBN 906144019 X
20) Arts, M.G.J.

ON THE INSTANTANEOUS MEASUREMENT OF BLOODFLOW BY ULTRASONIC MEANS.
TH-report 71-E-20. May 1971. ISBN 9061440203
21) Roer, Th. G. van de

NON-ISO THERMAI ANALYSIS OF CARRIER WAVES IN A SEMICONDUCTOR. TH-report 71-E-21. August 1971. ISBN 9061440211
22) Jeuken, P.J., C. Huber and C.E. Mulders

SENSING INERTIAL ROTATION WITH TUNING FORKS. TH-report 71-E-22. September 1971. ISBN 906144022 X
23) Dijk, J. and E.J. Maanders

APERTURE BLOCKING IN CASSEGRAIN ANTENNA SYSTEMS. A REVIEW. TH-report 71-E-23. September 1971. ISBN 9061440238
24) Kregting, J. and R.C. White, Jr.

ADAPTIVE RANDOM SEARCH. TH-report 71-E-24. October 1971. ISBN 9061440246
25) Damen, A.A.H. and H.A.L. Piceni

THE MULTIPLE DIPOLE MODEL OF THE VENTRICULAR DEPOLARISATION. TH-report 71-E-25. October 1971. ISBN 9061440254 (In preparation).
26) Bremmer, H.

A MATHEMATICAL THEORY CONNECTING SCATTERING AND DIFFRACTION PHENOMENA, INCLUDING BRAGG-TYPE INTERFERENCES. TH-report 71-E-26. December 1971. ISBN 9061440262
27) Bokhoven, W.M.G. van

METHODS AND ASPECTS OF ACTIVE-RC FILTERS SYNTHESIS. TH-report 71-E-27. 10 December 1970. ISBN 9061440270
28) Boeschoten, F. TWO FLUIDS MODEL REEXAMINED. TH-report 72-E-28. March 1972. ISBN 9061440289
29) REPORT ON THE CLOSED CYCLE MHD SPECIALIST MEETING. Working group of the joint ENEA/IAEA international MHD liaison group.
Eindhoven, The Netherlands, September 20-22, 1971. Edited by L.H.Th. Rietjens.

TH-report 72-E-29. April 1972. ISBN 9061440297
30) Kessel, C.G.M. van and J.W.M.A. Houben

LOSS MECHANISMS IN AN MHD GENERATOR. TH-report 72-E-30. June 1972. ISBN 9061440300
31) Veefkind, A.

CONDUCTING GRIDS TO STABILIZE MHD GENERATOR PLASMAS AGAINST IONIZATION INSTABILITIES. TH-report 72-E-31. September 1972. ISBN 9061440319
32) Daalder, J.E. and C.W.M, Vos DISTRIBUTION FUNCTIONS OF THE SPOT DIAMETER FOR SINGLE- AND MULTICATHODE DISCHARGES IN VACUUM. TH-report 73-E-32. January 1973. ISBN 9061440327
33) Daalder, J.E.

JOULE HEATING AND DIAMETER OF THE CATHODE SPOT IN A VACUUM ARC. TH-report 73-E-33. January 1973. ISBN 9061440335
34) Huber, C.

BEHAVIOUR OF THE SPINNING GYRO ROTOR. TH-report 73-E-34. February 1973. ISBN 9061440343
35) Bastian, C. et al.

THE VACUUM ARC AS A FACILITY FOR RELEVANT EXPERIMENTS IN FUSION RESEARCH. Annual Report 1972. EURATOM-T.H.E. Group "Rotating Plasma". TH-report 73-E-35. February 1973. ISBN 9061440351
36) Blom, J.A.

ANALYSIS OF PHYSIOLOGICAL SYSTEMS BY PARAMETER ESTIMATION TECHNIQUES. 73-E-36. May 1973. ISBN 906144036 X
37) Lier, M.C. van and R.H.J.M. Otten

AUTOMATIC WIRING DESIGN. TH-report 73-E-37. May 1973. ISBN 9061440378 (vervalt zie 74-E-44)
38) Andriessen, F.J., W. Boerman and I.F.E.M. Holtz CALCUIATION OF RADIATION LOSSES IN CYLINDRICAL SYMMETRICAL HIGH PRESSURE DISCHARGES BY MEANS OF A DIGITAL COMPUTER. TH-report 73-E-38. October 1973. ISBN 9061440386
39) Dijk, J., C.T.W. van Diepenbeek, E.J. Maanders and L.F.G. Thurlings THE POLARIZATION LOSSES OF OFFSET ANTENNAS. TH-report 73-E-39. June 1973. ISBN 9061440394 (in preparation)
40) Goes, W.P.

SEPARATION OF SIGNALS DUE TO ARTERIAL AND VENOUS BLOOD FLOW IN THE DOPPLES SYSTEM THAT USES CONTINUOUS ULTRASOUND. TH-report 73-E-40. September 1973. ISBN 9061440408
41) Damen, A.A.H.

COMPARATIVE ANALYSIS OF SEVERAL MODELS OF THE VENTRICULAR DE-POLARISATION; INTRODUCTION OF A STRING-MODEL. TH-report 73-E-41. October 1973. ISBN $90 \quad 61440416$
42) Dijk, G.H.M. van

THEORY OF GYRO WITH ROTATING GIMBAL AND FLEXTURAL PRIOTS. TH-report 73-E-42. November 1973. ISBN 9061440424
43) Breimer, A.J.

ON THE IDENTIFICATION OF CONTINUOUS LINEAR PROCESSES. TH-report 74-E-43, January 1974. ISBN 9061440432
44) Lier, M.C. van and R.H.J.M. Otten CAD OF MASKS AND WIRING. TH-report 74-E-44. February 1974. ISBN $90 \quad 61440440$
45) Bastian, C. et al.

EXPERIMENTS WITH A LARGE SIZED HOLLOW CATHODE DISCHARGE FED WITH ARGON. Annual Report 1973. EURATOM-T.H.E. GRoup "Rotating Plasma". TH-report 74-E-45. April 1974. ISBN 9061440459
46) Roer, Th.G. van de

ANALYTICAL SMALL-SIGNAL THEORY OF BARITYT DIODES. TH-report 74-E-46. May 1974. ISBN 9061440467
47) Leliveld, W.H.

THE DESIGN OF A MOCK CIRCULATION SYSTEM. TH-report 74-E-47. June 1974. ISBN 9061440475
48) Damen, A.A.H.

SOME NOTES ON THE INVERSE PROBLEM IN ELECTRO CARDIOGRAPHY. TH-report 74-E-48. July 1974. ISBN 9061440483
49) Meeberg, L. van de

A VITERBI DECODER. TH-report 74-E-49. October 1974. ISEN 9061440491
50) Poel, A.P.M. van der

A COMPUTER SEARCH FOR GOOD CONVOLUTIONAL CODES. TH-report 74-E-50. October 1974. ISBN 9061440503
51) Sampic, G.

THE BIT ERROR PROBABILITY AS A FUNCTION PATH REGISTER LENGTH IN THE VITERBI DECODER. TH-report 74-E-51. October 1974. ISBN 9061440513
52) Schalkwijk, J.P.M.

CODING FOR A COMPUTER NETWORK. TH-report 74-E-52. October 1974. ISBN 9061440521
53) Stapper, M.

MEASUREMENT OF THE INTENSITY OF PROGRESSIVE ULTRASONIC WAVES BY MEANS OF RAMAN-NATH DIFRACTION. TH-report 74-E-53. November 1974.
ISBN 906144053 X
54) Schalkwijk, J.P.M. and A.J. Vinck

SYNDROME DECODING OF CONVOLUTIONAL CODES. TH-report 74-E-54. November 1974. ISBN 9061440548
55) Yakimov, A.

FLUCTUATIONS IN IMPATT-DIODE OSCILLATORS WITH LOW q-SECTORS. TH-report 7--E-55. November 1974. ISBN 9061440546

