What are the contributions mentioned in the paper "1-bit observation for direct learning based digital predistortion of rf power amplifiers" ?

Q: What are the contributions mentioned in the paper "1-bit observation for direct learning based digital predistortion of rf power amplifiers" ?

In this paper, the authors propose a low-cost data acquisition approach for model extraction of digital predistortion ( DPD ) of RF power amplifiers.

(Open Access) 1-bit Observation for Direct-Learning-Based Digital Predistortion of RF Power Amplifiers (2017) | Haoyu Wang

Provided by the author(s) and University College Dublin Library in accordance with publisher

policies. Please cite the published version when available.

Title 1-b Observation for Direct-Learning-Based Digital Predistortion of RF Power Amplifiers

Authors(s) Wang, Haoyu; Li, Gang; Zhou, Chongbin; Zhu, Anding; et al.

Publication date 2017-01-23

Publication information IEEE Transactions on Microwave Theory and Techniques, PP (99): 1-11

Publisher IEEE

Item record/more information http://hdl.handle.net/10197/8381

obtained for all other uses, in any current or future media, including reprinting/republishing

this material for advertising or promotional purposes, creating new collective works, for

resale or redistribution to servers or lists, or reuse of any copyrighted component of this

work in other works

Publisher's version (DOI) 10.1109/TMTT.2016.2642945

Downloaded 2022-08-10T08:55:37Z

The UCD community has made this article openly available. Please share how this access

benefits you. Your story matters! (@ucd_oa)

1-Bit Observation for Direct Learning Based

Digital Predistortion of RF Power Ampliﬁers

Haoyu Wang, Gang Li, Chongbin Zhou, Wei Tao, Falin Liu, and Anding Zhu, Senior Member, IEEE

Abstract—In this paper, we propose a low-cost data acquisition

approach for model extraction of digital predistortion (DPD) of

RF power ampliﬁers. The proposed approach utilizes only 1-bit

resolution analog-to-digital converters (ADCs) in the observation

path to digitize the error signal between the input and output

signals. The DPD coefﬁcients are then estimated based on the

direct learning architecture using the measured signs of the

error signal. The proposed solution is proved to be feasible in

theory and the experimental results show that the proposed

algorithm achieves equivalent performance as that using the

conventional method. Replacing high resolution ADCs with 1-

bit comparators in the feedback path can dramatically reduce

the power consumption and cost of the DPD system. The 1-bit

solution also makes DPD become practically implementable in

future broadband systems since it is relatively straightforward

to achieve an ultra-high sampling speed in data conversion by

using only simple comparators.

Index Terms—Analog-to-digital converter (ADC), digital pre-

distortion (DPD), error signal, linearization, low resolution,

power ampliﬁer (PA), wideband.

I. INTRODUCTION

N the past twenty years or so, digital predistortion (DPD)

has become one of the most popular linearization tech-

niques for radio frequency (RF) power ampliﬁers (PAs) in

wireless communication systems, especially in cellular base

stations [1], [2]. Although it already seems to be a well-

established technique at current stage, DPD is still facing

new challenges since the development of the next generation

communication system never stops [2], [3]. For instance, most

current DPD solutions are employed in middle to high power

base stations where power consumption and cost of DPD units

are negligible [2]. In future networks, small-cell base stations

will be deployed, where the output power of the PA becomes

much lower and thus the power consumption and cost of the

digital components become an issue. There are many efforts

having been made to address this issue. One idea is to employ

new algorithms to simplify the DPD model. For example,

compressed sensing (CS) has recently been introduced to DPD

to reduce the model complexity [4]–[6]. It also has been

shown that some of the distortion compensation, that is usually

done at the transmitter side, can be moved to the receiver

This work was supported in part by the National Natural Science Foundation

of China under Grant Number 61471333 and by the Science Foundation

Ireland under Grant Numbers 13/RC/2077 and 12/IA/1267.

H. Wang, G. Li, C. Zhou, W. Tao and F. Liu are with the Department of

Electronic Engineering and Information Science, University of Science and

Technology of China, and also with the Key Laboratory of Electromagnetic S-

pace Information, Chinese Academy of Sciences, Hefei, Anhui, China (e-mail:

{wamhoyle; lgml; zhouzcb; jxtaowei}@mail.ustc.edu.cn; liuﬂ@ustc.edu.cn).

A. Zhu is with the School of Electrical and Electronic Engineering,

University College Dublin, Dublin 4, Ireland (e-mail: anding.zhu@ucd.ie).

side, to reduce the complexity and power consumption of the

transmitters in small cells [7], [8].

One of the main concerns in DPD implementation is the

bandwidth requirement of the feedback path that is used to

capture the output signal from the PA for the purpose of

model extraction. With carrier aggregation (CA), the signal

bandwidth in long-term evolution-advanced (LTE-A) is up to

100 MHz already and it will be increased to 160 MHz or wider

soon [3]. For the coming 5th generation (5G) systems, the

signal bandwidth will be even wider. In DPD, the bandwidth

of the feedback path usually requires ﬁve times of the signal

bandwidth which means that mutli-giga samples per second

(GSPS) analog-to-digital converters (ADCs) are required. The

existing and forthcoming data converter technologies, however,

could hardly meet this requirement.

Some solutions have been proposed to reduce the sig-

nal bandwidth requirement. The band-limited method was

proposed in [9] but it requires an extra bandpass ﬁlter in

the RF transmit chain that is difﬁcult and costly to design.

The analog aliased sampling method in [10] can reduce the

sampling rate but it needs additional analog aliasing operation.

The spectral extrapolation based algorithm was reported in

[11] and in [12] a forward model was ﬁrst carried out and

then DPD coefﬁcients can be estimated. In [13] a two-stage

DPD, i.e., a static nonlinear box cascaded with a dynamic

weak nonlinear box, was proposed to decrease the feedback

bandwidth. The method proposed in [14] was designed just for

concurrent dual-band signals. All of the methods mentioned

above require the acquisition bandwidth not narrower than

the signal bandwidth. Contrarily, [15] proposed an algorithm

based on random demodulation, with which an ultra-narrow

feedback bandwidth is enough for wideband DPD, but it

requires an extra random sequence generator in the analog

domain which is hard to implement due to the cost and time-

alignment issue.

Besides the sampling rate, the other issue relating to ADC

is the resolution. Before training the DPD model, the output

signal of the PA is digitized. The number of quantization bits

depends on the actual system requirement. Usually in a real

system, a 14-bit ADC is needed to give a minimum noise ﬂoor

of -70 dBc [16]. Designing a 14-bit ADC with multi-GSPS is

very challenging and costly [17]. It is therefore desirable that

the required resolution can be reduced; however, this is not a

straightforward task, since reducing the resolution of ADC is

equivalent to increasing the noise ﬂoor of the feedback signal,

which is critical to the accuracy of DPD modeling. Y. Liu et

al. proposed a method in [18] to reduce the ADC dynamic

range, but a minimum 8-bit ADC is required for achieving

comparable linearization performance with the conventional

DPD. A 1-bit estimator was proposed to quantize the phase

of the original input signal in [19] to reduce the complexity

of model identiﬁcation while the resolution requirement for

ADCs remains the same.

In this paper, a novel direct learning architecture (DLA)

based 1-bit quantization method is proposed. The proposed

method utilizes only 1-bit resolution comparators to measure

the error signal that is then used for DPD coefﬁcients training.

The proposed approach dramatically reduces the cost of the

feedback chain. Moreover, both theoretical derivation and

experimental tests show that the proposed method can be

extended to the systems transmitting very wideband signals.

This paper is organized as follows. Section II introduces

the proposed 1-bit observation method after reviewing the

conventional direct learning architecture. In Section III, the

time alignment, power alignment, optimization of convergence

speed and the overall system complexity are discussed. The

experimental results are given in Section IV, followed by a

conclusion in Section V.

II. THEORETICAL DERIVATION

The principle of DPD is that a digital block, called predis-

torter, is inserted into the transmitter chain to preprocess the

input signal before it enters the RF PA. If the two nonlinear

systems, i.e., the predistorter and the PA, exactly invert each

other, a highly linear system can be achieved. In order to

extract the coefﬁcients of the predistorter, a small fraction

of the transmit signal is transferred back to baseband via

a feedback loop. Two architectures are generally employed

for model extraction: direct learning and indirect learning

architecture (IDLA). The difference between DLA and IDLA

has been investigated in [20]. The IDLA estimates the post-

inverse of the PA ﬁrst and then copies the coefﬁcients of the

post-inverse estimator to the pre-inverse one. The IDLA can

be run in an open-loop fashion. While the DLA is usually

used in closed-loop systems and it compares the PA output

with the original input directly. In low resolution systems, the

performance of IDLA is limited, while DLA is able to identify

the changes between input and output signals effectively,

especially in the 1-bit method we will propose in Section II.B.

As a result, the DLA is used for DPD modeling in this paper.

A. Conventional Direct Learning Architecture

The simpliﬁed conventional DLA block diagram is shown

in Fig. 1 [21], [22], where the bold lower-case vectors x and y

represents the input and output sequences, respectively. More

speciﬁcally, x and y are expressed as

x = [x(n − K + 1), x(n − K + 2), . . . , x(n)]

∈ C

K×1

y = [y(n −K + 1), y(n − K + 2), . . . , y(n)]

∈ C

K×1

(1)

where K is the length of the sequences used for training,

x(n) and y(n), n ∈ Z are baseband input and output signals,

respectively, and ()

denotes the matrix transpose. The output

of digital predistorter is denoted by z(n), and its corresponding

vector form is z. Various behavioral models can be used to

DPD PA

Model

coefficients

extraction

Fig. 1. Simpliﬁed DPD block diagram based on direct learning architecture.

describe the input-output relationship of the DPD [1]–[3]. For

instance, the baseband equivalent expression of Volterra model

is given by

z(n) =



p=1

p:odd



···



, . . . , m

)

(p+1)/2



l=1

x(n − m

)



l=1+(p+1)/2

∗

(n − m

(2)

where h

is the p-th order Volterra kernel, P and M are the

nonlinear order and memory depth, respectively, and (2) can

be rewritten in a matrix form as

z = Xh. (3)

In (3), each row of X ∈ C

K×L

consists of all of the product

terms appearing in (2), and h ∈ C

L×1

is the coefﬁcient vector

with the length of L. Let g() be the transfer function of PA,

then the output of PA can be expressed as

y = g(z) = g(Xh). (4)

The cost function of the DLA-based DPD system is the

norm of the difference between the output and input of

the system, i.e., ∥y − x∥

. Newtons method is one of the

most popular candidates that solve this kind of nonlinear

problem. To do so, the Jacobian and Hessian matrices, i.e.,

ﬁrst-order and second-order derivatives of the cost function,

are calculated ﬁrst. Then the DPD coefﬁcients can be updated

in an iterative procedure [11], [21], [22]:

k+1

= h

− µ





−1

(y − x), (5)

where ()

represents the Hermitian transpose, and the damp-

ing factor µ 6 1.

To achieve a relatively good performance using (5), one

needs high resolution of the feedback signal, e.g., 14-bit

ADC to digitize the output of PA, which is one of the main

bottlenecks for DPD applications in the next generation com-

munication systems. In the next subsection, we will discuss

the detail of the proposed novel 1-bit observation algorithm,

which exhibits comparative performance with the conventional

method.

B. Proposed 1-Bit Observation for Direct Learning Based

Digital Predistortion

In a DLA-based DPD system, the difference between the

output and input signals, y(n) − x(n), should be properly

measured sample by sample, as demonstrated in (5). Both

x(n) and y(n) are baseband complex values, consisting of

the in-phase and quadrature (I/Q) signals. They have the form

x(n) = x

(n) + j · x

(n),

y(n) = y

(n) + j · y

(n),

(6)

where x

(n), x

(n), y

(n) and y

(n) are all real values.

An arbitrary real number can be written in the way that its

sign multiplies its magnitude, i.e., a = sign(a) · |a|, a ∈ R.

If the magnitude information |a| is already known or can be

estimated in an easy way, sign(a) is the only thing that needs

to be measured to calculate the number a.

By deﬁning ∆

(n) = y

(n)−x

(n) and ∆

(n) = y

(n)−

(n) as the error samples for the real and imaginary parts,

respectively, the difference between the output and input can

be expressed as

y(n) − x(n) = (y

(n) − x

(n)) + j · (y

(n) − x

(n))

= sign (∆

(n)) |∆

(n)|

+ j · sign (∆

(n)) |∆

(n)|.

(7)

Because PA is a nonlinear device, without linearization, sig-

niﬁcant distortion can be introduced into the transmit signal,

especially if the PA is run into deep compression. In a real

application, however, e.g., LTE, the signal has non-constant

envelope and the amplitude of the signal follows a Gaussian-

like distribution. Only a small percentage of the signal with

high amplitudes is affected severely by the deep compression.

The magnitudes of the most error samples are relatively small,

compared to the original input. Furthermore, although |∆

(n)|

and |∆

(n)| could hardly be strictly equal, they have the

same statistical properties and during DPD training, the errors

decrease with the number of iterations and they both approach

zero when the training converges. In this work, during the

model training process, we assume that the magnitude of

the error sample I/Q can be approximately made equal to

an updating constant, namely, |∆

(n)| ≈ |∆

(n)| ≈ ˆc(n).

Equation (7) then becomes

y(n) − x(n) ≈ ˆc(n) (sign (∆

(n)) + j · sign (∆

(n)))

= ˆc(n)sign (∆(n)) ,

(8)

where ∆(n) = ∆

(n) + j ·∆

(n) and sign(∆(n)) calculates

the signs of real and imaginary parts of ∆(n) separately. The

vector form for (8) is given by

y − x = [∆(n − K + 1), ∆(n − K + 2), . . . , ∆(n)]

≈







ˆc(n − K + 1)sign (∆(n − K + 1))

ˆc(n)sign (∆(n))







≈ ˆc[sign (∆(n − K + 1)) , . . . , sign (∆(n))]

, ˆc · ∆

(9)

where ∆

is deﬁned as a column vector that consists of the

signs of each I/Q sample. By substituting (9) into (5), it yields

k+1

= h

− ˆc





−1

∆

. (10)

conventional

proposed

error

samples

proposed

-1

( appropriate )

( )

Fig. 2. Demonstration of the relationship between conventional DPD and the

proposed 1-bit DPD.

As it can be seen, the data matrix X is already known, and

ˆc

is treated as the step size for the k-th iteration. Note that

the damping factor µ in (5) is combined into ˆc

to simplify

the expression and this has no impact on the ﬁnal result. Only

the sign information of the error signal is thus needed for

conducting the calculation in (10). This enables using 1-bit

ADCs to digitize the error signal.

The difference between the proposed algorithm in (10) and

the conventional one in (5) is demonstrated in Fig. 2. The

grey dots are the error samples, and the circle in black line

denotes the objective of the conventional method with radius

equaling the root mean square (RMS) of magnitudes of the

error samples, while the two squares represent the targets of

the proposed method with different step sizes. In the proposed

algorithm, the error samples are approximately averaged to

the vertexes of the square, e.g., the error samples in the ﬁrst

quadrat are moved to the upper-right vertex of the square.

Equation (10) is similar to that used in the simultaneous

perturbation method [23], [24], where a Bernoulli process

is carried out to estimate the gradient. How to choose an

appropriate step size ˆc

is critical. If it is properly chosen,

(10) achieves comparative performance as (5). This issue will

be discussed in detail in Section III.

III. SYSTEM IMPLEMENTATION

A. System Description

The block diagram of the proposed 1-bit observation DPD

system is illustrated in Fig. 3. The main difference from

the conventional DPD is that, in the feedback path, after

demodulation, the analog I and Q signal is sent to a comparator

to compare with the original input, respectively, to obtain the

sign of the error signal, instead of being fully digitized. In this

conﬁguration, an additional digital to analog conversion path,

path 2 as highlighted in Fig. 3, is added to convert the original

digital I/Q to the analog domain to make the comparison. The

DPD QMod PA

QDmod

Attenuator

Comparators

(1-bit ADCs)

I: Time delay

estimate

II:DPD

training

I: Time-delay estimation mode

II: DPD training mode

delay

( )

x t

( )

x t

( )

y t

( )

y t

( )

sign

GND

DAC

Digital Domain

Analog Domain

( )

sign

path 2

path 1

SW-I-2

SW-II

SW-I-1 SW-I-2

SW-I-1

SW-II

Fig. 3. Proposed 1-bit observation DPD system.

comparators here are equivalent to the conventional ADCs

working with only 1-bit. The signs of the error signal are then

sent to the DPD training block for model extraction.

Before model extraction, time delay between the input and

output samples must be properly calibrated. In the conven-

tional system, time alignment is conducted in the digital

domain by comparing the input and output data samples.

In the proposed system, because only 1-bit comparators are

used, the high resolution output samples are not available. A

special time alignment methodology must be developed, which

will be discussed in the following subsection. To facilitate

time alignment, the sign of the output signal can be obtained

by using the existing comparators with the reference level

switched to ground, shown in Fig. 3.

Another issue is power alignment. In the conventional sys-

tem, power alignment is also done in the digital domain in both

conventional DLA-based and IDLA-based DPDs [25], [26]. In

the proposed system, power alignment must be carried out in

the analog domain, because only the input and output signal

levels are aligned properly, the sign of the error signal then

be obtained correctly. The attenuation level of the attenuator

thus must be properly chosen to ensure the powers between

input and output signals are aligned before they enter the

comparators. In real systems, some power control modules,

e.g., variable gain ampliﬁers (VGAs) [27], can be applied to

facilitate the implementation.

B. Time-Alignment Algorithm

Calculating cross-correlation between the input and output

signals in the time domain [28] for time alignment is a com-

mon approach in the conventional DPD training algorithms.

This is, however, not practical in the proposed system, since

only the signs of the output signal can be obtained. Directly

calculating the cross-correlation between the signs of the input

and output in the time domain will cause large errors. In this

paper, instead, we suggest to use the frequency domain based

algorithm to estimate the time delay [29], [30].

Fourier transform (FT) states that a delay in the time domain

is equivalent to a phase rotation in the frequency domain.

The time delay can thus be calculated from the measured

phase rotation in the frequency domain. For a given set of

time domain data samples, after discrete Fourier transform

(DFT), the phase-frequency relation is a simple linear function

expressed as

φ = s · f + b, (11)

where φ and f are phase rotation and frequency, respectively,

s is the slope which is directly proportional to the time delay,

b is a constant related to phase shift in the time domain. s and

b can be estimated by using the least squares (LS) algorithm

with the frequency domain data samples. Once the slope s is

obtained, the time delay is calculated as

delay

= −

N ˆs

2π

, (12)

where N is the total number of samples used for DFT

calculation, and ˆs is the estimated slope for s in (11).

The reason why the time domain cross-correlation does not

work in this case is because the signal amplitudes are only

at two levels. If we transform it into the frequency domain,

however, the signal power in in-band is still much higher than

the noise ﬂoor, despite of high quantization noise. This is

illustrated in Fig. 4 where the spectra of a LTE signal with

different time domain resolutions are given. To simplify the

illustration, only quantization noise is considered here. From

the ﬁgure, we can see that the noise ﬂoor increases while the

number of bits reduces. Despite the high noise ﬂoor with 1-

bit sampling, the signal power in in-band is higher than the

noise about 6 dB. If we use these in-band values to form the

equation in (11), we should be able to ﬁnd the slope s and

thus calculate the time delay between the input and output

1-bit Observation for Direct-Learning-Based Digital Predistortion of RF Power Amplifiers

Figures

Citations

Behavioral Modeling of RF Power Amplifiers Using Reduced Sampling Rate

A Manifold Regularization Approach for Low Sampling Rate Digital Predistortion With Band-Limited Feedback

Closed-Loop Sign Algorithms for Low-Complexity Digital Predistortion

Forward modeling assisted 1-bit data acquisition based model extraction for digital predistortion of RF power amplifiers

Inverse Covariance Matrix Estimation for Low-Complexity Closed-Loop DPD Systems: Methods and Performance

References

Analog-to-digital converter survey and analysis

Implementation of the simultaneous perturbation algorithm for stochastic optimization

Behavioral modeling and predistortion

Discrete time techniques for time delay estimation

An Overview of the Simultaneous Perturbation Method for Efficient Optimization

Related Papers (5)

Output-Controllable Partial Inverse Digital Predistortion for RF Power Amplifiers

Novel Technique for Wideband Digital Predistortion of Power Amplifiers With an Under-Sampling ADC

New digital predistortion technique of RF power amplifiers for wideband OFDM signals

Digital predistortion for RF power amplifiers: State of the art and advanced approaches

Improving DPD performance by compensating feedback loop impairments in RF ADCs

Frequently Asked Questions (1)

Q1. What are the contributions mentioned in the paper "1-bit observation for direct learning based digital predistortion of rf power amplifiers" ?