What is the main task in calculating (20)?

The main task in calculating (20) is to find two minimum Euclidean distances whose corresponding bit vectors have the $l$-th bit equal to 1 and 0, respectively.

How many cycles does a TSB take to generate the candidate list?

The TSB takes $\frac{1}{4}L_{\text{total}}$ cycles to generate the candidate list $\mathcal{L}$ by outputting a size-four candidate vector list $\mathcal{L}_i$ per clock cycle.

What is the function of the soft-output tree-search algorithm?

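The soft-output tree-search algorithm generates a list $\mathcal{L}$ of candidate vectors by going through the tree and finds the two elements of (6) within the list, i.e., $L(b_l \mid \mathbf{r}) \approx \min_{\mathbf{b} \in \mathcal{L} \cap \chi_l^0} \frac{1}{N_0}|\mathbf{r} - \mathbf{H}\mathbf{s}|^2 - \min_{\mathbf{b} \in \mathcal{L} \cap \chi_l^1} \frac{1}{N_0}|\mathbf{r} - \mathbf{H}\mathbf{s}|^2$.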

Why is the MRC in SM based on the diagonal property of the matrix R?

Due to the diagonal property of the equivalent channel matrix R in (19), this minima-search procedure is conducted for each real-valued scalar symbol independently, which is then equivalent to the symbol-level bit-flipping operation in the SM signal detection algorithm, i.e., (12).

Why is the performance degradation so low without bit-flipping?

Due to the multi-node extension, the performance degradation is minor without bit-flipping when signals are detected at the top layer.

What is the way to find the smallest bit-flipped symbol?

Instead of calculating the Euclidean distances of all $M/2$ possible bit-flipped symbols and finding the minimum with extensive comparison, the authors propose to observe the location of $s^{ML}$ in the constellation plane and then select $s_l^{\overline{ML}}$ with a simple boundary check.

What is the BER of the early-pruned FSD algorithm?

Associated with the corresponding constraint shapes plotted in Fig. 4, the authors observe that the early-pruned FSD algorithm (with bit-flipping scheme) performs better when the pruning parameter $\mathbf{L}_{2N-1}$ leads to a constraint that better approximates the circular-shaped admissible region.

What is the algorithm for detecting a tree?

Their algorithm offers better performance than other fixed-complexity tree-search detection algorithms with a much smaller candidate list size (e.g., $L_{\text{total}} = 16$ in their algorithm compared to $K = 64$ in K-Best detection and NL in LFSD).

What is the selection criteria in (13)?

The selection criterion in (13) finds the minimum of $|L^{BF}_{b_l}|$ and $|L^{FSD}_{b_l}|$, which is efficient in relieving the problem of getting the pseudo-minimum of (10), leading to a more accurate approximation of the MAP result.

(Open Access) VLSI Implementation of a Soft-Output Signal Detector for Multimode Adaptive Multiple-Input Multiple-Output Systems (2013) | Liang Liu

VLSI Implementation of a Soft-Output Signal Detector for Multi-Mode Adaptive MIMO Systems

Liu, Liang; Löfgren, Johan; Nilsson, Peter; Öwall, Viktor

Published in: IEEE Transactions on Very Large Scale Integration (VLSI) Systems

DOI: 10.1109/TVLSI.2012.2231706

2013

Citation for published version (APA):

Liu, L., Löfgren, J., Nilsson, P., & Öwall, V. (2013). VLSI Implementation of a Soft-Output Signal Detector for Multi-Mode Adaptive MIMO Systems. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, (12), 2262-2273. https://doi.org/10.1109/TVLSI.2012.2231706


VLSI Implementation of a Soft-Output Signal Detector for Multi-Mode Adaptive MIMO Systems

Liang Liu, Member, IEEE, Johan Löfgren, Student Member, IEEE, Peter Nilsson, Senior Member, IEEE, and Viktor Öwall, Member, IEEE

Abstract—This paper presents a multi-mode soft-output multiple-input multiple-output (MIMO) signal detector that is efficient in hardware cost and energy consumption. The detector is capable of dealing with spatial-multiplexing (SM), space-division-multiple-access (SDMA), and spatial-diversity (SD) signals of 4×4 antenna and 64-QAM modulation. Implementation-friendly algorithms, which reuse most of the mathematical operations in these three MIMO modes, are proposed to provide accurate soft detection information, i.e., log-likelihood ratio (LLR), with much reduced complexity. A unified reconfigurable VLSI architecture has been developed to eliminate the implementation of multiple detector modules. In addition, several block-level techniques, such as parallel metric update and fast bit-flipping, are adopted to enable a more efficient design. To evaluate the proposed techniques, we implemented the triple-mode MIMO detector in a 65-nm CMOS technology. The core area is 0.25 mm² with 83.7 K gates. The maximum detecting throughput is 1 Gb/s at 167-MHz clock frequency and 1.2-V supply, which achieves the data rate envisioned by the emerging long-term evolution advanced (LTE-A) standard. Under frequency-selective channels, the detector consumes 59.3 pJ, 10.5 pJ, and 169.6 pJ energy per bit detection in SM, SD, and SDMA modes, respectively.

Index Terms—Multiple-input multiple-output (MIMO), signal detector, soft-output, spatial-multiplexing (SM), spatial-diversity (SD), space-division-multiple-access (SDMA), very-large scale integration (VLSI).

I. INTRODUCTION

To meet the growing demands for better user experience, the International Telecommunication Union (ITU) has released its requirements for next-generation wireless networks, where much higher spectral efficiency, higher coverage, and lower latencies are expected [1]. It is broadly agreed that enhanced multiple-input multiple-output (MIMO) technologies play an essential role in emerging wireless standards, e.g., IEEE 802.16m (WiMAX Profile 2.0) [2] and 3GPP Long Term Evolution Advanced (3GPP LTE-A) (Release 10) [3], to achieve or exceed the International Mobile Telecommunications Advanced (IMT-A) target.

Cellular systems experience highly dynamic channel conditions, where the signal-to-noise ratio (SNR) and fading properties vary within huge ranges. To guarantee the quality of service (QoS) for users with a specified error rate and data throughput, it is necessary that the system is equipped with multiple MIMO technologies, which are dynamically adapted to the fluctuating channels. This is because single-mode MIMO schemes have shown their limitations in satisfying such requirements. For example, the widely-used spatial multiplexing (SM) technique [4] suffers from huge performance loss when the spatial channel becomes highly correlated [5].

L. Liu, J. Löfgren, P. Nilsson, and V. Öwall are with the Department of Electrical and Information Technology, Lund University, Lund, Sweden (email: {Liang.Liu, Johan.Lofgren, Peter.Nilsson, Viktor.Owall}@eit.lth.se).

Currently, extensive discussions are ongoing about the multi-mode adaptive MIMO schemes in 3GPP-LTE and WiMAX [6]. MIMO transmission techniques to be switched include SM [7], space-division-multiple-access (SDMA) [5], [8], and spatial diversity (SD) [9]. For such an adaptive system, multiple signal detectors are needed at the receiver side, each corresponding to one mode. A straightforward implementation strategy will incur considerable silicon area overhead and be immensely inefficient, since most of the modules would remain in an idle state for a large part of the time. As a consequence, an efficient implementation is expected to integrate multiple MIMO detectors into a single module, which can be reconfigured for the respective mode at run-time. Moreover, in real-life wireless systems, signal detectors are usually paired with channel decoders to provide robustness against noise and fading. Therefore, a detector should be capable of providing not only the binary estimate of each bit but also its reliability measurement, e.g., log-likelihood ratio (LLR) [7], to achieve further performance enhancement. Finally, the chip area and power consumption should be low enough for adoption in practical systems, especially for hand-held devices where high performance and flexibility need to be combined with energy efficiency. To the best of our knowledge, a VLSI implementation of such a reconfigurable multi-mode soft-output MIMO detector has not been reported in the open literature.

In an attempt to fill this gap, this paper proposes a soft-output signal detector that supports 64-QAM modulated SM/SDMA/SD triple-mode signals for up to 4×4 MIMO transmission. Furthermore, it achieves near maximum a posteriori probability (MAP) detection performance and provides gigabit-per-second throughput. The unification of multi-mode processing is mainly realized by algorithm-level exploitation, where the algorithms for each mode consist of similar mathematical operations to enable substantial hardware reuse. First, we develop soft-output detection algorithms for SM and SDMA modes based on an efficient extension and modification of the hard-output fixed-complexity sphere decoder (FSD) [10]. More specifically, we introduce a symbol-level bit-flipping scheme, which generates accurate LLR values with marginal hardware increment. Additionally, a polygon-shaped constraint technique is adopted to facilitate the reduction of unnecessary node extensions in the tree search procedure. For SD signal detection, e.g., Alamouti space-frequency block codes (SFBC) [11], [12], we propose a low-complexity MAP algorithm with a unified detection procedure that is independent of antenna number. It allows for parallel detection of the real and imaginary parts of each transmitted symbol with the help of QR decomposition of the orthogonal real-valued channel matrix. Taking advantage of these implementation-oriented algorithms, a unified VLSI architecture is subsequently developed, capable of being reconfigured to support different MIMO modes at run-time. To further improve the implementation efficiency, e.g., reduce the detection latency, we introduce a parallel metric update strategy, which processes multiple candidate vectors simultaneously for soft-value computation, and a fast bit-flipping scheme to select the bit-flipped symbol with simple boundary-check operations. To validate the effectiveness of the foregoing design solutions, we designed the proposed triple-mode soft-output signal detector using Synopsys tools with a 65-nm CMOS standard cell library. Occupying only 0.25 mm² core area (83.7K equivalent gate count), the detector achieves 1 Gb/s throughput in SM and SD modes with 4×4 64-QAM configuration, representing a 44% saving over the state of the art in terms of hardware efficiency. The throughput for detecting SDMA signals is 250 Mb/s. Working at frequency-selective channels, e.g., the extended vehicular A (EVA) channel specified in the LTE standard [13], the detector consumes 59.3 mW power in SM mode, resulting in a 59.3 pJ/bit energy consumption. The energy needed to detect a bit in SD and SDMA modes is 10.5 pJ and 169.6 pJ, respectively.
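The energy-per-bit figures follow from dividing power by throughput. As a quick arithmetic check (the symbols $E_{\text{bit}}$, $P$, and $R_b$ are introduced here only for illustration; the SD and SDMA power values are not stated in the text and are merely what the quoted energy and throughput figures would imply):

```latex
\begin{align*}
E_{\text{bit}} &= \frac{P}{R_b}, \\
E_{\text{bit}}^{\text{SM}} &= \frac{59.3\ \text{mW}}{1\ \text{Gb/s}} = 59.3\ \text{pJ/bit}, \\
P^{\text{SD}} &\approx 10.5\ \text{pJ/bit} \times 1\ \text{Gb/s} = 10.5\ \text{mW}, \\
P^{\text{SDMA}} &\approx 169.6\ \text{pJ/bit} \times 250\ \text{Mb/s} = 42.4\ \text{mW}.
\end{align*}
```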

The remainder of this paper is organized as follows: Section II briefly introduces the system model and soft-output MIMO signal detection. Section III describes and evaluates the proposed detection algorithms. Section IV shows the VLSI architecture and module circuit design. The implementation results and performance comparison are presented in Section V, and conclusions are drawn in Section VI.

II. BACKGROUND

A. System Model

As illustrated in Fig. 1, we consider a downlink switching SM/SDMA/SD MIMO system with one base station (BS) and $K$ user equipments (UEs). Both the BS and UEs are equipped with $N$ transmit and receive antennas. The received $N \times 1$ complex signal vector at the $k$-th UE is given by

$$\tilde{\mathbf{r}}_k = \tilde{\mathbf{H}}_k \sum_{k=1}^{K} \tilde{\mathbf{P}}_k \tilde{\mathbf{s}}_k + \tilde{\mathbf{n}}_k, \tag{1}$$

where $\tilde{\mathbf{s}}_k = [\tilde{s}_k^{(0)}, \ldots, \tilde{s}_k^{(L-1)}]^T$ is the $L$-layer transmitted vector for user $k$, in which each component is taken independently from a set of Gray-labeled $M$-QAM constellation points. Each symbol vector $\tilde{\mathbf{s}}_k$ is associated with a bit-level vector $\mathbf{b}_k$ (i.e., $\tilde{\mathbf{s}}_k = \text{MAP}(\mathbf{b}_k)$), which is obtained by applying error correction coding (ECC) to the original binary source. In (1), $\tilde{\mathbf{n}}_k$ is the vector of independent Gaussian noise samples with mean zero and variance $N_0/2$, $\tilde{\mathbf{H}}_k$ is the $N \times N$ complex channel matrix between the BS and the $k$-th UE, and $\tilde{\mathbf{P}}_k$ is the $N \times L$ pre-coding matrix, which is selected from a pre-defined code-book and is assumed to be known to both BS and UE [14]. The switch between different MIMO transmissions is realized by changing the matrix $\tilde{\mathbf{P}}_k$. Throughout this paper, we set $\tilde{\mathbf{P}}_k$ to be an $N \times N$ identity matrix ($\mathbf{I}_N$) in SM mode, while in SD mode $\tilde{\mathbf{P}}_k$ is an Alamouti coding matrix [12]. For an SDMA system, $\tilde{\mathbf{P}}_k$ is a unitary pre-coding matrix such that $\tilde{\mathbf{P}}_k^H \tilde{\mathbf{P}}_k = 1$ and $\tilde{\mathbf{P}}_l^H \tilde{\mathbf{P}}_k = 0$ for $l \neq k$, where $(\cdot)^H$ means Hermitian transposition. Moreover, we assume point-to-point transmission in SM and SD modes, i.e., $K = 1$ and $L = N$. Finally, $L$ is set to 1 in SDMA mode, because the number of layers per UE is limited to one in LTE [14].

Fig. 1. LTE downlink multi-mode MIMO transmission.

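The complex system can be transformed to its real-valued representation $\mathbf{r}_k = \mathbf{H}[\mathbf{s}_1, \ldots, \mathbf{s}_K]^T + \mathbf{n}_k$, where

$$
\begin{aligned}
\mathbf{r}_k &= [\Re(\tilde{r}_{k,1}), \Im(\tilde{r}_{k,1}), \ldots, \Re(\tilde{r}_{k,N}), \Im(\tilde{r}_{k,N})]^T \\
\mathbf{s}_k &= [\Re(\tilde{s}_{k,1}), \Im(\tilde{s}_{k,1}), \ldots, \Re(\tilde{s}_{k,L}), \Im(\tilde{s}_{k,L})]^T \\
\mathbf{n}_k &= [\Re(\tilde{n}_{k,1}), \Im(\tilde{n}_{k,1}), \ldots, \Re(\tilde{n}_{k,N}), \Im(\tilde{n}_{k,N})]^T
\end{aligned} \tag{2}
$$

and

$$
\mathbf{H}_k = \begin{bmatrix}
\Re(\tilde{h}_{1,1}) & -\Im(\tilde{h}_{1,1}) & \cdots & \Re(\tilde{h}_{1,N}) & -\Im(\tilde{h}_{1,N}) \\
\Im(\tilde{h}_{1,1}) & \Re(\tilde{h}_{1,1}) & \cdots & \Im(\tilde{h}_{1,N}) & \Re(\tilde{h}_{1,N}) \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
\Re(\tilde{h}_{N,1}) & -\Im(\tilde{h}_{N,1}) & \cdots & \Re(\tilde{h}_{N,N}) & -\Im(\tilde{h}_{N,N}) \\
\Im(\tilde{h}_{N,1}) & \Re(\tilde{h}_{N,1}) & \cdots & \Im(\tilde{h}_{N,N}) & \Re(\tilde{h}_{N,N})
\end{bmatrix}. \tag{3}
$$

In (2) and (3), $\tilde{h}_{i,j}$ is the $(i,j)$-th entry of the complex channel matrix, $\mathbf{H} = \mathbf{H}_k[\mathbf{P}_1, \ldots, \mathbf{P}_K]$ is the equivalent channel, $[\cdot]^T$ means vector transposition, and $\Re(\cdot)$ and $\Im(\cdot)$ represent the real and imaginary parts of a complex number, respectively.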
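The real-valued expansion in (2) and (3) can be checked numerically. The following is a minimal sketch in plain Python; the function names `to_real` and `matvec` and the example values are hypothetical and chosen only for illustration.

```python
def to_real(H, r):
    """Expand a complex matrix H and vector r into the real-valued form of (2)-(3)."""
    H_real, r_real = [], []
    for row, ri in zip(H, r):
        top, bottom = [], []
        for h in row:
            top += [h.real, -h.imag]    # row producing the real part of the output
            bottom += [h.imag, h.real]  # row producing the imaginary part
        H_real += [top, bottom]
        r_real += [ri.real, ri.imag]
    return H_real, r_real


def matvec(A, x):
    """Plain matrix-vector product for lists of lists."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]


# Noise-free 2x2 example (values are arbitrary).
H = [[1 + 2j, 0.5 - 1j], [-1j, 2 + 0j]]
s = [3 - 1j, -1 + 5j]
r = matvec(H, s)

H_real, r_real = to_real(H, r)
s_real = [v for z in s for v in (z.real, z.imag)]  # interleave Re/Im as in (2)
```

Multiplying `H_real` by `s_real` reproduces `r_real`, which is the sense in which the real-valued model is equivalent to the complex one.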

B. Soft-Output MIMO Signal Detection

Hard-output signal detectors try to recover the original vector $\mathbf{s}_k$, given $\mathbf{r}_k$ and $\mathbf{H}$, while the objective of a soft-output detector is to provide reliability information by computing the LLRs for each bit of $\mathbf{b}_k$. For the $l$-th bit, for example, we have

$$L(b_{k,l} \mid \mathbf{r}_k) = \ln \frac{P(b_{k,l} = 1 \mid \mathbf{r}_k)}{P(b_{k,l} = 0 \mid \mathbf{r}_k)} = L_E(b_{k,l} \mid \mathbf{r}_k) + L_A(b_{k,l}). \tag{4}$$

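In (4), $L_A(b_{k,l})$ is the a priori information and $L_E(b_{k,l} \mid \mathbf{r}_k)$ is the extrinsic information. For simplification, we will omit the user index $k$ in the following. According to [7], $L_E(b_l \mid \mathbf{r})$ can be rewritten as

$$L_E(b_l \mid \mathbf{r}) = \ln \frac{\sum_{\mathbf{b} \in \chi_l^1} P(\mathbf{r} \mid \mathbf{b}) \exp\!\left(\tfrac{1}{2}\mathbf{b}_{[l]}^T \mathbf{L}_{A,[l]}\right)}{\sum_{\mathbf{b} \in \chi_l^0} P(\mathbf{r} \mid \mathbf{b}) \exp\!\left(\tfrac{1}{2}\mathbf{b}_{[l]}^T \mathbf{L}_{A,[l]}\right)}, \tag{5}$$

where $\chi_l^1$ and $\chi_l^0$ are the sets of bit-level vectors having the $l$-th bit equal to 1 and 0, respectively, $\mathbf{b}_{[l]}$ denotes the sub-vector of $\mathbf{b}$ with the $l$-th bit $b_l$ being omitted, and $\mathbf{L}_{A,[l]}$ is the sub-vector of the a priori information vector $\mathbf{L}_A = [L_A(b_1), L_A(b_2), \ldots, L_A(b_{N \log_2 M})]^T$ omitting $L_A(b_l)$.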

The computation of (5) is usually simplified with the max-log approximation, yielding the maximum a posteriori probability (MAP) algorithm as

$$L(b_l \mid \mathbf{r}) \approx \min_{\mathbf{b} \in \chi_l^0} \frac{1}{N_0}|\mathbf{r} - \mathbf{H}\mathbf{s}|^2 - \min_{\mathbf{b} \in \chi_l^1} \frac{1}{N_0}|\mathbf{r} - \mathbf{H}\mathbf{s}|^2. \tag{6}$$

Note that the a priori information is not considered in (6), meaning that we do not take into account the turbo receiver scheme where the inner detector and the outer decoder exchange extrinsic information iteratively [7].
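To make the max-log rule in (6) concrete, the following is a minimal brute-force sketch in plain Python. It assumes a small real-valued system with 4-PAM symbols and a particular Gray labeling (`GRAY_4PAM`); both are illustrative choices, not the 64-QAM configuration of the paper, and exhaustive enumeration is exactly what the tree-search algorithms are meant to avoid.

```python
from itertools import product

# Assumed Gray labeling of 4-PAM (two bits per real-valued symbol).
GRAY_4PAM = {(0, 0): -3, (0, 1): -1, (1, 1): 1, (1, 0): 3}


def max_log_llr(r, H, n0):
    """Exhaustive max-log LLRs following (6): min over b_l=0 minus min over b_l=1."""
    n_sym = len(H[0])
    best = {}  # (bit index, bit value) -> smallest squared distance seen so far
    for bits in product((0, 1), repeat=2 * n_sym):
        s = [GRAY_4PAM[bits[2 * i:2 * i + 2]] for i in range(n_sym)]
        d = sum((ri - sum(h * x for h, x in zip(row, s))) ** 2
                for ri, row in zip(r, H))
        for l, b in enumerate(bits):
            best[(l, b)] = min(d, best.get((l, b), float("inf")))
    return [(best[(l, 0)] - best[(l, 1)]) / n0 for l in range(2 * n_sym)]


# Noise-free example on an identity channel: transmitted symbols are (3, -3).
llrs = max_log_llr(r=[3.0, -3.0], H=[[1.0, 0.0], [0.0, 1.0]], n0=1.0)
```

With the sign convention of (4), a positive LLR favors bit value 1, so the first bit here (symbol 3, label `(1, 0)`) receives a positive value and the remaining bits negative ones.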

From a hardware design perspective, tree-search algorithms [15]–[21] are promising alternatives to the direct implementation of (6) due to their effectiveness in confining the detection procedure within a much smaller search space. A tree-search algorithm formulates the detection as a $2N$-depth $\sqrt{M}$-ary tree search problem by rewriting the Euclidean distance as $|\mathbf{y} - \mathbf{R}\mathbf{s}|^2$, where $\mathbf{R}$ is an upper triangular matrix obtained by $\mathbf{H} = \mathbf{Q}\mathbf{R}$, $\mathbf{y} = \mathbf{Q}^T\mathbf{r}$, and $\mathbf{Q}$ is a unitary matrix. Starting from the top ($2N$-th) layer, the calculation of the Euclidean distance $T$ is carried out in a recursive way as

$$
\begin{aligned}
T_i &= T_{i+1} + \mathrm{inc}_i, \\
\mathrm{inc}_i &= \Big| y_i - \sum_{j=i+1}^{2N} R_{i,j} s_j - R_{i,i} s_i \Big|^2 = |\hat{y}_i - R_{i,i} s_i|^2,
\end{aligned} \tag{7}
$$

where $T_i$ is the partial Euclidean distance (PED) at the $i$-th layer. The soft-output tree-search algorithm generates a list $\mathcal{L}$ of candidate vectors by going through the tree and finds the two elements of (6) within the list, i.e.,

$$L(b_l \mid \mathbf{r}) \approx \min_{\mathbf{b} \in \mathcal{L} \cap \chi_l^0} \frac{1}{N_0}|\mathbf{r} - \mathbf{H}\mathbf{s}|^2 - \min_{\mathbf{b} \in \mathcal{L} \cap \chi_l^1} \frac{1}{N_0}|\mathbf{r} - \mathbf{H}\mathbf{s}|^2. \tag{8}$$

In tree-search detection, $\mathcal{L} \cap \chi_l^{1/0}$ can be empty. Under such circumstances, a constant value is usually adopted to indicate that $b_l$ equals 1 or 0 with a large probability [16]. In this paper, the breadth-first fixed-complexity sphere decoder (FSD) [10], [19] will be explored because of its low computational complexity, completely regular and feed-forward-only dataflow, and near-optimal performance.
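The PED recursion in (7) can be sketched as follows. This is a minimal illustration in plain Python that assumes $\mathbf{y}$ and the upper triangular $\mathbf{R}$ have already been obtained from a QR decomposition; the function name `ped_recursion` and the numbers are hypothetical.

```python
def ped_recursion(y, R, s):
    """Accumulate T_i = T_{i+1} + inc_i from the top layer (last row) downwards, as in (7)."""
    n = len(s)
    T = 0.0
    partial = []  # PED after each processed layer, top layer first
    for i in reversed(range(n)):
        # Interference from the symbols already fixed at the layers above.
        interference = sum(R[i][j] * s[j] for j in range(i + 1, n))
        inc = (y[i] - interference - R[i][i] * s[i]) ** 2
        T += inc
        partial.append(T)
    return T, partial


R = [[2.0, 1.0, 0.5],
     [0.0, 1.5, -1.0],
     [0.0, 0.0, 1.0]]
y = [1.0, 0.0, 2.0]
s = [1, -1, 3]

T, partial = ped_recursion(y, R, s)
# Direct evaluation of |y - R s|^2 for comparison.
direct = sum((yi - sum(a * b for a, b in zip(row, s))) ** 2 for yi, row in zip(y, R))
```

The final value of the recursion equals the full squared distance, and the partial values never decrease, which is the property tree-search detectors exploit when pruning.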

III. SOFT-OUTPUT DETECTION ALGORITHMS

In this section we will develop detection algorithms for SM, SDMA, and SD MIMO modes, respectively. These proposed algorithms feature low computational complexity and share similar mathematical operations that can be conveniently integrated into a single VLSI architecture. In detail, we focus on the modification of FSD for SM and SDMA modes. For SD mode, we propose an extensively simplified MAP algorithm by leveraging the orthogonality of Alamouti signals and the matrix-decomposition operations.

A. Low-Complexity LLR Generation Based on FSD

FSD divides the real-valued search tree into two parts using a parameter $D$. A full search is performed in the first $D$ layers, exhaustively expanding all $\sqrt{M}$ branches per node, while in the remaining $(2N - D)$ layers, a single search is adopted, expanding only the one best branch per node. It has been analyzed in [22] that FSD achieves close-to-ML performance if $(D + 1)^2 \geq 2N$. For example, $D = 2$ allows the FSD to present an asymptotical ML performance for a MIMO system with $N = 4$. However, FSD is more efficient in finding the ML solution in a hard-output scenario than in generating a list of vectors around the ML result, resulting in poor performance from a soft-output perspective [19]. In this section, we extend the original FSD to provide accurate soft values while maintaining its low computational complexity. For this purpose, we utilize a symbol-level bit-flipping scheme for performance improvement and a polygon-shaped constraint technique to reduce unnecessary node extensions.
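The two-phase FSD traversal described above can be sketched as follows. This is a simplified illustration in plain Python, not the hardware dataflow of the paper: it assumes a real-valued upper triangular `R`, a PAM `alphabet`, and a hypothetical function name `fsd_candidates`.

```python
def fsd_candidates(y, R, alphabet, D):
    """Full expansion in the first D layers (from the top), single best branch afterwards."""
    n = len(y)
    paths = [({}, 0.0)]  # (symbols chosen so far keyed by layer index, PED)
    for depth, i in enumerate(reversed(range(n))):
        extended = []
        for chosen, ped in paths:
            # Cancel interference from the layers already decided on this path.
            y_hat = y[i] - sum(R[i][j] * chosen[j] for j in range(i + 1, n))
            children = sorted(alphabet, key=lambda a: (y_hat - R[i][i] * a) ** 2)
            keep = children if depth < D else children[:1]
            for a in keep:
                inc = (y_hat - R[i][i] * a) ** 2
                extended.append(({**chosen, i: a}, ped + inc))
        paths = extended
    return [([c[i] for i in range(n)], ped) for c, ped in paths]


R = [[1.0, 0.5],
     [0.0, 1.0]]
y = [0.4, 2.8]
cands = fsd_candidates(y, R, alphabet=[-3, -1, 1, 3], D=1)
best_vector = min(cands, key=lambda c: c[1])[0]
```

The list size is fixed at `len(alphabet) ** D` regardless of the channel or noise, which is what gives FSD its regular, feed-forward structure.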

1) LLR Accuracy Improvement by Modified Bit-Flipping: Compared to the hard-output ML detection, i.e.,

$$\mathbf{s}^{ML} = \arg\min_{\mathbf{s} \in \Omega_{\sqrt{M}}^{2N}} |\mathbf{r} - \mathbf{H}\mathbf{s}|^2, \tag{9}$$

where $\Omega_{\sqrt{M}}$ denotes the real-valued $\sqrt{M}$-ary constellation, the soft-output detection consists of two minima search procedures, as demonstrated in (6). One of them is obtained by (9), which is then referred to as $T^{ML} = |\mathbf{r} - \mathbf{H}\mathbf{s}^{ML}|^2$. The other can then be formulated as

$$T_l^{\overline{ML}} = \min_{\mathbf{b} \in \chi_l^{\overline{ML}}} |\mathbf{r} - \mathbf{H}\mathbf{s}|^2, \tag{10}$$

in which $\chi_l^{\overline{ML}}$ is the set of bit vectors whose $l$-th bit is the binary complement of the $l$-th bit in the ML bit vector $\mathbf{b}^{ML}$. Basically, there are two major reasons that FSD tends to generate poor-quality LLRs. One is the occurrence of vacant bits in the candidate list $\mathcal{L}$ corresponding to $\chi_l^{\overline{ML}}$ (i.e., $\mathcal{L} \cap \chi_l^{\overline{ML}} = \emptyset$), which exists in most tree search algorithms. The other is that, even for the bits that are present, FSD cannot ensure the minimization of (10). This is because, unlike K-Best detection, where strict sorting is performed at every layer, FSD simply extends all nodes at the first $D$ layers and only one at the remaining layers. Such a tree traversal scheme does not guarantee the inclusion of the best vectors (i.e., vectors with the smallest Euclidean distances) in the candidate list.

To tackle these two issues with reasonable complexity overhead, we suggest a modified bit-flipping scheme that replaces the whole-vector re-calculation [23] with a per-symbol re-calculation scheme. Its basic idea is described as follows: when calculating the LLR $L(b_{i,l})$ for the $l$-th bit in the $i$-th scalar symbol $s_i$, the strategy is to first find the locally best symbol with the $l$-th bit value different from $b_{i,l}^{ML}$, i.e.,

$$s_{i,l}^{\overline{ML}} = \arg\min_{b_{i,l} \neq b_{i,l}^{ML}} |\hat{y}_i - R_{i,i} s_i|^2, \tag{11}$$

and then compute the bit-flipped LLR by

$$L^{BF}(b_{i,l} \mid y_i) = |\hat{y}_i - R_{i,i} s_i^{ML}|^2 - |\hat{y}_i - R_{i,i} s_{i,l}^{\overline{ML}}|^2 = \mathrm{inc}_i^{ML} - \mathrm{inc}_{i,l}^{\overline{ML}}. \tag{12}$$

In (11) and (12), $\hat{y}_i$ is the received symbol at the $i$-th layer with the interference from previously detected signals being canceled and $\mathbf{b}_i^{ML}$ is the bit-level vector corresponding to $s_i^{ML}$, which is denoted as the $i$-th scalar symbol of the ML vector. It should be pointed out that although the ML result is obtained by minimizing $|\mathbf{y} - \mathbf{R}\mathbf{s}|^2$, it does not promise a locally best result, i.e., $\mathrm{inc}_i^{ML}$ is not necessarily smaller than $\mathrm{inc}_{i,l}^{\overline{ML}}$. Therefore, the sign of $L^{BF}(b_{i,l} \mid y_i)$ should be adjusted to positive or negative according to the corresponding bit value of $b_{i,l}^{ML}$. It should also be mentioned that $\mathrm{inc}_i^{ML}$ in (12) has already been calculated during tree search, which can thus be reused in bit-flipping for hardware saving.

Fig. 2. Polygon-shaped constraint with $\mathbf{L}_{2N-1} = [5, 5, 3, 3, 1, 1, 0, 0]$.

So far, we may have two possible LLRs for each bit, which are acquired by FSD tree search ($L^{FSD}_{b_l}$) and the bit-flipping scheme presented above ($L^{BF}_{b_l}$). The final result is selected according to the magnitude of these two candidates:

$$
L(b_l) =
\begin{cases}
L^{BF}_{b_l}, & \text{if } |L^{BF}_{b_l}| \leq |L^{FSD}_{b_l}| \text{ or } \mathcal{L} \cap \chi_l^{\overline{ML}} = \emptyset \\
L^{FSD}_{b_l}, & \text{otherwise.}
\end{cases} \tag{13}
$$

The selection criterion in (13) finds the minimum of $|L^{BF}_{b_l}|$ and $|L^{FSD}_{b_l}|$, which is efficient in relieving the problem of getting the pseudo-minimum of (10), leading to a more accurate approximation of the MAP result.
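The per-symbol bit-flipping of (11) and (12) and the selection rule of (13) can be sketched for a single real-valued layer. This is a minimal illustration in plain Python under stated assumptions: the 8-PAM Gray labeling `GRAY3`, the convention that a positive LLR means bit value 1 (as in (4)), the use of `None` to signal an empty $\mathcal{L} \cap \chi_l^{\overline{ML}}$, and the function names are all hypothetical choices rather than details taken from the paper.

```python
PAM8 = [-7, -5, -3, -1, 1, 3, 5, 7]
# Assumed Gray labeling for the 8-PAM component of 64-QAM.
GRAY3 = [(0, 0, 0), (0, 0, 1), (0, 1, 1), (0, 1, 0),
         (1, 1, 0), (1, 1, 1), (1, 0, 1), (1, 0, 0)]


def bit_flip_llr(y_hat, r_ii, s_ml, l):
    """Bit-flipped LLR for bit l of one scalar symbol, following (11) and (12)."""
    b_ml = GRAY3[PAM8.index(s_ml)][l]
    flipped = [s for s, b in zip(PAM8, GRAY3) if b[l] != b_ml]
    s_bar = min(flipped, key=lambda s: (y_hat - r_ii * s) ** 2)  # (11)
    inc_ml = (y_hat - r_ii * s_ml) ** 2   # already available from the tree search
    inc_bar = (y_hat - r_ii * s_bar) ** 2
    magnitude = abs(inc_ml - inc_bar)     # magnitude of (12)
    return magnitude if b_ml == 1 else -magnitude  # sign set by the ML bit


def select_llr(llr_bf, llr_fsd):
    """Selection rule (13); llr_fsd is None when the FSD list has no counter-hypothesis."""
    if llr_fsd is None or abs(llr_bf) <= abs(llr_fsd):
        return llr_bf
    return llr_fsd


llr0 = bit_flip_llr(y_hat=2.6, r_ii=1.0, s_ml=3, l=0)
llr2 = bit_flip_llr(y_hat=2.6, r_ii=1.0, s_ml=3, l=2)
```

Here `bit_flip_llr` searches the flipped symbols exhaustively for clarity; the fast bit-flipping scheme mentioned in the introduction replaces that search with boundary checks.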

2) Complexity Reduction with Polygon-Shaped Constraint:

According to the analysis in Section III-A1, the exhaustive expansion at upper layers of the FSD tree introduces a lot of computational waste by including vectors with large Euclidean distances in the candidate list. To reduce such unnecessary visits to some nodes, we adopt the imbalanced-expansion technique proposed in [24] to find the list of vectors closer to the ML result. This technique is briefly repeated here for convenience of presentation.

The concept of imbalanced-expansion is to approximate the circular-shaped constraint in a sphere decoder [17] with a polygon-shaped constraint, as illustrated in Fig. 2. The polygon constraint is realized by introducing an extension number limitation $L_i^m$, by which only the $L_i^m$ best nodes are extended from the $m$-th father node at the $i$-th ($i > 2N - D$) layer. The detailed explanation can be found in [24], where a smaller $L_i^m$ is set for the node with larger PED, so that more branches are expanded from more reliable nodes and fewer from less reliable ones. Moreover, considering that the FSD performs full extension only at the first two real-valued tree-search layers for a 4 × 4 system, $L_i^m$ is applied only at the $(2N-1)$-th layer, i.e., $i = 2N - 1$ (the constraint on the top layer is accomplished by setting the corresponding $L_{2N-1}^m$ to 0).

Compared to the radius constraint, the polygon-shaped constraint is more efficient from a hardware implementation perspective. Firstly, with a radius constraint $r$, a node violating the constraint will not be pruned until its PED is completely calculated and compared with $r$. In contrast, the polygon-shaped constraint with extension number limitation prunes less reliable paths early, before PED calculations, by using the well-known zigzag enumeration technique [25]. Moreover, the number of nodes to be extended is fixed with a given constraint $\mathbf{L}_{2N-1} = [L_{2N-1}^1, \cdots, L_{2N-1}^{\sqrt{M}}]$, which is not the case for radius-constrained algorithms, where the node extension number is variable depending on the channel and the noise. Therefore, the polygon-constraint algorithm has a very regular data flow and the corresponding control circuitry can be significantly simplified. Finally, the proposed scheme is convenient for tuning the complexity-performance tradeoff by setting $L_{2N-1}^{\text{total}}$ to a smaller/larger number.
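The early pruning described above can be sketched as follows for the top two real-valued layers. This is a minimal illustration in plain Python, assuming a uniformly spaced PAM alphabet, nonzero diagonal entries of `R`, and father nodes ranked by their top-layer PED; the names `zigzag` and `polygon_expand` are hypothetical, and the ordering routine is a zigzag-style enumeration written for clarity rather than the specific method of [25].

```python
def zigzag(center, alphabet, count):
    """Return up to `count` alphabet points in order of increasing distance from center."""
    lo, hi, step = alphabet[0], alphabet[-1], alphabet[1] - alphabet[0]
    nearest = min(max(lo + step * round((center - lo) / step), lo), hi)
    order, left, right = [nearest], nearest - step, nearest + step
    while len(order) < count and (left >= lo or right <= hi):
        # Step outwards, alternating sides according to which neighbor is closer.
        if right <= hi and (left < lo or abs(right - center) < abs(left - center)):
            order.append(right)
            right += step
        else:
            order.append(left)
            left -= step
    return order[:count]


def polygon_expand(y, R, alphabet, limits):
    """Expand the top two layers; the m-th best father keeps only limits[m] children."""
    top = len(y) - 1
    fathers = zigzag(y[top] / R[top][top], alphabet, len(alphabet))  # best father first
    candidates = []
    for s_top, limit in zip(fathers, limits):
        y_hat = y[top - 1] - R[top - 1][top] * s_top  # interference cancellation
        for s_next in zigzag(y_hat / R[top - 1][top - 1], alphabet, limit):
            candidates.append((s_next, s_top))
    return candidates


PAM8 = [-7, -5, -3, -1, 1, 3, 5, 7]
R = [[1.0, 0.5],
     [0.0, 1.0]]
y = [0.4, 2.8]
cands = polygon_expand(y, R, PAM8, limits=[5, 5, 3, 3, 1, 1, 0, 0])
```

With the limits quoted in the caption of Fig. 2, the number of extended nodes is the sum of the limits (18 here) for every received vector, and the two least reliable father nodes are never extended because their limit is 0.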

3) Application to SDMA Mode: The aforementioned FSD detection was originally developed for SM signals. In the following, the algorithm is modified for the SDMA mode. Detecting downlink SDMA signals is unique in that only signals dedicated to the $k$-th user (i.e., $\tilde{s}_k$) are reserved, while the signals intended for other users (i.e., $\tilde{s}_l$, $l \neq k$) are discarded after detection [26]. To take full advantage of this feature, we add a layer-reordering step to the pre-processing stage of FSD such that the desired signal $\tilde{s}_k$ is moved to the top layer of the FSD search tree, where multiple candidates are extended. The reordering is accomplished by a permutation matrix $\mathbf{W}_k$ which moves the $k$-th column of the channel matrix $\tilde{\mathbf{H}}$ to the last position, i.e., $\mathbf{W}_k = [\mathbf{w}_1, \ldots, \mathbf{w}_{k-1}, \mathbf{w}_{k+1}, \ldots, \mathbf{w}_N, \mathbf{w}_k]$, where $\mathbf{w}_i$ denotes an $N \times 1$ vector whose $i$-th element is one, and zeros elsewhere. Therefore, the system model can be rewritten as

$$\tilde{\mathbf{r}}_k = \tilde{\mathbf{H}} \mathbf{W}_k \mathbf{W}_k^T \tilde{\mathbf{s}} + \tilde{\mathbf{n}}_k = \tilde{\mathbf{H}}' \tilde{\mathbf{s}}' + \tilde{\mathbf{n}}_k, \tag{14}$$

where $\tilde{\mathbf{s}}' = [\tilde{s}_1, \ldots, \tilde{s}_{k-1}, \tilde{s}_{k+1}, \ldots, \tilde{s}_K, \tilde{s}_k]^T$ is the transmit vector with $\tilde{s}_k$ being moved to the last position. Taking the reordered channel matrix $\tilde{\mathbf{H}}'$ as an input, the imbalanced-FSD tree search in Section III-A2 is then conducted to get a list of candidate vectors, based on which the LLRs corresponding to $\tilde{s}_k$ are computed.

Since multiple candidates of $\tilde{s}_k$ are extended at upper layers, it can be expected that $\tilde{s}_k$ has a good diversity in its bit values in the final list. Moreover, the single-extension at the remaining layers attaches only the best node of $\tilde{s}_l$, $l \neq k$, to the candidates of $\tilde{s}_k$. Hence, it is highly likely that the final list contains the actual minimum of (10) for the bits corresponding to $\tilde{s}_k$. In view of the above analysis, the candidate list obtained by the layer-reordered FSD tree search is good enough to generate high-quality soft values for $\tilde{s}_k$. Thereby, we can turn off the bit-flipping operation in SDMA mode in order to reduce power consumption.
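The layer reordering can be sketched as a column permutation that leaves the received signal unchanged. This is a minimal illustration in plain Python; the function name `reorder_for_user`, the 0-based user index, and the integer example values are assumptions made for the sketch.

```python
def reorder_for_user(H, s, k):
    """Move column k of H (and entry k of s) to the last position, as W does in (14)."""
    order = [i for i in range(len(s)) if i != k] + [k]
    H_re = [[row[i] for i in order] for row in H]
    s_re = [s[i] for i in order]
    return H_re, s_re


def matvec(A, x):
    """Plain matrix-vector product for lists of lists."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]


H = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 9]]
s = [1, -1, 2]

H_re, s_re = reorder_for_user(H, s, k=0)
```

Because the same permutation is applied to the columns of the channel and to the symbol vector, `matvec(H_re, s_re)` equals `matvec(H, s)`; only the order in which the tree search visits the layers changes, placing the desired user's symbol at the top layer.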
