GPU Implementation of a Programmable Turbo Decoder for Software Defined Radio Applications
Citations
37 citations
Cites methods from "GPU Implementation of a Programmabl..."
...Other windowing techniques include the Previous Iteration Value Initialization (PIVI) technique of [39], [41], which is also known as State-Metric Propagation (SMP) [42]....
[...]
14 citations
Cites background from "GPU Implementation of a Programmabl..."
...General Purpose computing capable Graphics Processor Units (GPGPU) [7] often consist of a large number of parallel processing elements with reduced dynamic scheduling support....
[...]
11 citations
Cites background from "GPU Implementation of a Programmabl..."
...dedicated hardware circuits on communication devices, the evolution of general purpose processors in terms of energy efficiency and parallelism (vector processing, number of cores, ...) drives a growing interest for software ECC implementations (e.g. LDPC decoders [1]–[3], Turbo decoders [4], [5])....
[...]
References
1,511 citations
"GPU Implementation of a Programmabl..." refers background in this paper
...II showcases the speedup achieved using the GPU over an implementation done purely on the CPU for both Max Log-MAP and Full Log-MAP implementations....
[...]
...The GPU architecture differs significantly from that of a CPU [9]....
[...]
...For a Max Log-MAP turbo decoder with 5 iterations, the GPU implementation with 96 parallel sub-blocks is more than an order of magnitude faster than the CPU implementation....
[...]
...The C code run on the CPU is compiled using gcc with the -O3 optimization flag and is single-threaded, i.e., it does not utilize any parallelism on multiple CPU cores....
[...]
...A speedup of more than an order of magnitude over an implementation done purely on the CPU has been achieved....
[...]
1,334 citations
"GPU Implementation of a Programmabl..." refers background in this paper
...Four different kinds of device memories are presented to the programmer [10]....
[...]
672 citations
"GPU Implementation of a Programmabl..." refers background in this paper
...To obtain high throughput on the GPU, an architecture-aware [8] mapping of the algorithm is paramount....
[...]
285 citations
"GPU Implementation of a Programmabl..." refers background in this paper
...where f1 and f2 satisfy several properties detailed in [6]....
[...]