Proceedings ArticleDOI

Non-uniform DFT implementation for channel simulations in GPU

TL;DR: A parallel scan based method is proposed to speed up channel simulation in wireless link-level OFDM network simulators without restricting the scope of the simulations, and the DFT properties in the scan method are utilized to reduce register usage and hence the computation overhead of sine and cosine values.
Abstract: Channel simulation in wireless link-level OFDM network simulators involves a computationally intensive non-uniform discrete Fourier transform. In this paper, we propose a parallel scan based method to speed up this computation in GPU without restricting the scope of the simulations. We further utilize the DFT properties in the scan method to reduce register usage and hence the computation overhead of sine and cosine values. This technique is compared against a method that saves computation by using uniform power delay profiles at the cost of generality, and we show that the performance is competitive. For a single DFT, up to 19x speedup over a CPU implementation is observed using the scan based approach. For a simulation with 512 channels and a 1024 point DFT, the scan method gives a speedup of 141x with respect to the CPU, which compares favourably to the more restrictive uniform PDP method.
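The computation at the core of the paper is the channel frequency response of a tapped delay line, which is a one-dimensional non-uniform DFT: H(f_k) = Σ_l g_l · e^(−j2π f_k τ_l), where the tap delays τ_l need not fall on a uniform grid. The following NumPy sketch shows the direct (matrix-vector) form of this computation; the tap gains, delays, and subcarrier spacing are illustrative assumptions, not values from the paper, and the scan-based GPU optimisation described in the abstract is not reproduced here.

```python
import numpy as np

def nudft_channel_response(gains, delays, freqs):
    """Non-uniform DFT of a tapped-delay-line channel.

    H[k] = sum_l gains[l] * exp(-j * 2*pi * freqs[k] * delays[l])

    gains  : complex tap gains g_l
    delays : tap delays tau_l in seconds (need not lie on a uniform grid)
    freqs  : subcarrier frequencies f_k in Hz
    """
    # Outer product of frequencies and delays gives the phase matrix; on a GPU,
    # each (k, l) phase term could be handled by an independent thread.
    phase = -2j * np.pi * np.outer(freqs, delays)
    return np.exp(phase) @ gains

# Illustrative example (all values made up for demonstration only)
rng = np.random.default_rng(0)
delays = np.array([0.0, 0.3e-6, 1.1e-6, 2.4e-6])   # non-uniform tap delays
gains = (rng.standard_normal(4) + 1j * rng.standard_normal(4)) / np.sqrt(2)
freqs = np.arange(1024) * 15e3                      # 1024 subcarriers, 15 kHz spacing
H = nudft_channel_response(gains, delays, freqs)    # shape (1024,)
```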
Citations
Proceedings ArticleDOI
01 Aug 2017
TL;DR: The frequency domain representation of the commonly accepted Tapped Delay Line (TDL) model is discussed and three transformation algorithms are evaluated in order to develop an efficient 3GPP compliant method for simulating multiple independent fading radio channels in a software defined E-UTRAN traffic generator.
Abstract: The purpose of the study is to develop an efficient 3GPP compliant method to simulate multiple independent fading radio channels in a software defined Evolved Universal Terrestrial Radio Access Network (E-UTRAN) traffic generator. In this paper, the frequency domain representation of the commonly accepted Tapped Delay Line (TDL) model is discussed and three transformation algorithms are evaluated. The effects of the multipath fading channel are applied to the signal at the level of the Orthogonal Frequency Division Multiplexing (OFDM) transmitter prior to the IFFT stage. Models 0 and 1 are based on the Digital Fourier Transform (DFT) of the TDL with and without consideration of the Intercarrier Interference (ICI) phenomenon. Model 2 is a novel method that extends the quasi-stationary model with a low-cost linear approximation of ICI applied directly in the frequency domain in order to gain overall accuracy with small computational effort. When limiting the ICI term to 16 neighboring subcarriers, Model 2 exhibits a 12 dB SNR improvement compared to the stationary model and offers an execution time advantage compared to the TDL model when the number of terminals sharing radio resources is high.
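In its quasi-stationary form (ICI neglected), the frequency-domain TDL model described above amounts to multiplying each subcarrier of the OFDM symbol by the channel frequency response before the IFFT stage. A minimal NumPy sketch of that step follows; the subcarrier count, spacing, and tap values are assumptions for illustration, and the linear ICI approximation of Model 2 is not included.

```python
import numpy as np

def apply_tdl_frequency_domain(X, gains, delays, subcarrier_spacing):
    """Multiply one OFDM symbol by the TDL channel frequency response
    (quasi-stationary model: Doppler variation within the symbol and ICI
    are neglected).

    X      : frequency-domain symbols, one per subcarrier
    gains  : complex tap gains of the TDL
    delays : tap delays in seconds
    """
    n = X.shape[0]
    freqs = np.arange(n) * subcarrier_spacing
    # Channel frequency response at each subcarrier: a non-uniform DFT of the taps
    H = np.exp(-2j * np.pi * np.outer(freqs, delays)) @ gains
    return H * X  # applied at the transmitter, prior to the IFFT stage

# Illustrative use with made-up parameters: QPSK on 1024 subcarriers, 15 kHz spacing
rng = np.random.default_rng(1)
X = (rng.choice([-1.0, 1.0], 1024) + 1j * rng.choice([-1.0, 1.0], 1024)) / np.sqrt(2)
gains = np.array([0.8, 0.5 + 0.2j, 0.3j])
delays = np.array([0.0, 0.5e-6, 1.6e-6])
Y = apply_tdl_frequency_domain(X, gains, delays, 15e3)
```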

6 citations


Cites background or methods from "Non-uniform DFT implementation for ..."

  • ...Equation (8) may be reduced to 1-D non-uniform DFT to compute the vector of diagonal elements [11]:...

    [...]

  • ...A non-uniform DFT based approach was used in [11] to implement frequency-domain channel simulator in GPU....

    [...]

Proceedings ArticleDOI
01 Sep 2017
TL;DR: The research shows that direct frequency domain linear ICI approximation, represented by the frequency domain model 2, offers good accuracy in terms of ICI synthesis for the most practical Doppler frequencies and simultaneously requires fewer operations if the number of simulated UEs is large.
Abstract: This paper presents a recent result of the study on an efficient 3GPP compliant method to simulate multiple independent multipath fading radio channels in a multi-UE (User Equipment) software defined Evolved Universal Terrestrial Radio Access Network (E-UTRAN) traffic generator. In the previous research [1], the authors discussed Digital Fourier Transform (DFT) based frequency domain simulation models of a multipath fading channel, including aspects of terminal mobility and Intercarrier Interference (ICI). In this paper, previously published simulation results, showing the accuracy and efficiency of the models, are confirmed using mathematical analysis of the ICI phenomenon, considering the nature of the Orthogonal Frequency Division Multiple Access (OFDMA) schemes adopted by the 3GPP Long Term Evolution (LTE) standard. The complexity of the frequency domain models is evaluated in detail using Big O notation and compared against the time domain approach. The research shows that direct frequency domain linear ICI approximation, represented by frequency domain model 2, offers good accuracy in terms of ICI synthesis for the most practical Doppler frequencies and simultaneously requires fewer operations if the number of simulated UEs is large.

1 citation


Cites background from "Non-uniform DFT implementation for ..."

  • ...Comparing to other similar existing frameworks [3]–[5], proposed solution takes into consideration consequences of wireless channel non-stationarity and practical aspects of multiuser scenarios....

    [...]

References
Proceedings ArticleDOI
25 Jul 1995
TL;DR: The authors present the MMSE and LS estimators and a method for modifications compromising between complexity and performance, and the symbol error rate for a 16-QAM system is presented by means of simulation results.
Abstract: The use of multi-amplitude signaling schemes in wireless OFDM systems requires the tracking of the fading radio channel. The paper addresses channel estimation based on time-domain channel statistics. Using a general model for a slowly fading channel, the authors present the MMSE and LS estimators and a method for modifications compromising between complexity and performance. The symbol error rate for a 16-QAM system is presented by means of simulation results. Depending upon estimator complexity, up to 4 dB in SNR can be gained over the LS estimator.
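For context, the LS estimate divides the received pilot symbols by the known transmitted symbols, while the MMSE (Wiener) estimator smooths that estimate using the channel correlation and the noise level. The sketch below shows a simplified textbook form of both estimators, assuming the channel frequency-correlation matrix R_hh is known; it is not the specific low-complexity modification proposed in the cited paper, and all numerical values are illustrative.

```python
import numpy as np

def ls_estimate(Y, X):
    """Least-squares channel estimate at known (pilot) positions: H_ls = Y / X."""
    return Y / X

def lmmse_estimate(H_ls, R_hh, noise_var, signal_power=1.0):
    """Simplified linear MMSE smoothing of the LS estimate:

        H_mmse = R_hh (R_hh + (noise_var / signal_power) I)^{-1} H_ls

    R_hh is the channel frequency-correlation matrix, assumed known or modelled.
    """
    n = R_hh.shape[0]
    A = R_hh + (noise_var / signal_power) * np.eye(n)
    return R_hh @ np.linalg.solve(A, H_ls)

# Toy usage (purely illustrative): a two-tap channel observed in noise at 64 pilots,
# smoothed with an assumed exponential correlation model.
n = 64
k = np.arange(n)
H_true = np.exp(-2j * np.pi * 3 * k / n) + 0.5 * np.exp(-2j * np.pi * 7 * k / n)
X = np.ones(n, dtype=complex)                      # known pilot symbols
noise_var = 0.1
Y = H_true * X + np.sqrt(noise_var / 2) * (np.random.randn(n) + 1j * np.random.randn(n))
R_hh = np.array([[np.exp(-abs(i - j) / 8.0) for j in range(n)] for i in range(n)], dtype=complex)
H_hat = lmmse_estimate(ls_estimate(Y, X), R_hh, noise_var)
```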

1,647 citations


"Non-uniform DFT implementation for ..." refers methods in this paper

  • ...The baseband OFDM system [3] is shown in Figure 1, where x is the transmitted symbol, g(t) is the channel impulse response, ñ(t) is additive white Gaussian noise and y is the received symbol....

    [...]

Book
31 Dec 2012
TL;DR: Programming Massively Parallel Processors: A Hands-on Approach introduces both students and professionals to the basic concepts of parallel programming and GPU architecture, and explores in detail various techniques for constructing parallel programs.
Abstract: Programming Massively Parallel Processors: A Hands-on Approach shows both student and professional alike the basic concepts of parallel programming and GPU architecture. Various techniques for constructing parallel programs are explored in detail. Case studies demonstrate the development process, which begins with computational thinking and ends with effective and efficient parallel programs. Topics of performance, floating-point format, parallel patterns, and dynamic parallelism are covered in depth. This best-selling guide to CUDA and GPU parallel programming has been revised with more parallel programming examples, commonly-used libraries such as Thrust, and explanations of the latest tools. With these improvements, the book retains its concise, intuitive, practical approach based on years of road-testing in the authors' own parallel computing courses.

Updates in this new edition include:

  • New coverage of CUDA 5.0, improved performance, enhanced development tools, increased hardware support, and more

  • Increased coverage of related technology, OpenCL, and new material on algorithm patterns, GPU clusters, host programming, and data parallelism

  • Two new case studies (on MRI reconstruction and molecular visualization) that explore the latest applications of CUDA and GPUs for scientific research and high-performance computing

Table of Contents: 1 Introduction; 2 History of GPU Computing; 3 Introduction to Data Parallelism and CUDA C; 4 Data-Parallel Execution Model; 5 CUDA Memories; 6 Performance Considerations; 7 Floating-Point Considerations; 8 Parallel Patterns: Convolution; 9 Parallel Patterns: Prefix Sum; 10 Parallel Patterns: Sparse Matrix-Vector Multiplication; 11 Application Case Study: Advanced MRI Reconstruction; 12 Application Case Study: Molecular Visualization and Analysis; 13 Parallel Programming and Computational Thinking; 14 An Introduction to OpenCL; 15 Parallel Programming with OpenACC; 16 Thrust: A Productivity-Oriented Library for CUDA; 17 CUDA FORTRAN; 18 An Introduction to C++ AMP; 19 Programming a Heterogeneous Computing Cluster; 20 CUDA Dynamic Parallelism; 21 Conclusions and Future Outlook; Appendix A: Matrix Multiplication Host-Only Version Source Code; Appendix B: GPU Compute Capabilities
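The parallel prefix-sum (scan) pattern covered in this book is what the paper under discussion cites for generating the rows of the twiddle-factor matrix (e^(−jθ), e^(−j2θ), ...) from a single sine/cosine evaluation per row. A minimal NumPy sketch of that idea is below, with np.cumprod standing in for the GPU scan kernel; it illustrates the pattern only and is not the paper's CUDA implementation.

```python
import numpy as np

def twiddle_row(theta, n):
    """Return e^{-j*theta}, e^{-j*2*theta}, ..., e^{-j*n*theta} via a prefix product."""
    step = np.exp(-1j * theta)           # the only transcendental evaluation for the row
    # Inclusive scan (prefix product) of a constant array of `step`;
    # on a GPU this maps onto a work-efficient parallel scan with complex multiplies.
    return np.cumprod(np.full(n, step))

row = twiddle_row(0.01, 8)
# Reference: direct evaluation with one complex exponential per element
ref = np.exp(-1j * 0.01 * np.arange(1, 9))
assert np.allclose(row, ref)
```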

1,594 citations

Journal ArticleDOI
TL;DR: This comprehensive text/reference provides a foundation for the understanding and implementation of parallel programming skills which are needed to achieve breakthrough results by developing parallel applications that perform well on certain classes of Graphic Processor Units (GPUs).
Abstract: Programming Massively Parallel Processors: A Hands-on Approach, David Kirk and Wen-mei Hwu, ISBN: 978-0-12-381472-2, Copyright 2010.

Introduction: This book is designed for graduate/undergraduate students and practitioners from any science and engineering discipline who use computational power to further their field of research. This comprehensive text/reference provides a foundation for the understanding and implementation of parallel programming skills which are needed to achieve breakthrough results by developing parallel applications that perform well on certain classes of Graphic Processor Units (GPUs). The book guides the reader to experience programming by using an extension to the C language, CUDA, which is a parallel programming environment supported on NVIDIA GPUs and emulated on less parallel CPUs. Given that parallel programming on any high performance computer is complex and requires knowledge about the underlying hardware in order to write an efficient program, it becomes an advantage of this book over others to be specific toward a particular hardware. The book takes the reader through a series of techniques for writing and optimizing parallel programs for several real-world applications. Such experience opens the door for the reader to learn parallel programming in depth.

Outline of the Book: Kirk and Hwu effectively organize and link a wide spectrum of parallel programming concepts by focusing on practical applications, in contrast to most general parallel programming texts that are mostly conceptual and theoretical. The authors are both affiliated with NVIDIA; Kirk is an NVIDIA Fellow and Hwu is principal investigator for the first NVIDIA CUDA Center of Excellence at the University of Illinois at Urbana-Champaign. Their coverage in the book can be divided into four sections. The first part (Chapters 1–3) starts by defining GPUs and their modern architectures and later provides a history of graphics pipelines and GPU computing. It also covers data parallelism, the basics of the CUDA memory/threading models, the CUDA extensions to the C language, and the basic programming/debugging tools. The second part (Chapters 4–7) enhances student programming skills by explaining the CUDA memory model and its types, strategies for reducing global memory traffic, the CUDA threading model and granularity (which include thread scheduling and basic latency hiding techniques), GPU hardware performance features, techniques to hide latency in memory accesses, floating point arithmetic, modern computer system architecture, and the common data-parallel programming patterns needed to develop a high-performance parallel application. The third part (Chapters 8–11) provides a broad range of parallel execution models and parallel programming principles, in addition to a brief introduction to OpenCL. It also includes a wide range of application case studies, such as advanced MRI reconstruction and molecular visualization and analysis. The last chapter (Chapter 12) discusses the great potential for future architectures of GPUs. It provides commentary on the evolution of memory architecture, kernel execution control, and programming environments.

Summary: In general, this book is well-written and well-organized. A lot of difficult concepts related to parallel computing areas are easily explained, from which beginners or even advanced parallel programmers will benefit greatly. It provides a good starting point for beginning parallel programmers who can access a Tesla GPU.
The book targets specific hardware and evaluates performance based on this specific hardware. As mentioned in the book, approximately 200 million CUDA-capable GPUs have been actively in use; therefore, the chances are that a lot of beginning parallel programmers can have access to a Tesla GPU. Also, this book gives clear descriptions of the Tesla GPU architecture, which lays a solid foundation for both beginning and experienced parallel programmers. The book can also serve as a good reference book for advanced parallel computing courses.

Jie Cheng, University of Hawaii Hilo

1,511 citations


"Non-uniform DFT implementation for ..." refers methods in this paper

  • ...A second kernel uses parallel scan method [8] to get all the rows of the twiddle factor matrices (e^(−j1θ), e^(−j2θ), ....

    [...]

Journal ArticleDOI
01 Jan 2001
TL;DR: The automatically tuned linear algebra software (ATLAS) project is described, as well as the fundamental principles that underlie it, with the present emphasis on the basic linear algebra subprograms (BLAS), a widely used, performance-critical, linear algebra kernel library.
Abstract: This paper describes the automatically tuned linear algebra software (ATLAS) project, as well as the fundamental principles that underlie it. ATLAS is an instantiation of a new paradigm in high performance library production and maintenance, which we term automated empirical optimization of software (AEOS); this style of library management has been created in order to allow software to keep pace with the incredible rate of hardware advancement inherent in Moore's Law. ATLAS is the application of this new paradigm to linear algebra software, with the present emphasis on the basic linear algebra subprograms (BLAS), a widely used, performance-critical, linear algebra kernel library.

1,302 citations

01 Jan 2000
TL;DR: This paper describes the ATLAS (Automatically Tuned Linear Algebra Software) project, as well as the fundamental principles that underlie it, with the present emphasis on the Basic Linear Algebra Subprograms (BLAS), a widely used, performance-critical, linear algebra kernel library.
Abstract: This paper describes the ATLAS (Automatically Tuned Linear Algebra Software) project, as well as the fundamental principles that underlie it. ATLAS is an instantiation of a new paradigm in high performance library production and maintenance, which we term AEOS (Automated Empirical Optimization of Software); this style of library management has been created in order to allow software to keep pace with the incredible rate of hardware advancement inherent in Moore's Law. ATLAS is the application of this new paradigm to linear algebra software, with the present emphasis on the Basic Linear Algebra Subprograms (BLAS), a widely used, performance-critical, linear algebra kernel library.

994 citations


"Non-uniform DFT implementation for ..." refers methods in this paper

  • ...A note on the CPU implementations that we use for comparison: ATLAS[10] is a library that can be tuned for optimum performance on CPUs, and makes use of some of the parallel features available on modern CPUs....

    [...]

  • ...However, the optimizations done by ATLAS lead to variations that are very sensitive to the size of the number of taps and similar parameters....

    [...]

  • ...The CPU code with the ATLAS library gives 2.2x and 3.9x speedup for the scan and time-shift methods respectively compared to its single-threaded counterpart. (Footnote 2: We have restricted to a single-threaded CPU implementation with the -O3 compiler optimisation switch: the system has obvious data parallelism across multiple user/channels that can be accounted for separately.)...

    [...]

  • ...For consistency, we have compared our GPU implementation against the regular CPU implementations with normal compiler optimizations, with the understanding that a further speedup may be possible using ATLAS, but this does not fundamentally change the observations....

    [...]