Proceedings ArticleDOI

Non-uniform DFT implementation for channel simulations in GPU

TLDR
A parallel scan based method is proposed to speed up channel simulation in wireless link-level OFDM network simulators without restricting the scope of the simulations, and DFT properties are exploited in the scan method to reduce register usage and hence the computation overhead of sine and cosine values.
Abstract
Channel simulation in wireless link-level OFDM network simulators involves a computationally intensive non-uniform discrete Fourier transform (DFT). In this paper, we propose a parallel scan based method to speed up this computation on a GPU without restricting the scope of the simulations. We further exploit DFT properties in the scan method to reduce register usage and hence the computation overhead of sine and cosine values. This technique is compared against a method that saves computation by using uniform power delay profiles (PDPs) at the cost of generality, and we show that its performance is competitive. For a single DFT, a speedup of up to 19x over a CPU implementation is observed using the scan based approach. For a simulation with 512 channels and a 1024-point DFT, the scan method gives a speedup of 141x with respect to the CPU, which compares favourably to the more restrictive uniform PDP method.
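
The abstract does not spell out the scan formulation, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes the channel frequency response is H[k] = sum over taps of g_l * exp(-j*2*pi*k*tau_l/N) with arbitrary (non-uniform) delays tau_l, and it uses the DFT property exp(-j*2*pi*k*tau/N) = w^k with w = exp(-j*2*pi*tau/N), so that the twiddle factors for all subcarriers follow from a multiplicative prefix scan with a single sine/cosine evaluation per tap. The names `nudft_direct`, `nudft_scan`, and `inclusive_scan` are hypothetical, and the sequential Python loops only stand in for the data-parallel passes a GPU kernel would run.

```python
import cmath
import math


def nudft_direct(gains, delays, n_fft):
    """Reference: one complex exponential per (subcarrier, tap) pair."""
    return [
        sum(g * cmath.exp(-2j * math.pi * k * d / n_fft)
            for g, d in zip(gains, delays))
        for k in range(n_fft)
    ]


def inclusive_scan(values, op):
    """Hillis-Steele inclusive scan; op must be associative.

    Each pass of the while loop is data-parallel on a GPU. The copy of
    the previous pass stands in for a barrier between passes.
    """
    out = list(values)
    step = 1
    while step < len(out):
        prev = out[:]
        for i in range(step, len(out)):
            out[i] = op(prev[i - step], prev[i])
        step *= 2
    return out


def nudft_scan(gains, delays, n_fft):
    """Assumed scan formulation: twiddles w^k come from a prefix product."""
    response = [0j] * n_fft
    for g, d in zip(gains, delays):
        # Only sine/cosine evaluation needed for this tap.
        w = cmath.exp(-2j * math.pi * d / n_fft)
        # Scanning [1, w, w, ...] with multiplication yields w^0 .. w^(n_fft-1).
        powers = inclusive_scan([1 + 0j] + [w] * (n_fft - 1),
                                lambda a, b: a * b)
        for k in range(n_fft):
            response[k] += g * powers[k]
    return response
```

Because the delays are never required to lie on a uniform grid, this formulation keeps the generality that the uniform PDP method gives up; the trade-off is that repeated multiplication accumulates rounding error, which a production kernel would need to bound.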

Citations
Proceedings ArticleDOI

Frequency domain multipath fading channel simulator integrated with OFDM transmitter for E-UTRAN baseband traffic generator

TL;DR: Frequency domain representation of the commonly accepted Tapped Delay Line (TDL) model is discussed and three transformation algorithms are evaluated to develop an efficient 3GPP compliant method to simulate multiple independent fading radio channels in a software defined E-UTRAN traffic generator.
Proceedings ArticleDOI

Performance analysis of frequency domain simulator of multi-UE E-UTRAN fading channel with intercarrier interference

TL;DR: The research shows that direct frequency domain linear ICI approximation, represented by the frequency domain model 2, offers good accuracy in terms of ICI synthesis for the most practical Doppler frequencies and simultaneously requires fewer operations if the number of simulated UEs is large.
References
Proceedings ArticleDOI

On channel estimation in OFDM systems

TL;DR: The authors present the MMSE and LS estimators and a method for modifications compromising between complexity and performance, and the symbol error rate for a 16-QAM system is presented by means of simulation results.
Book

Programming Massively Parallel Processors: A Hands-on Approach

TL;DR: Programming Massively Parallel Processors: A Hands-on Approach shows students and professionals alike the basic concepts of parallel programming and GPU architecture, and various techniques for constructing parallel programs are explored in detail.
Journal ArticleDOI

Programming Massively Parallel Processors. A Hands-on Approach

TL;DR: This comprehensive text/reference provides a foundation for the understanding and implementation of parallel programming skills which are needed to achieve breakthrough results by developing parallel applications that perform well on certain classes of Graphics Processing Units (GPUs).
Journal ArticleDOI

New trends in high performance computing

TL;DR: The Automatically Tuned Linear Algebra Software (ATLAS) project is described, as well as the fundamental principles that underlie it, with the present emphasis on the Basic Linear Algebra Subprograms (BLAS), a widely used, performance-critical, linear algebra kernel library.

Automated Empirical Optimizations of Software and the ATLAS Project (LAPACK Working Note 147)

TL;DR: This paper describes the ATLAS (Automatically Tuned Linear Algebra Software) project, as well as the fundamental principles that underlie it, with the present emphasis on the Basic Linear Algebra Subprograms (BLAS), a widely used, performance-critical, linear algebra kernel library.