scispace - formally typeset
Open AccessProceedings ArticleDOI

A high performance split-radix FFT with constant geometry architecture

Joyce Kwong, +1 more
- pp 1537-1542
Reads0
Chats0
TLDR
A new parallel FFT architecture which combines the split-radix algorithm with a constant geometry interconnect structure which achieves 46% lower power than a parallel radix-4 design at 4.5GS/s when computing a 128-point real-valued transform.
Abstract
High performance hardware FFTs have numerous applications in instrumentation and communication systems. This paper describes a new parallel FFT architecture which combines the split-radix algorithm with a constant geometry interconnect structure. The split-radix algorithm is known to have lower multiplicative complexity than both radix-2 and radix-4 algorithms. However, it conventionally involves an "L-shaped" butterfly datapath whose irregular shape has uneven latencies and makes scheduling difficult. This work proposes a split-radix datapath that avoids the L-shape. With this, the split-radix algorithm can be mapped onto a constant geometry interconnect structure in which the wiring in each FFT stage is identical, resulting in low multiplexing overhead. Further, we exploit the lower arithmetic complexity of split-radix to lower dynamic power, by gating the multipliers during trivial multiplications. The proposed FFT achieves 46% lower power than a parallel radix-4 design at 4.5GS/s when computing a 128-point real-valued transform.

read more

Citations
More filters
Journal ArticleDOI

Low-Power Split-Radix FFT Processors Using Radix-2 Butterfly Units

TL;DR: Simulation results show that compared with the conventional radix-2 shared-memory implementations, the proposed design achieves over 20% lower power consumption when computing a 1024-point complex-valued transform.
Journal ArticleDOI

Towards design and automation of a scalable split-radix FFT processor for high throughput applications

TL;DR: This paper has analyzed in details the operational latency and mathematical consistency of the proposed split-radix 4/8 FFT algorithm which is comparable in latency with radix-8 algorithm but is considerably less complex for large-size FFTs.
Journal ArticleDOI

Datapath-regular implementation and scaled technique for N=3×2m DFTs

TL;DR: A fast algorithm for efficiently computing a DFT of size 3×2m, adopting the scaled DFT technique, which can reduce the number of operations and improve precision.
Journal ArticleDOI

A performance-efficient and datapath-regular implementation of modified split-radix fast Fourier transform

TL;DR: A simplified algorithm is proposed for the modified split radix FFT (MSRFFT) algorithm, reducing the number of real coefficients evaluated from 5/8N −2t o 15/32N − 2 and the numberof groups of decomposition from 4 to 3 and the proposed implementation method can save execution time on CPUs and general processing units (GPUs).
References
More filters
Journal ArticleDOI

Real-valued fast Fourier transform algorithms

TL;DR: A new implementation of the real-valued split-radix FFT is presented, an algorithm that uses fewer operations than any otherreal-valued power-of-2-length FFT.
Journal ArticleDOI

`Split radix' FFT algorithm

TL;DR: A new N = 2n fast Fourier transform algorithm is presented, which has fewer multiplications and additions than radix 2n, n = 1, 2, 3 algorithms, has the same number of multiplications as the Raderi-Brenner algorithm, but much fewer additions.
Journal ArticleDOI

An Adaptation of the Fast Fourier Transform for Parallel Processing

TL;DR: A modified version of the Fast Fourier Transform is developed and described and it is suggested that this form is of general use in the development and classification of various modifications and extensions of the algorithm.
Journal ArticleDOI

A 1-GS/s FFT/IFFT processor for UWB applications

TL;DR: A novel 128-point FFT/IFFT processor for ultrawideband (UWB) systems and the proposed pipelined FFT architecture, called mixed-radix multipath delay feedback (MRMDF), can provide a higher throughput rate by using the multidata-path scheme.
Journal ArticleDOI

A 2.4-GS/s FFT Processor for OFDM-Based WPAN Applications

TL;DR: A novel simplification method to reduce the hardware cost in multiplication units of the multiple-path FFT approach is proposed and a multidata scaling scheme to reduce wordlengths while preserving the signal-to-quantization-noise ratio is presented.
Related Papers (5)