scispace - formally typeset
Search or ask a question

Showing papers on "Split-radix FFT algorithm published in 1997"


Journal ArticleDOI
TL;DR: A new formulation of fast Fourier transformation kernels for radix 2, 3, 4, and 5 are presented, which have a perfect balance of multiplies and adds and give higher performance on machines that have a single multiply--add (mult--add) instruction.
Abstract: We present a new formulation of fast Fourier transformation (FFT) kernels for radix 2, 3, 4, and 5, which have a perfect balance of multiplies and adds. These kernels give higher performance on machines that have a single multiply--add (mult--add) instruction. We demonstrate the superiority of this new kernel on IBM and SGI workstations.

181 citations


Journal ArticleDOI
TL;DR: A simple procedure for designing finite-extent impulse response (FIR) discrete-time filters using the FFT algorithm is described and extension of the design method to higher dimensions is straightforward.
Abstract: The fast Fourier transform (FFT) algorithm has been used in a variety of applications in signal and image processing. In this article, a simple procedure for designing finite-extent impulse response (FIR) discrete-time filters using the FFT algorithm is described. The zero-phase (or linear phase) FIR filter design problem is formulated to alternately satisfy the frequency domain constraints on the magnitude response bounds and time domain constraints on the impulse response support. The design scheme is iterative in which each iteration requires two FFT computations. The resultant filter is an equiripple approximation to the desired frequency response. The main advantage of the FFT-based design method is its implementational simplicity and versatility. Furthermore, the way the algorithm works is intuitive and any additional constraint can be incorporated in the iterations, as long as the convexity property of the overall operations is preserved. In one-dimensional cases, the most widely used equiripple FIR filter design algorithm is the Parks-McClellan algorithm (1972). This algorithm is based on linear programming, and it is computationally efficient. However, it cannot be generalized to higher dimensions. Extension of our design method to higher dimensions is straightforward. In this case two multidimensional FFT computations are needed in each iteration.

108 citations


Proceedings ArticleDOI
19 May 1997
TL;DR: The design of a custom CMOS Fast Fourier Transform (FFT) processor for computing 256-point complex FFT, well suited for real-time spectrum analysis in instrumentation and measurement applications is described.
Abstract: This paper describes in detail the design of a custom CMOS Fast Fourier Transform (FFT) processor for computing 256-point complex FFT. The FFT is well suited for real-time spectrum analysis in instrumentation and measurement applications. The FFT butterfly processor consists of one parallel-parallel multiplier and two adders. It is capable of computing one butterfly computation every 100 ns thus it can compute 256-complex point FFT in 25.6 /spl mu/s excluding data input and output processes.

35 citations


Proceedings ArticleDOI
H. Guo1, C.S. Burrus1
21 Apr 1997
TL;DR: An algorithm that uses the discrete wavelet transform as a tool to compute the discrete Fourier transform (DFT) and the Cooley-Tukey FFT is shown to be a special case of the proposed algorithm when the wavelets in use are trivial.
Abstract: We propose an algorithm that uses the discrete wavelet transform (DWT) as a tool to compute the discrete Fourier transform (DFT). The Cooley-Tukey FFT is shown to be a special case of the proposed algorithm when the wavelets in use are trivial. If no intermediate coefficients are dropped and no approximations are made, the proposed algorithm computes the exact result, and its computational complexity is on the same order of the FFT, i.e. O(N log/sub 2/ N). The main advantage of the proposed algorithm is that the good time and frequency localization of wavelets can be exploited to approximate the Fourier transform for many classes of signals resulting in much less computation. Thus the new algorithm provides an efficient complexity vs. accuracy tradeoff. When approximations are allowed, under certain sparsity conditions, the algorithm can achieve linear complexity, i.e. O(N). The proposed algorithm also has built-in noise reduction capability.

33 citations


Journal ArticleDOI
TL;DR: An error propagation model is proposed for the in-place decimation-in-time version of the radix-2 FFT algorithm and an accurate error expression and error variance for the computation of FFT are derived.
Abstract: An error propagation model is proposed for the in-place decimation-in-time version of the radix-2 FFT algorithm. With the model, an accurate error expression and error variance for the computation of FFT are derived. This correspondence deals with fixed-point and block floating-point arithmetic. Simulation results agree closely with the theoretical predicted ones. We find that some roundoff errors at different stages correlate with each other. The density of correlations is closely associated with the round-off approach used in butterfly calculations.

28 citations


Journal ArticleDOI
TL;DR: In this article, a method for bootstrapping stationary Gaussian sequences is studied, where the FFT is applied to the original data, randomized in the frequency domain, and the inverse FFT was applied.
Abstract: A method for bootstrapping stationary Gaussian sequences is studied. The FFT is applied to the original data, randomized in the frequency domain, and the inverse FFT is applied. The result is a sequence whose second order properties are similar to those of the original sequence. Conditions under which the method is valid are given.

22 citations


Journal ArticleDOI
01 Aug 1997
TL;DR: A new pruning method for an FFT type of transform structure is proposed, whose novelty lies in the fact that it is able to complete a previously pruned transform or to progress from one level of pruning to another.
Abstract: A new pruning method for an FFT type of transform structure is proposed. Its novelty lies in the fact that, besides being able to prune the transform, it is able to complete a previously pruned transform or to progress from one level of pruning to another. The method can be directly applied to fast progressive image coding.

14 citations


Book ChapterDOI
01 Sep 1997
TL;DR: This paper proposes a fast parallel implementation of Discrete Fourier Transform (DFT) using FPGAs using zero-order interpolation, which is faster than most standard DSP designs for FFT and outperforms other FPGA-based implementations for F FT.
Abstract: In this paper we propose a fast parallel implementation of Discrete Fourier Transform (DFT) using FPGAs. Our design is based on the Arithmetic Fourier Transform (AFT) using zero-order interpolation. For a given problem of size N, AFT requires only O(N2) additions and O(N) real multiplications with constant factors. Our design employes 2p + 1 PEs (1 ≤ p ≤ N), O(N) memory and fixed 1/O with the host. It is scalable over p (1 ≤ p ≤ N) and can solve larger problems with the same hardware by increasing the memory. All the PEs have fixed architecture. Our implementation is faster than most standard DSP designs for FFT. It also outperforms other FPGA-based implementations for FFT, in terms of speed and adaptability to larger problems.

14 citations


Patent
Karl Schwarz1
30 Jun 1997
TL;DR: In this paper, each of p or 2p out of n measured projections of the length N are convolved with a modified filter kernel h (k) using Fast Fourier Transform (FFT) and IFFFT simultaneously in one step.
Abstract: In a method for image reconstruction in imaging technology for the implementation of a fast convolution with the transformation length M while allowing slight over-convolution errors, each of p or 2p out of n measured projections of the length N are convolved with a modified filter kernel h (k) using Fast Fourier Transform (FFT) and Inverse Fast Fourier Transform (IFFT) simultaneously in one step.

11 citations


Proceedings ArticleDOI
12 Apr 1997
TL;DR: The results of this work will serve as a framework for creating an object-oriented, poly-functional FFT implementation which will automatically choose the most efficient algorithm given user-specified constraints.
Abstract: A large number of fast Fourier transform (FFT) algorithms have been developed over the years. Among these, the most promising are the radix-2, radix-4, split-radix, fast Hartley transform (FHT), quick Fourier transform (QFT), and the decimation-in-time-frequency (DITF) algorithms. We present a rigorous analysis of these algorithms that includes the number of mathematical operations, computational time, memory requirements, and object code size. The results of this work will serve as a framework for creating an object-oriented, poly-functional FFT implementation which will automatically choose the most efficient algorithm given user-specified constraints.

11 citations


Journal ArticleDOI
TL;DR: In this article, a set of efficient formulas to evaluate the deflections of the vertical on the sphere using gridded data was presented, including the Vening-Meinesz formula, the topographic indirect effect on the vertical as well as the terrain corrections.
Abstract: This paper presents a set of efficient formulas to evaluate the deflections of the vertical on the sphere using gridded data. The Vening-Meinesz formula, the topographic indirect effect on the deflections of the vertical as well as the terrain corrections are expressed as both 2D and 1D convolutions on the sphere, and consequently can be evaluated by the 2D and the 1D fast Fourier transform (FFT). When compared with the results obtained from pointwise integration, the use of the 1D FFT gives identical results, and therefore these results were used as control values in this paper. The use of the spherical 2D FFT improves significantly the computational efficiency with little sacrifice of accuracy (0.6″ rms difference from the 1D FFT results). The planar 2D FFT, which is as efficient as the spherical 2D FFT, gives worse results (1.2″ rms difference from the 1D FFT results) because of the extra approximations.

Patent
25 Mar 1997
TL;DR: In this paper, a system for processing a signal made up of at least one spectral component is presented. But the system is not suitable for high-dimensional data, and it requires non-complex arithmetic operations to compute the phase and amplitude of the signal.
Abstract: A system for processing a signal made up of at least one spectral component. The system includes a sampling circuit for periodically sampling the signal to obtain a sequence of discrete data samples representing the signal; and a computational circuit for computing a fast Fourier transform (FFT) of the sequence of discrete data samples to an extent which the FFT is computed using substantially exclusively non-complex arithmetic operations, and for using a resultant portion of the FFT to obtain a phase and amplitude of the at least one spectral component making up the signal.

Journal ArticleDOI
TL;DR: A certain equivalence relation is defined on the set of bijections that lead to FFT algorithms, and its connection with isomorphism classes of group extensions is studied.
Abstract: Fast Fourier transform algorithms rely upon the choice of certain bijective mappings between the indices of the data arrays. The two basic mappings used in the literature lead to Cooley–Tukey algorithms or to prime factor algorithms. But many other bijections also lead to FFT algorithms, and a complete classification of these mappings is provided. One particular choice leads to a new FFT algorithm that generalizes the prime factor algorithm. It has the advantage of reducing the floating point operation count by reducing the number of trigonometric function evaluations.

Journal ArticleDOI
TL;DR: A polynomial-based approach is simpler and derives naturally from the fact that exact distributions for many discrete data models, including those reviewed, arise from polynomials, and facilitates a synthesis of diverse algorithms used in the field of exact inference.

Journal ArticleDOI
TL;DR: A general split-radix algorithm is presented to compute 2D discrete Fourier transforms of sequence length q'2/sup m/ by q' 2/Sup m/ where q is an odd integer to achieve savings in the number of operations.
Abstract: A general split-radix algorithm is presented to compute 2D discrete Fourier transforms of sequence length q'2/sup m/ by q'2/sup m/ where q is an odd integer. By setting different values of q, DFT's of various sequence lengths can be efficiently computed. When q=3 for example, savings in the number of operations can be achieved in comparison with that needed by other algorithms.

Journal ArticleDOI
TL;DR: Theorems that identify the symmetry in f[x, y] based on the depth of the quadtree to expedite 2-D FFT computation of coherent digital images are presented and applied in transform coding systems and lossy compression of images.
Abstract: The discrete Fourier transform (DFT) of a real sequence f[x, y] of size N/spl times/N, where N=2/sup n/, can be computed by a two-dimensional (2-D) FFT of size N/4, or smaller if f[x, y] is known to have certain symmetries. This paper presents theorems that identify the symmetry in f[x, y] based on the depth of the quadtree to expedite 2-D FFT computation of coherent digital images. In principle, it establishes that if the quadtree of f[x, y] has maximum depth k

01 Jan 1997
TL;DR: Time-localization can be achieved by first sliding a window along the signal and then taking the FT over each interval, and the Gabor transform, which uses a Gaussian window function, is often used.
Abstract: analysis has been the Fourier transform (FT) using the fast Fourier transform (FFT) algorithm. However, the FFT method assumes the signal to be stationary and is thereby insensitive to its varying features. Time-localization can be achieved by first sliding a window along the signal and then taking the FT over each interval. The Gabor transform (GT), which uses a Gaussian window function, is often ap

Journal ArticleDOI
TL;DR: In this article, the exchange-correlation potential in crystals is calculated using fast Fourier transform (FFT) instead of the standard FFT algorithm, which has much greater numerical accuracy than the standard method.

Proceedings ArticleDOI
04 May 1997
TL;DR: It will be shown that complex numbers can be approximated accurately by cyclotomic integers, and combine this idea with Chinese remaindering strategies in the cyclOTomic integers to give a O(b1+?
Abstract: Many applications of fast fourier transforms (FFT's), such as computer- tomography, geophysical signal processing, high resolution imaging radars, and prediction filters, require high precision output. The usual method of fixed point computation of FFT's of vectors of length 2l leads to an average loss of l/2 bits of precision. This phenomenon, often referred to as computational noise, causes major problems for arithmetic units with limited precision which are often used for real time applications. Several researchers have noted that calculation of FFT's with algebraic integers avoids computational noise entirely, see, e.g., [3]. We will show that complex numbers can be approximated accurately by cyclotomic integers, and combine this idea with Chinese remaindering strategies in the cyclotomic integers to, roughly, give a O(b1+? L log (L)) algorithm to compute b-bit precision FFT's of length L. The first part of the paper will describe the FFT strategy, assuming good app..

Journal ArticleDOI
E.A. Hashish1
TL;DR: In this paper, the performance of the FFT inversion method is investigated for the case of band limited scattered field data and a modification of the inversion algorithm is presented to increase the accuracy of the method in this case.
Abstract: The inverse scattering techniques have become widely used in many applications. These techniques are based on either continuous profile inversion or discrete multi-layer models. The FFT inversion method has been newly introduced for the detection of discrete multi-layer models. In this paper, the performance of the FFT inversion method is investigated for the case of band limited scattered field data. A modification of the inversion algorithm is presented to increase the accuracy of the method in this case. The proposed algorithm exhibits much better accuracy than the directly applied FFT inversion method.

Proceedings ArticleDOI
17 Nov 1997
TL;DR: This paper proposes a concurrent fault-detection scheme for FFT processors that requires no extra computations for locating a pair of faulty butterfly units and can be used for highly reliable real-time systems.
Abstract: This paper proposes a concurrent fault-detection scheme for FFT processors. In the scheme, fault detection is made by comparing the pair of outputs from butterfly units based on the FFT algorithm. The hardware overhead for the scheme is O(N) where N is the number of input data. This scheme requires no extra computations for locating a pair of faulty butterfly units, therefore, the scheme can be used for highly reliable real-time systems.

Journal ArticleDOI
TL;DR: In this article, an analytical Fourier transform method (AFT) was proposed to calculate semiclassical eigenvalues of multidimensional systems in the regular regime, which combines the best properties of those FFT procedures: accuracy, stability and simplicity.

Proceedings ArticleDOI
20 Aug 1997
TL;DR: It is shown that a fast Fourier transform method of generating time series samples of stationary, zero-mean, correlated Gaussian noise typically requires an order of magnitude less operations and memory elements than other algorithms.
Abstract: Rapid generation of time series samples of stationary, zero-mean, correlated Gaussian noise will accelerate digital communication system simulations. In this paper, we show that a fast Fourier transform (FFT) method of generating such samples typically requires an order of magnitude less operations and memory elements than other algorithms.

Journal ArticleDOI
TL;DR: These two modified FFT algorithms have been successfully applied in harmonic analysis of power system and clearly increase analysis precision, and reduce the demand to sampling frequency thus greatly enhance computing speed.