Showing papers on "Prime-factor FFT algorithm published in 2017"

PDF

Open Access

Proceedings Article•DOI•

Correcting soft errors online in fast fourier transform

[...]

Xin Liang¹, Jieyang Chen¹, Dingwen Tao¹, Sihuan Li¹, Panruo Wu¹, Hongbo Li¹, Kaiming Ouyang¹, Yuanlai Liu¹, Fengguang Song², Zizhong Chen¹ - Show less +6 more•Institutions (2)

University of California¹, Indiana University – Purdue University Indianapolis²

12 Nov 2017

TL;DR: Wang et al. as mentioned in this paper presented an online ABFT scheme for FFT so that soft errors can be detected online and the corrupted computation can be terminated in a much more timely manner.

...read moreread less

Abstract: While many algorithm-based fault tolerance (ABFT) schemes have been proposed to detect soft errors offline in the fast Fourier transform (FFT) after computation finishes, none of the existing ABFT schemes detect soft errors online before the computation finishes. This paper presents an online ABFT scheme for FFT so that soft errors can be detected online and the corrupted computation can be terminated in a much more timely manner. We also extend our scheme to tolerate both arithmetic errors and memory errors, develop strategies to reduce its fault tolerance overhead and improve its numerical stability and fault coverage, and finally incorporate it into the widely used FFTW library - one of the today's fastest FFT software implementations. Experimental results demonstrate that: (1) the proposed online ABFT scheme introduces much lower overhead than the existing offline ABFT schemes; (2) it detects errors in a much more timely manner; and (3) it also has higher numerical stability and better fault coverage.

...read moreread less

33 citations

Journal Article•DOI•

A Memory-Based FFT Processor Design With Generalized Efficient Conflict-Free Address Schemes

[...]

Kaifeng Xia¹, Bin Wu¹, Tao Xiong¹, Tian-Chun Ye¹•Institutions (1)

Chinese Academy of Sciences¹

03 Mar 2017-IEEE Transactions on Very Large Scale Integration Systems

TL;DR: The design and implementation of memory-based fast Fourier transform (FFT) processors with generalized efficient, conflict-free address schemes and a method, named high-radix–small-butterfly (HRSB), to decrease the computation cycles and eliminate the complexity of the processing engine are presented.

...read moreread less

Abstract: This paper presents the design and implementation of memory-based fast Fourier transform (FFT) processors with generalized efficient, conflict-free address schemes. We unified the conflict-free address schemes of three different FFT lengths, including the single-power points, the common nonsingle-power points, and the nonsingle-power points applied with a prime factor algorithm. Though the three cases differ in terms of decomposition, they are all compatible with memory-based architecture by the way of the proposed address schemes. Moreover, the decomposition algorithm utilizes a method, named high-radix–small-butterfly (HRSB), to decrease the computation cycles and eliminate the complexity of the processing engine. In addition, an efficient index generator, a simplified multipath delay commutator engine, and a unified Winograd Fourier transform algorithm butterfly core were also designed. We designed two FFT examples in long-term evolution system to verify the availability of the address scheme, including a 2n (128–2048)-point FFT unit and a 35 different point (12–1296) DFT unit. Compared with previous works with similar address schemes, this paper supports more generalized lengths and achieves more flexible throughput.

...read moreread less

30 citations

Journal Article•DOI•

Hybrid Genetic Algorithm and Modified Iterative Fourier Transform Algorithm for Large Thinned Array Synthesis

[...]

Can Cui¹, Wen Tao Li¹, Xiu Tiao Ye¹, Xiaowei Shi¹•Institutions (1)

Xidian University¹

03 May 2017-IEEE Antennas and Wireless Propagation Letters

TL;DR: In this article, a hybrid algorithm based on the GA and modified iterative Fourier transform (MIFT) technique called HGAMIFT is proposed for large thinned array synthesis, which can achieve a better solution in terms of the peak sidelobe level when compared to the iterative FFT technique.

...read moreread less

Abstract: In this letter, a hybrid algorithm based on the genetic algorithm (GA) and modified iterative Fourier transform (MIFT) technique called HGAMIFT is proposed for large thinned array synthesis. By employing a perturbation mechanism, the MIFT can achieve a better solution in terms of the peak sidelobe level when compared to the iterative Fourier transform technique. Moreover, a control factor to determine the proportion of individuals from GA and MIFT as well as the crossover and mutation rate is introduced to help the HGAMIFT maintain the diversity of population in the early phase while avoiding stagnation in the late phase. Thus, a resulting enhanced search ability and fast convergence velocity can be obtained simultaneously. Several thinned arrays that comprise different array sizes and aperture shapes have been synthesized, which validate the superior performance of the proposed algorithm.

...read moreread less

29 citations

Journal Article•DOI•

Simulation of non-stationary wind velocity field on bridges based on Taylor series

[...]

Yongle Li¹, Koffi Togbenou¹, Huoyue Xiang¹, Ning Chen²•Institutions (2)

Southwest Jiaotong University¹, Hunan University of Science and Technology²

01 Oct 2017-Journal of Wind Engineering and Industrial Aerodynamics

TL;DR: In this paper, a spectral representation-based technique for non-stationary wind velocity field simulation is proposed by combining Cholesky decomposition and Taylor series expansion, which is mainly developed for the simulation of nonstationary WV on long-span cable supported bridges.

...read moreread less

26 citations

Journal Article•DOI•

Improvement in Computation Time of 3-D Scattering Center Extraction Using the Shooting and Bouncing Ray Technique

[...]

Dal-Jae Yun¹, Jae-In Lee¹, Kyung Taek Bae¹, Ji-Hee Yoo¹, Kyoung-Il Kwon², Noh-Hoon Myung¹ - Show less +2 more•Institutions (2)

KAIST¹, Agency for Defense Development²

25 May 2017-IEEE Transactions on Antennas and Propagation

TL;DR: A modified FFT-based scheme is proposed to improve computation time without increasing the memory requirement in the conventional 3-D scattering center extraction algorithm using the shooting and bouncing ray technique.

...read moreread less

Abstract: We present a fast 3-D scattering center extraction algorithm using the shooting and bouncing ray technique. The proposed algorithm generates a 3-D inverse synthetic aperture radar image from which a set of 3-D scattering centers is then extracted using the CLEAN algorithm. In the conventional extraction algorithm, computation time is improved using the fast Fourier transform (FFT)-based scheme. However, the memory requirement is greatly increased because of the high oversampling required to mitigate the interpolation error. Evaluation of memory-time tradeoffs indicates that the conventional algorithm is still too time consuming given the constraints of practical memory. Thus, a modified FFT-based scheme is proposed to improve computation time without increasing the memory requirement. Through modifying the ray spread function, the distortion from the interpolation error is mitigated without the use of high oversampling. Implementation based on the proposed FFT-based scheme accelerates the image formation process significantly. Numerical simulations of realistic targets are presented to demonstrate the performance of the proposed algorithm.

...read moreread less

21 citations

Journal Article•DOI•

FFT formulations of adaptive Fourier decomposition

[...]

You Gao¹, Min Ku², Tao Qian¹, Jianzhong Wang³•Institutions (3)

University of Macau¹, University of Aveiro², Sam Houston State University³

01 Nov 2017-Journal of Computational and Applied Mathematics

TL;DR: The discretization of the 1-D AFD integration via with discrete Fourier transform (DFT), incorporating fast Fouriertransform (FFT) is explored, showing that the new algorithm, called FFT-AFD, reduces the computational complexity from O(MN2) to O( MNlogN), the latter being the same as FFT.

...read moreread less

20 citations

Journal Article•DOI•

Guaranteed-Stable Sliding DFT Algorithm With Minimal Computational Requirements

[...]

Chun-Su Park¹•Institutions (1)

Sungkyunkwan University¹

15 Oct 2017-IEEE Transactions on Signal Processing

TL;DR: The proposed oSDFT algorithm directly computes the DFT bins of the shifted window by simply adding (or subtracting) the bins of a previous window and an updating vector and it is shown that the updating vector can be efficiently computed with a low complexity in the sliding transform scenario.

...read moreread less

Abstract: The discrete Fourier transform (DFT) is the most widely used technique for determining the frequency spectra of digital signals. However, in the sliding transform scenario where the transform window is shifted one sample at a time and the transform process is repeated, the use of DFT becomes difficult due to its heavy computational burden. This paper proposes an optimal sliding DFT (oSDFT) algorithm that achieves both the lowest computational requirement and the highest computational accuracy among existing sliding DFT algorithms. The proposed oSDFT algorithm directly computes the DFT bins of the shifted window by simply adding (or subtracting) the bins of a previous window and an updating vector. We show that the updating vector can be efficiently computed with a low complexity in the sliding transform scenario. Our simulations demonstrate that the proposed algorithm outperforms the existing sliding DFT algorithms in terms of computational accuracy and processing time.

...read moreread less

19 citations

Proceedings Article•DOI•

FFT and sparse FFT techniques and applications

[...]

Badri Narayan Mohapatra¹, Rashmita Kumari Mohapatra•Institutions (1)

Centurion University of Technology and Management¹

01 Feb 2017

TL;DR: Different impact of fft and sparse fft in the areas of medical imaging, wireless communication, and many applications like digital signal processing, partial differential equation solvers, communications, image processing, all most all Fourier coefficients are very small.

...read moreread less

Abstract: Currently, the FFT is used in different areas, starting from identification of frequency on mechanical vibration to image enhancement. Real-time computation by interpret the acquired data can be easily possible by Fast Fourier Transform (FFT). The best with an efficient algorithm is FFT so it is the foundation for analyzing, monitoring, and controlling various systems. Many applications like digital signal processing, partial differential equation solvers, communications, image processing, all most all Fourier coefficients are very small and known as sparse. In this paper we present different impact of fft and sparse fft in the areas of medical imaging, wireless communication.

...read moreread less

15 citations

Journal Article•DOI•

Elementary Number Theory and Rader's FFT

[...]

Shlomo Engelberg

07 Aug 2017-Siam Review

TL;DR: Rader's FFT is derived, Rader's zero-padding technique is described, and the performance of the unpadded and the zero-padded approaches is examined.

...read moreread less

Abstract: This note provides a self-contained introduction to Rader's fast Fourier transform (FFT). We start by explaining the need for an additional type of FFT. The properties of the multiplicative group o...

...read moreread less

15 citations

Journal Article•DOI•

Area-Time Efficient Architecture of FFT-Based Montgomery Multiplication

[...]

Wangchen Dai¹, Donald Donglong Chen¹, Ray C. C. Cheung¹, Çetin Kaya Koç•Institutions (1)

City University of Hong Kong¹

01 Mar 2017-IEEE Transactions on Computers

TL;DR: This paper integrates the fast Fourier transform (FFT) method into the McLaughlin’s framework, and presents an improved FFT-based Montgomery modular multiplication (MMM) algorithm achieving high area-time efficiency.

...read moreread less

Abstract: The modular multiplication operation is the most time-consuming operation for number-theoretic cryptographic algorithms involving large integers, such as RSA and Diffie-Hellman. Implementations reveal that more than 75 percent of the time is spent in the modular multiplication function within the RSA for more than 1,024-bit moduli. There are fast multiplier architectures to minimize the delay and increase the throughput using parallelism and pipelining. However such designs are large in terms of area and low in efficiency. In this paper, we integrate the fast Fourier transform (FFT) method into the McLaughlin’s framework, and present an improved FFT-based Montgomery modular multiplication (MMM) algorithm achieving high area-time efficiency. Compared to the previous FFT-based designs, we inhibit the zero-padding operation by computing the modular multiplication steps directly using cyclic and nega-cyclic convolutions. Thus, we reduce the convolution length by half. Furthermore, supported by the number-theoretic weighted transform, the FFT algorithm is used to provide fast convolution computation. We also introduce a general method for efficient parameter selection for the proposed algorithm. Architectures with single and double butterfly structures are designed obtaining low area-latency solutions, which we implemented on Xilinx Virtex-6 FPGAs. The results show that our work offers a better area-latency efficiency compared to the state-of-the-art FFT-based MMM architectures from and above 1,024-bit operand sizes. We have obtained area-latency efficiency improvements up to 50.9 percent for 1,024-bit, 41.9 percent for 2,048-bit, 37.8 percent for 4,096-bit and 103.2 percent for 7,680-bit operands. Furthermore, the operating latency is also outperformed with high clock frequency for length-64 transform and above.

...read moreread less

14 citations

Proceedings Article•DOI•

A fast and accurate chirp rate estimation algorithm based on the fractional Fourier transform

[...]

Ahmet Serbes, Omair Aldimashki

01 Aug 2017

TL;DR: In this work, a fast and accurate chirp-rate estimation algorithm is presented and it is shown that utilization of the golden section search algorithm to find the maximum magnitude of the fractional Fourier transform domains not only accelerates the process, but also increases the accuracy in a noisy environment.

...read moreread less

Abstract: In this work, a fast and accurate chirp-rate estimation algorithm is presented. The algorithm is based on the fractional Fourier transform. It is shown that utilization of the golden section search algorithm to find the maximum magnitude of the fractional Fourier transform domains not only accelerates the process, but also increases the accuracy in a noisy environment. Simulation results validate the proposed algorithm and show that the accuracy of parameter estimation nearly achieves the Cramer-Rao lower bound for SNR values as low as −7dB.

...read moreread less

Journal Article•DOI•

Hybrid Watermarking Algorithm using Finite Radon and Fractional Fourier Transform

[...]

J. B. Sharmaa¹, K.K. Sharma², Sunil Dutt Purohit¹, Abdon Atangana³•Institutions (3)

Rajasthan Technical University¹, Malaviya National Institute of Technology, Jaipur², University of the Free State³

01 Jan 2017-Fundamenta Informaticae

TL;DR: A watermarking scheme based on finite radon transform (FRAT), fractional Fourier Transform (FRFT) and singular value decomposition is proposed, which provides additional degree of freedom in security, robustness, payload capacity and visual transparence.

...read moreread less

Abstract: Watermarking is proposed as solution to authentication, copyright protection and security requirements of multimedia objects (speech, image and video). In this paper a watermarking scheme based on finite radon transform (FRAT), fractional Fourier Transform (FRFT) and singular value decomposition is proposed. In the proposed scheme, image to be watermarked is first transformed by finite radon transform, the radon transformed image is further transformed by FRFT, and singular values of FRFT transformed image are modified to embed the watermark. Inverse transformation is applied to obtain watermarked image. Simulations are performed under various test conditions with different FRFT transform angles for improved robustness and visual transparence of watermarked image. Results of the proposed scheme are better in comparison to the existing schemes for most of the attacks. Proposed scheme provide additional degree of freedom in security, robustness, payload capacity and visual transparence. Proposed scheme can also be used to communicate or store the watermarked image as erasure code, to reduce communication errors over a network, due to the use of FRAT.

...read moreread less

Posted Content•

Coded Fourier Transform

[...]

Qian Yu¹, Mohammad Ali Maddah-Ali², A. Salman Avestimehr¹•Institutions (2)

University of Southern California¹, Bell Labs²

17 Oct 2017-arXiv: Distributed, Parallel, and Cluster Computing

TL;DR: In this article, the authors consider the problem of computing the Fourier transform of high-dimensional vectors, distributedly over a cluster of machines consisting of a master node and multiple worker nodes, where the worker nodes can only store and process a fraction of the inputs.

...read moreread less

Abstract: We consider the problem of computing the Fourier transform of high-dimensional vectors, distributedly over a cluster of machines consisting of a master node and multiple worker nodes, where the worker nodes can only store and process a fraction of the inputs. We show that by exploiting the algebraic structure of the Fourier transform operation and leveraging concepts from coding theory, one can efficiently deal with the straggler effects. In particular, we propose a computation strategy, named as coded FFT, which achieves the optimal recovery threshold, defined as the minimum number of workers that the master node needs to wait for in order to compute the output. This is the first code that achieves the optimum robustness in terms of tolerating stragglers or failures for computing Fourier transforms. Furthermore, the reconstruction process for coded FFT can be mapped to MDS decoding, which can be solved efficiently. Moreover, we extend coded FFT to settings including computing general $n$-dimensional Fourier transforms, and provide the optimal computing strategy for those settings.

...read moreread less

Journal Article•DOI•

A Continuous-Flow Memory-Based Architecture for Real-Valued FFT

[...]

Xiu-Bin Mao¹, Zhen-Guo Ma¹, Feng Yu¹, Qian-Jian Xing¹•Institutions (1)

Zhejiang University¹

16 Mar 2017-IEEE Transactions on Circuits and Systems Ii-express Briefs

TL;DR: The proposed architecture is based on a modified radix-2 algorithm, which removes redundant operations to reduce resource usage and a new data-flow graph and address mapping scheme are proposed that satisfy the requirement of continuous-flow operation and minimize the memory usage.

...read moreread less

Abstract: This brief proposes a continuous-flow memory-based architecture for fast Fourier transform (FFT) computation for real-valued signals. The proposed architecture is based on a modified radix-2 algorithm, which removes redundant operations to reduce resource usage. A new data-flow graph and address mapping scheme are proposed that satisfy the requirement of continuous-flow operation and minimize the memory usage. The proposed processing element takes advantage of pipelined FFT architectures to avoid bank conflicts in each stage. Compared with prior works, the proposed design has the advantage of supporting continuous-flow operation and normal-order output while minimizing the resource usage.

...read moreread less

Journal Article•DOI•

The Sliding Windowed Infinite Fourier Transform [Tips & Tricks]

[...]

Logan L. Grado, Matthew D. Johnson, Theoden I. Netoff

01 Sep 2017-IEEE Signal Processing Magazine

TL;DR: For real-time applications that require recalculating the DFT at each sample or over only a subset of the N center frequencies, the FFT is far from optimal.

...read moreread less

Abstract: The discrete Fourier transform (DFT) is the standard tool for spectral analysis in digital signal processing, typically computed using the fast Fourier transform (FFT). However, for real-time applications that require recalculating the DFT at each sample or over only a subset of the N center frequencies of the DFT, the FFT is far from optimal.

...read moreread less

Proceedings Article•DOI•

An area efficient high speed optimized FFT algorithm

[...]

Betsy Rose Manuel¹, E. Konguvel¹, M. Kannan¹•Institutions (1)

Madras Institute of Technology¹

01 Mar 2017

TL;DR: The design of low power Radix-8 DIT FFT is presented, which aims at reducing the number of multipliers that are used to compute the FFT by swapping the input terms and reordering them to reduce the power consumption.

...read moreread less

Abstract: In recent years the Fast Fourier Transform is widely used in a number of applications as it is considered to be an efficient algorithm to compute the Discrete Fourier Transform. The process of computing the FFT for large sequence real time data becomes complex and tedious. Hence it is necessary to design a system that can perform the FFT computation of large sequence data with reduced power consumption. This paper presents the design of low power Radix-8 DIT FFT. The proposed design aims at reducing the number of multipliers that are used to compute the FFT. This is achieved by swapping the input terms and reordering them. This leads to a reduction in the number of multipliers used to perform the computation and thereby causing a reduction in the power consumption. This method is highly advantageous when the input signals are lengthy since the number of multipliers used is large in number consuming very high power. In order to optimize the FFT architecture the number of multipliers is reduced thereby causing a significant reduction in power. The prototype for Radix-2 (8 point) and Radix-4 (16 point) optimized FFT is designed, implemented and simulated using Altera ModelSim DE2 EP2C35F672C6 FPGA device. The proposed Radix-2 (8 point) and Radix-4 (16 point) optimized FFT operates at a speed of 10.41 Gbps and 21.23 Gbps respectively.

...read moreread less

Journal Article•DOI•

Optimized Spectrum Permutation for the Multidimensional Sparse FFT

[...]

Andre Rauh¹, Gonzalo R. Arce¹•Institutions (1)

University of Delaware¹

01 Jan 2017-IEEE Transactions on Signal Processing

TL;DR: It is shown that permutation parameters are of key importance and should not be chosen randomly but instead can be optimized and a connection is made between the sparse Fourier transform algorithm and lattice theory, thus establishing a rigorous understanding of the effect of the permutations on the algorithm performance.

...read moreread less

Abstract: A multidimensional sparse fast Fourier transform algorithm is introduced via generalizations of key concepts used in the one-dimensional (1-D) sparse Fourier transform algorithm. It is shown that permutation parameters are of key importance and should not be chosen randomly but instead can be optimized. A connection is made between the sparse Fourier transform algorithm and lattice theory, thus establishing a rigorous understanding of the effect of the permutations on the algorithm performance. Lattice theory is then used to optimize the set of parameters to achieve a more robust and better performing algorithm. Other algorithms using pseudorandom spectrum permutation can also benefit from the methods developed in this paper. The contributions address the case of the exact $k$ -sparse Fourier transform but the underlying concepts can be applied to the general case of finding a $k$ -sparse approximation of the Fourier transform of an arbitrary signal. Simulations illustrate the efficiency and accuracy of the proposed algorithm. The optimizations of the parameters and the improved performance are shown in simulations for the 2-D case where worst case and average case peak signal-to-noise ratio (PSNR) improves by several decibels.

...read moreread less

Journal Article•DOI•

An acceleration of FFT-based algorithms for the match-count problem

[...]

Kensuke Baba¹•Institutions (1)

Fujitsu¹

01 Sep 2017-Information Processing Letters

TL;DR: This paper provides a method to reduce the number of computations of the FFT required in the F FT-based algorithm, which is applicable to other algorithms based on the convolution theorem, including algorithms for the weighted version of the match-count problem.

...read moreread less

Proceedings Article•DOI•

Fast interpolation of bandlimited functions

[...]

Samuel F. Potter¹, Nail A. Gumerov¹, Ramani Duraiswami¹•Institutions (1)

University of Maryland, College Park¹

01 Mar 2017

TL;DR: This work extends a recent method for computing a periodized fast multipole method with an adaptive algorithm that reduces the required number of multipole-to-local and local- to-local translations by an order of magnitude, and results in an algorithm that is competitive with other nonuniform fast Fourier transforms.

...read moreread less

Abstract: The nonuniform fast Fourier transform comprises a set of algorithms which approximately interpolate the usual discrete Fourier transform in the time and/or frequency domain. A nonuniform fast Fourier transform based on the fast multipole method was previously developed but passed over in favor of other approaches [1, 2]. This work extends a recent method for computing a periodized fast multipole method [3] with an adaptive algorithm that reduces the required number of multipole-to-local and local-to-local translations by an order of magnitude. This combination improves the speed and accuracy of the original algorithm, and results in an algorithm that is competitive with other nonuniform fast Fourier transforms. Numerical experiments are carried out comparing our implementation with others, demonstrating its viability.

...read moreread less

Proceedings Article•DOI•

The Truncated Fourier Transform for Mixed Radices

[...]

Robin Larrieu¹•Institutions (1)

École Polytechnique¹

23 Jul 2017

TL;DR: This paper presents a generalization of the Truncated Fourier Transform (TFT) for arbitrary orders that allows to benefit from the advantages of the TFT in the general case.

...read moreread less

Abstract: The standard version of the Fast Fourier Transform (FFT) is applied to problems of size n=2k. For this reason, FFT-based evaluation interpolation schemes often reduce a problem of size l to a problem of size n , where n is the smallest power of two with l ≤ n. However, this method presents "jumps" in the complexity at powers of two; and on the other hand, n-l values are computed that are actually unnecessary for the interpolation. To mitigate this problem, a truncated variant of the FFT was designed to avoid the computation of these unnecessary values. In the initial formulation [10], it is assumed that n is a power of two, but some use cases (for example in finite fields) may require more general values of n. This paper presents a generalization of the Truncated Fourier Transform (TFT) for arbitrary orders. This allows to benefit from the advantages of the TFT in the general case.

...read moreread less

Journal Article•DOI•

Fast Fourier transform benchmark on X86 Xeon system for multimedia data processing

[...]

Young Soo Park¹, Koo-Rack Park¹, Jin-Mook Kim², Hwa-Young Jeong³•Institutions (3)

Kongju National University¹, Sun Moon University², Kyung Hee University³

01 Feb 2017-Multimedia Tools and Applications

TL;DR: I benchmarking the well-known Fast Fourier Transforms Library at X86 Xeon E5 2690 v3 system, and measuring the performance over a range of a transform size.

...read moreread less

Abstract: I benchmarking the well-known Fast Fourier Transforms Library at X86 Xeon E5 2690 v3 system. Fourier transform image processing is an important tool that is used to decompose the image into sine and cosine components. If the input image represented by the equation in the spatial domain, output from the Fourier transform represents the image in the fourier or the frequency domain. Each point represents a particular frequency included in the spatial domain image in the Fourier domain image. Fourier transform is used widely for image analysis, image filtering, image compression and image reconstruction as a wide variety of applications. Fourier transform plays a important role in signal processing, image processing and speech recognition. It has been used in a wide range of sectors. For example, this is often a signal processing, is used in digital signal processing applications, such as voice recognition, image processing. The Discrete Fourier transform is a specific kind of Fourier transform. It maps the sequence over time to sequence over frequencies. If it implemented as a discrete Fourier transform, the time complexity is O (N2). It's actually not a better way to use. Alternatively, the Fast Fourier Transform is possible to easily perform a Discrete Fourier Transform of parallelism with only O (n log n) algorithm. Fast Fourier Transform is widely used in a variety of scientific computing program. If you are using the correct library can improve the performance of the program, without any additional effort. I have a well-known fast Fourier transform library was going to perform a benchmarking on X86 based Intel Xeon E5 2690 systems. In the machine's current Intel Xeon X86 Linux system. I have installed Intel IPP library, FFTW3 Library (West FFT), Kiss -FFT library and the numutils library on Intel X86 Xeon E5 based systems. The benchmark performed at C, and measuring the performance over a range of a transform size. It benchmarks both real and complex transforms in one dimension.

...read moreread less

Journal Article•DOI•

Sparse fast Clifford Fourier transform

[...]

Rui Wang¹, Yi-xuan Zhou¹, Yan-liang Jin¹, Wenming Cao², Wenming Cao³ - Show less +1 more•Institutions (3)

Shanghai University¹, University of Missouri², Shenzhen University³

31 Oct 2017-Journal of Zhejiang University Science C

TL;DR: This paper presents a new algorithm called sparse fast CFT (SFCFT), which can greatly improve the computing performance in scalar and vector fields and can effectively improve the performance of multivector signal processing.

...read moreread less

Abstract: The Clifford Fourier transform (CFT) can be applied to both vector and scalar fields However, due to problems with big data, CFT is not efficient, because the algorithm is calculated in each semaphore The sparse fast Fourier transform (sFFT) theory deals with the big data problem by using input data selectively This has inspired us to create a new algorithm called sparse fast CFT (SFCFT), which can greatly improve the computing performance in scalar and vector fields The experiments are implemented using the scalar field and grayscale and color images, and the results are compared with those using FFT, CFT, and sFFT The results demonstrate that SFCFT can effectively improve the performance of multivector signal processing

...read moreread less

Journal Article•DOI•

Optimal time-distributed fast Fourier transform: Application to online iterative learning control—experimental high-speed nanopositioning example ☆

[...]

Jiangbo Liu¹, Bo Yan², Qingze Zou¹•Institutions (2)

Rutgers University¹, University of Electronic Science and Technology of China²

01 Feb 2017-Mechatronics

TL;DR: The proposed OTD-FFT technique, the FFT computation of an online sampled data sequence is optimally distributed among all the sampling periods without increasing the total computational complexity, arriving at the minimal per-sampling-period computational complexity.

...read moreread less

Journal Article•DOI•

FFT-Based Method With Near-Matrix Compression

[...]

Wei-Bin Kong, Hou-Xing Zhou¹, Kai-Lai Zheng², Xing Mu¹, Wei Hong¹ - Show less +1 more•Institutions (2)

Southeast University¹, Nanjing University of Posts and Telecommunications²

01 Nov 2017-IEEE Transactions on Antennas and Propagation

TL;DR: In this paper, a novel grouping scheme of the basis functions within the framework of the fast Fourier transform (FFT)-based method is proposed for creating a block-sparse structure for the near-matrix, and then FFT-based method with nearmatrix compression is established for the efficient analysis of multiscale problems.

...read moreread less

Abstract: In this paper, a novel grouping scheme of the basis functions within the framework of the fast Fourier transform (FFT)-based method is proposed for creating a block-sparse structure for the near-matrix, and then FFT-based method with near-matrix compression is established for the efficient analysis of multiscale problems. For a multiscale problem, if an FFT-based method is required to maintain both higher efficiency and higher accuracy at calculating the far interactions, then the near-matrix will be inevitably very large. Compared with the traditional FFT-based method, the proposed method can significantly reduce the near-matrix memory requirement without increasing the time spent. Several numerical examples are provided to demonstrate the correctness and the efficiency of the proposed method.

...read moreread less

Journal Article•DOI•

Parallel Nonuniform Discrete Fourier Transform (P-NDFT) Over a Random Wireless Sensor Network

[...]

Xi Xu¹, Rashid Ansari¹, Ashfaq Khokhar²•Institutions (2)

University of Illinois at Chicago¹, Iowa State University²

01 Dec 2017-IEEE Transactions on Parallel and Distributed Systems

TL;DR: A distributed hybrid structure consisting of local Non-equispaced Discrete Fourier Transform (NDFT) and global FFT computations is designed and implemented on the SIDnet-SWANS platform, and the tradeoffs between communication cost, execution time, and energy consumption are studied.

...read moreread less

Abstract: Reduced execution time and increased power efficiency are important objectives in the distributed execution of collaborative signal processing tasks over wireless sensor networks (WSNs) Meanwhile, Fourier transforms are among the most widely used frequency analysis tools in WSNs for studying the behavior of sensed phenomena Several energy-efficient in-network Fourier transform computation algorithms have been proposed for WSNs Most of these works assume that the sensors are equally spaced over a one-dimensional (1D) region However, in practice, the sensors are usually randomly distributed over a two-dimensional (2D) plane Consequently, the conventional 2D Fast Fourier Transform (FFT) designed for data sampled on a uniform grid is not applicable in such environments We address this problem by designing a distributed hybrid structure consisting of local Non-equispaced Discrete Fourier Transform (NDFT) and global FFT computations First, the NDFT method is applied within suitably selected clusters to obtain the initial uniform Fourier coefficients within allowable estimation error bounds We investigate both classical linear and generalized interpolation methods for computing the NDFT coefficients within each cluster Second, a separable 2D FFT is applied over all clusters using our proposed energy-efficient 1D FFT computation method, which reduces communication costs by employing a novel binary representation mapping strategy for data exchanges between sensors The proposed techniques are implemented on the SIDnet-SWANS platform, and the tradeoffs between communication cost, execution time, and energy consumption are studied

...read moreread less

Proceedings Article•DOI•

Effectiveness of partitioning strategies of Fast Fourier Transform in GPU implementations

[...]

Kamil Wieloch¹, Kamil Stokfiszewski¹, Mykhaylo Yatsymirskyy¹•Institutions (1)

Lodz University of Technology¹

01 Sep 2017

TL;DR: The conducted experiments show that the partitioning of the Radix-2 FFT computational scheme into 4-point and 8-point subtransform blocks in which calculations are performed in a sequential, single instruction multiple thread (SIMT) manner results in significantly faster execution of the FFT calculations on the selected GPU architectures than the standard parallel 2-point butterfly base operation approach.

...read moreread less

Abstract: In this paper authors present the results of the effectiveness comparison between the variants of the Radix-2 Deci-mation in Time (DIT) Fast Fourier Transform (FFT) algorithm's implementations on graphics processing units (GPUs) which differ in the way the calculations are distributed among GPUs computational resources. The conducted experiments show that the partitioning of the FFT computational scheme into 4-point and 8-point subtransform blocks in which calculations are performed in a sequential, single instruction multiple thread (SIMT) manner results in significantly faster execution of the FFT calculations on the selected GPU architectures than the standard parallel 2-point butterfly base operation approach. Moreover the proposed partitioning scheme can be extended to arbitrary subtransform block size for a given N-point Radix-2 FFT implementation dedicated to particular GPU architectures.

...read moreread less

Journal Article•DOI•

An efficient pipelined architecture for real-valued Fast Fourier Transform

[...]

M. Aravind Kumar, K. Manjunatha Chari¹•Institutions (1)

Gandhi Institute of Technology and Management¹

03 Apr 2017-International Journal of Electronics

TL;DR: This paper has adopted the symmetric property and designed an efficient pipelined architecture for 16-point DIF FFT and modified the complex multiplier with reduced internal real multipliers which are in turn replaced by an modified canonic signed digit multiplier (CSDM) with resource-sharing technique.

...read moreread less

Abstract: Real-valued Fast Fourier Transform (FFT) plays an important role in today’s digital world because of the fact that most of the signals contain real values. The FFT computation of real signals using conventional techniques requires more hardware space with high power consumption, which is the most important task for a researcher while designing VLSI architectures. This can be eradicated by clearly analysing the symmetric property of the real-valued signals. In this paper, we have adopted the symmetric property and designed an efficient pipelined architecture for 16-point DIF FFT. The pipeline scheme reduce the processing time at the cost of some registers and in order to contribute efficiently for power reduction we have modified the complex multiplier with reduced internal real multipliers which are in turn replaced by an modified canonic signed digit multiplier (CSDM) with resource-sharing technique. The complete module is synthesised and simulated using Xilinx ISE 14.1 with the target device is ...

...read moreread less

Journal Article•DOI•

Design of a recursive single-bin DFT algorithm for sparse spectrum analysis

[...]

Wei Wang, Ying Tang, Xiongxing Zhang, Wang Kening

20 Apr 2017-Cluster Computing

TL;DR: The RSB-DFT algorithm has the potential to become an alternative and efficient tool for sparse spectrum analysis and is demonstrated as the highest compared with the DFT, FFT, Goertzel algorithm.

...read moreread less

Abstract: Discrete Fourier transform (DFT) is the basic means of spectrum analysis in the field of digital signal processing, and the fast Fourier transform (FFT) has become the most popular algorithm which decreases the computational complexity from quadratical to linearithmic. However, engineers are often challenged to detect a single or just a few of the frequency components. For this kind of sparse spectrum analysis, the FFT no longer has advantage because it always computes all the frequency components. This paper proposes a recursive single-bin DFT (RSB-DFT) algorithm to compute one specific frequency spectrum, whose theoretical derivation is elaborated and implementation steps are given as a flow diagram. A 16-point RSB-DFT calculation example is also given to exhibit computation process of the algorithm. An application example for bioimpedance spectroscopy (BIS) measurement demonstrates that the proposed RSB-DFT algorithm can compute specific single spectral lines accurately. The computation efficiency of the proposed RSB-DFT algorithm is demonstrated as the highest compared with the DFT, FFT, Goertzel algorithm, which means that the RSB-DFT algorithm has the potential to become an alternative and efficient tool for sparse spectrum analysis.

...read moreread less

Journal Article•DOI•

High-throughput Low-complexity Mixed-radix FFT Processor using a Dual-path Shared Complex Constant Multiplier

[...]

Tram Thi Bao Nguyen, Hanho Lee

28 Feb 2017-Journal of Semiconductor Technology and Science

TL;DR: The synthesis results demonstrate that the proposed FFT processor can lead to a 16% reduction in hardware complexity and higher throughput compared to conventional architectures.

...read moreread less

Abstract: This paper presents a high-throughput lowcomplexity 512-point eight-parallel mixed-radix multipath delay feedback (MDF) fast Fourier transform (FFT) processor architecture for orthogonal frequency division multiplexing (OFDM) applications. To decrease the number of twiddle factor (TF) multiplications, a mixed-radix 2 4 /2 3 FFT algorithm is adopted. Moreover, a dual-path shared canonical signed digit (CSD) complex constant multiplier using a multi-layer scheme is proposed for reducing the hardware complexity of the TF multiplication. The proposed FFT processor is implemented using TSMC 90-nm CMOS technology. The synthesis results demonstrate that the proposed FFT processor can lead to a 16% reduction in hardware complexity and higher throughput compared to conventional architectures.

...read moreread less

Proceedings Article•DOI•

Low-complexity twiddle factor generator for FFT processors

[...]

Chu Yu¹, Kuang-Hsiao Lee¹, Chien-Feng Kuo¹•Institutions (1)

National Ilan University¹

01 Jan 2017

TL;DR: A low-complexity twiddle factor generator suitable for radix-8/4/2 fast Fourier transformation (FFT) and approximately costs constant chip area for an arbitrary point of FFT processor, almost independent of the FFT processing size.

...read moreread less

Abstract: This paper presents a low-complexity twiddle factor (TF) generator suitable for radix-8/4/2 fast Fourier transformation (FFT). The proposed TF generator employs a full complex multiplier only and simple logic units to produce all the twiddle factors required for use in FFT computation. This generator approximately costs constant chip area for an arbitrary point of FFT processor, almost independent of the FFT processing size.

...read moreread less