Realization of area efficient QR factorization using unified division, square root, and inverse square root hardware

doi:10.1109/EIT.2009.5189620

Proceedings Article

- pp 245-250

TLDR

A unified hardware architecture for fast, area-efficient QR factorization based on the Householder transformation is presented, together with its design, implementation, and synthesis results on FPGA hardware.

Abstract:

The QR factorization is used in many signal processing and communication applications such as echo cancellation, adaptive beamforming, and multiple-input multiple-output (MIMO) systems. However, the division, square root, and inverse square root operations required by the QR algorithm are difficult to implement because they are computationally slow and area-consuming. This paper presents a unified hardware architecture for fast, area-efficient QR factorization based on the Householder transformation. Newton-Raphson and Goldschmidt algorithms are used for the fast division, square root, and inverse square root blocks. By using a unified architecture, the area and power requirements for QR factorization are reduced without decreasing overall speed. The design and implementation of the proposed hardware are presented with synthesis results based on FPGA hardware.
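As a rough software illustration of the arithmetic the abstract describes (not the paper's hardware design), the Python sketch below builds a Householder QR factorization on top of two iterative kernels: a Newton-Raphson inverse square root and a Goldschmidt division. The function names `nr_rsqrt`, `goldschmidt_div`, and `householder_qr`, the iteration counts, the constant seed of 1.0, and the choice to derive the square root as `x * rsqrt(x)` are all assumptions made for this example; a hardware implementation would typically use fixed-point data and a small lookup table for the seed.

```python
import math


def nr_rsqrt(x, iterations=8):
    """Approximate 1/sqrt(x) for x > 0 by Newton-Raphson: y <- y*(1.5 - 0.5*x*y*y)."""
    if x <= 0:
        raise ValueError("x must be positive")
    m, e = math.frexp(x)          # x = m * 2**e with m in [0.5, 1)
    if e % 2:                     # force an even exponent so it can be halved exactly
        m *= 2.0
        e -= 1                    # m is now in [0.5, 2)
    y = 1.0                       # assumed constant seed; converges for m < 3
    for _ in range(iterations):
        y = y * (1.5 - 0.5 * m * y * y)
    return math.ldexp(y, -e // 2)  # undo the scaling: 2**(-e/2)


def goldschmidt_div(n, d, iterations=6):
    """Approximate n/d by Goldschmidt iteration: scale n and d by f = 2 - d until d -> 1."""
    if d == 0:
        raise ZeroDivisionError("division by zero")
    sign = -1.0 if d < 0 else 1.0
    m, e = math.frexp(abs(d))     # |d| = m * 2**e with m in [0.5, 1)
    n = sign * math.ldexp(n, -e)  # apply the same power-of-two scaling to n
    for _ in range(iterations):
        f = 2.0 - m
        n *= f
        m *= f
    return n


def householder_qr(a):
    """Return (q, r) with a = q*r, for a matrix given as a list of rows (rows >= cols)."""
    m, n = len(a), len(a[0])
    r = [[float(x) for x in row] for row in a]
    q = [[1.0 if i == j else 0.0 for j in range(m)] for i in range(m)]
    for k in range(min(m - 1, n)):
        s = sum(r[i][k] ** 2 for i in range(k, m))
        if s == 0.0:
            continue                       # column already zero below the diagonal
        norm = s * nr_rsqrt(s)             # sqrt(s) computed as s * (1/sqrt(s))
        alpha = -norm if r[k][k] >= 0 else norm
        v = [r[i][k] for i in range(k, m)]
        v[0] -= alpha                      # Householder vector v = x - alpha*e1
        beta = goldschmidt_div(2.0, sum(t * t for t in v))
        # R <- H R with H = I - beta * v * v^T
        for j in range(n):
            dot = sum(v[i] * r[k + i][j] for i in range(len(v)))
            for i in range(len(v)):
                r[k + i][j] -= beta * dot * v[i]
        # Q <- Q H (H is symmetric)
        for i in range(m):
            dot = sum(q[i][k + j] * v[j] for j in range(len(v)))
            for j in range(len(v)):
                q[i][k + j] -= beta * dot * v[j]
    return q, r
```

Calling `householder_qr([[12, -51, 4], [6, 167, -68], [-4, 24, -41]])` returns an orthogonal `q` and an upper-triangular `r` whose product reproduces the input up to rounding. Both kernels use only multiplications and additions inside their loops, which is the property that makes these iterations attractive when a multiplier is shared between operations.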

Citations

Proceedings Article

FPGA implementation of fast QR decomposition based on givens rotation

Semih Aslan, +2 more

TL;DR: An improved fixed-point hardware design of QR decomposition, specifically optimized for Xilinx FPGAs, is introduced; the Givens rotation algorithm is implemented using a folded systolic array and the CORDIC algorithm, making the design well suited to high-speed FPGA or ASIC implementations.

Journal Article

Stress Recognition from Heterogeneous Data

Bo Zhang

- 01 Jan 2016 -

Journal of Image and Graphics

TL;DR: This thesis proposes an approach based on a Support Vector Machine (SVM) classifier and shows that reaction time can be used to estimate an individual's stress level, either alone or in combination with physiological signals.

Journal Article

Algorithm, Architecture, and Floating-Point Unit Codesign of a Matrix Factorization Accelerator

Ardavan Pedram, +2 more

- 01 Aug 2014 -

IEEE Transactions on Computers

TL;DR: This paper examines the mapping of algorithms encountered when solving dense linear systems and linear least-squares problems to a custom Linear Algebra Processor and exposes the benefits of redesigning floating point units and their surrounding data-paths to support these complicated operations.

Proceedings Article

Floating Point Architecture Extensions for Optimized Matrix Factorization

Ardavan Pedram, +2 more

TL;DR: This paper examines the mapping of algorithms encountered when solving dense linear systems and linear least-squares problems to a custom Linear Algebra Processor and exposes the benefits of redesigning floating point units and their surrounding data-paths to support these complicated operations.

Journal Article

NoC-Based FPGA Acceleration for Monte Carlo Simulations with Applications to SPECT Imaging

P. J. Kinsman, +1 more

- 01 Mar 2013 -

IEEE Transactions on Computers

TL;DR: This paper presents a compute architecture for accelerating Monte Carlo simulations based on the Network-on-Chip (NoC) paradigm for on-chip communication. Through the complete implementation of a Monte Carlo-based image reconstruction algorithm for Single-Photon Emission Computed Tomography (SPECT) imaging, it demonstrates that this complex problem can be accelerated by two orders of magnitude.

References

Book

Digital arithmetic

Milos D. Ercegovac, +1 more

TL;DR: In Digital Arithmetic, two of the field's leading experts deliver a unified treatment of digital arithmetic, tying underlying theory to design practice in a technology-independent manner and helping readers develop sound solutions, avoid known mistakes, and repeat successful design decisions.

Journal Article

Division algorithms and implementations

S.F. Obermann, +1 more

- 01 Aug 1997 -

IEEE Transactions on Computers

TL;DR: A taxonomy of division algorithms is presented that classifies the algorithms by their hardware implementations and their impact on system design; it finds that digit recurrence algorithms are suitable for low-cost implementations where chip area must be minimized.

Book Chapter

CHAPTER 9 – Digit-Serial Arithmetic

Milos D. Ercegovac, +1 more

TL;DR: In this chapter, the authors consider systems in which all operands and results are serial, although a mixed system in which some inputs and outputs are serial and others parallel is also possible.

Journal Article

Improving Goldschmidt division, square root, and square root reciprocal

Milos D. Ercegovac, +4 more

- 01 Jul 2000 -

IEEE Transactions on Computers

TL;DR: The aim of this paper is to accelerate division, square root, and square root reciprocal computations when the Goldschmidt method is used on a pipelined multiplier, by replacing the last iteration with the addition of a correcting term that can be looked up during the early iterations.

Book

Numerical Methods: Algorithms and Applications

Laurene V. Fausett

TL;DR: This book discusses methods for solving systems of linear equations, ordinary differential equations (higher-order equations and first-order systems), and numerical differentiation and integration.
