Topic

QR decomposition

About: QR decomposition, also known as QR factorization, is a research topic. Over its lifetime, 3504 publications have appeared within this topic, receiving 100599 citations.

Papers

Journal Article

A Randomized Blocked Algorithm for Efficiently Computing Rank-revealing Factorizations of Matrices

Per-Gunnar Martinsson, Sergey Voronin

27 Oct 2016-SIAM Journal on Scientific Computing

TL;DR: This paper describes a technique for computing partial rank-revealing factorizations, such as a partial QR factorization or a partial singular value decomposition; the method is inspired by the Gram-Schmidt algorithm and has the same $O(mnk)$ asymptotic flop count.

Abstract: This manuscript describes a technique for computing partial rank-revealing factorizations, such as a partial QR factorization or a partial singular value decomposition. The method takes as input a tolerance $\varepsilon$ and an $m\times n$ matrix $\boldsymbol{\mathsf{A}}$ and returns an approximate low-rank factorization of $\boldsymbol{\mathsf{A}}$ that is accurate to within precision $\varepsilon$ in the Frobenius norm (or some other easily computed norm). The rank $k$ of the computed factorization (which is an output of the algorithm) is in all examples we examined very close to the theoretically optimal $\varepsilon$-rank. The proposed method is inspired by the Gram--Schmidt algorithm and has the same $O(mnk)$ asymptotic flop count. However, the method relies on randomized sampling to avoid column pivoting, which allows it to be blocked, and hence accelerates practical computations by reducing communication. Numerical experiments demonstrate that the accuracy of the scheme is for every matrix that was...

97 citations
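
The abstract above describes a blocked, randomized scheme that stops once a Frobenius-norm tolerance is met. The following is a minimal NumPy sketch of that general idea, not the authors' algorithm: the function name `rand_qb`, the block size, the explicit residual matrix, and the reorthogonalization step are all assumptions made for illustration.

```python
import numpy as np

def rand_qb(A, eps, block=8, seed=0):
    """Hypothetical sketch: build A ~= Q @ B block by block until
    the Frobenius norm of the residual A - Q @ B is at most eps."""
    rng = np.random.default_rng(seed)
    m, n = A.shape
    E = np.array(A, dtype=float)          # residual, updated in place
    Qs, Bs = [], []
    rank, max_rank = 0, min(m, n)
    while np.linalg.norm(E, "fro") > eps and rank < max_rank:
        b = min(block, max_rank - rank)
        G = rng.standard_normal((n, b))   # random test matrix replaces column pivoting
        Qi, _ = np.linalg.qr(E @ G)       # orthonormal basis for the sampled range
        if Qs:
            # Reorthogonalize against earlier blocks to limit loss of orthogonality.
            Qprev = np.hstack(Qs)
            Qi -= Qprev @ (Qprev.T @ Qi)
            Qi, _ = np.linalg.qr(Qi)
        Bi = Qi.T @ E
        E -= Qi @ Bi                      # remove the captured component
        Qs.append(Qi)
        Bs.append(Bi)
        rank += b
    if not Qs:
        return np.zeros((m, 0)), np.zeros((0, n))
    return np.hstack(Qs), np.vstack(Bs)
```

The returned rank is a multiple of the block size (capped at min(m, n)), so it can overshoot the smallest rank that meets the tolerance by up to one block. A partial QR or SVD can then be obtained by factorizing the small matrix `B`.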

Journal Article

Architecture and FPGA Design of Dichotomous Coordinate Descent Algorithms

Jie Liu¹, Yuriy Zakharov¹, B. Weaver¹•Institutions (1)

University of York¹

01 Nov 2009-IEEE Transactions on Circuits and Systems I-regular Papers

TL;DR: This paper presents architectures and field-programmable gate-array designs of two variants of the DCD algorithm, known as cyclic and leading DCD algorithms, and proposes fixed-point designs that provide an accuracy performance that is very close to the performance of floating-point counterparts and require significantly lower FPGA resources than techniques based on QR decomposition.

Abstract: In the areas of signal processing and communications, such as antenna-array beamforming, adaptive filtering, multiuser and multiple-input-multiple-output (MIMO) detection, channel estimation and equalization, echo and interference cancellation, and others, solving linear systems of equations often provides an optimal performance. However, this is also a very complicated operation that designers try to avoid by proposing different suboptimal techniques. The dichotomous coordinate descent (DCD) algorithm allows linear systems of equations to be solved with high computational efficiency. In this paper, we present architectures and field-programmable gate-array (FPGA) designs of two variants of the DCD algorithm, which are known as cyclic and leading DCD algorithms. For each of these techniques, we present serial designs, group-2 and group-4 designs, as well as a design with parallel update of the residual vector for the cyclic DCD algorithm. These designs have different degrees of parallelism, thus enabling a tradeoff between FPGA resources and computation time. The serial designs require the smallest FPGA resources; they are well suited for applications where many parallel solvers are required, e.g., for detection in MIMO-orthogonal-frequency-division-multiplexing communication systems. The parallelism introduced in the proposed group-2 and group-4 designs allows faster convergence to the true solution at the expense of an increase in FPGA resources. The design with parallel update of the residual vector provides the fastest convergence speed; however, if the system size is high, it may result in a significant increase in FPGA resources. The proposed fixed-point designs provide an accuracy performance that is very close to the performance of floating-point counterparts and require significantly lower FPGA resources than techniques based on QR decomposition.

96 citations
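
To make the idea of solving a linear system by coordinate descent with power-of-two step sizes concrete, here is a floating-point sketch in the spirit of the cyclic variant mentioned above. It is an illustration under assumptions, not the paper's fixed-point FPGA design: the function name `cyclic_dcd`, the amplitude bound `H`, the bit count `n_bits`, and the update cap are hypothetical, and the matrix is assumed symmetric positive definite with a solution inside [-H, H].

```python
import numpy as np

def cyclic_dcd(A, b, H=1.0, n_bits=16, max_updates=100_000):
    """Hypothetical sketch: solve A x = b by cyclic coordinate updates
    whose step size alpha is halved once per bit."""
    A = np.asarray(A, dtype=float)
    x = np.zeros(len(b))
    r = np.array(b, dtype=float)      # residual r = b - A x
    alpha = H
    updates = 0
    for _ in range(n_bits):
        alpha /= 2.0                  # move to the next, finer bit
        changed = True
        while changed and updates < max_updates:
            changed = False
            for n in range(len(b)):
                # Update only when a step of size alpha reduces the quadratic cost.
                if abs(r[n]) > 0.5 * alpha * A[n, n]:
                    step = alpha if r[n] > 0 else -alpha
                    x[n] += step
                    r -= step * A[:, n]
                    changed = True
                    updates += 1
    return x, r
```

Because every step is a signed power of two, the updates need only shifts and additions in fixed-point hardware, which is the property that makes this family of methods attractive compared with a full QR-based solver.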

Proceedings Article

VLSI Implementation of a High-Speed Iterative Sorted MMSE QR Decomposition

P. Luethi¹, Andreas Burg¹, S. Haene¹, D. Perels¹, Norbert Felber¹, Wolfgang Fichtner¹•Institutions (1)

ETH Zurich¹

27 May 2007

TL;DR: The architecture and results of the first VLSI implementation of an iterative sorted QR decomposition preprocessor for MIMO receivers are described, which performs MIMO channel preprocessing using Givens rotations and provides the base for an improved layered stream decoding.

Abstract: The QR decomposition is an important, but often underestimated prerequisite for pseudo- or non-linear detection methods such as successive interference cancellation or sphere decoding for multiple-input multiple-output (MIMO) systems. The ability of concurrent iterative sorting during the QR decomposition introduces a moderate overall latency, but provides the base for an improved layered stream decoding. This paper describes the architecture and results of the first VLSI implementation of an iterative sorted QR decomposition preprocessor for MIMO receivers. The presented architecture performs MIMO channel preprocessing using Givens rotations in order to compute the minimum mean squared error QR decomposition

95 citations
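
The abstract above computes the QR decomposition with Givens rotations. As a plain software illustration of that building block, the sketch below triangularizes a real matrix one rotation at a time. It is a simplified assumption-laden example: the function name `givens_qr` is hypothetical, the matrix is real-valued, and the sorting and MMSE aspects of the paper are omitted.

```python
import numpy as np

def givens_qr(A):
    """Hypothetical sketch: QR decomposition of a real matrix via Givens rotations.
    Returns Q (m x m, orthogonal) and R (m x n, upper triangular) with A = Q @ R."""
    R = np.array(A, dtype=float)
    m, n = R.shape
    Q = np.eye(m)
    for j in range(n):
        # Zero the entries below the diagonal in column j, from the bottom up.
        for i in range(m - 1, j, -1):
            a, b = R[i - 1, j], R[i, j]
            r = np.hypot(a, b)
            if r == 0.0:
                continue                      # nothing to eliminate
            c, s = a / r, b / r
            G = np.array([[c, s], [-s, c]])   # rotation mapping (a, b) to (r, 0)
            R[[i - 1, i], :] = G @ R[[i - 1, i], :]
            Q[:, [i - 1, i]] = Q[:, [i - 1, i]] @ G.T
    return Q, R
```

Each rotation touches only two rows, which is why this formulation maps well onto hardware arrays of identical rotation cells.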

Book Chapter

Conjugate gradient bundle adjustment

Martin Byröd¹, Kalle Åström¹•Institutions (1)

Lund University¹

05 Sep 2010

TL;DR: This work improves on the latest published approaches to bundle adjustment with conjugate gradients by making full use of the least squares nature of the problem and shows how a certain property of the preconditioned system allows us to reduce the work per iteration to roughly half of the standard CG algorithm.

Abstract: Bundle adjustment for multi-view reconstruction is traditionally done using the Levenberg-Marquardt algorithm with a direct linear solver, which is computationally very expensive. An alternative to this approach is to apply the conjugate gradients algorithm in the inner loop. This is appealing since the main computational step of the CG algorithm involves only a simple matrix-vector multiplication with the Jacobian. In this work we improve on the latest published approaches to bundle adjustment with conjugate gradients by making full use of the least squares nature of the problem. We employ an easy-to-compute QR factorization based block preconditioner and show how a certain property of the preconditioned system allows us to reduce the work per iteration to roughly half of the standard CG algorithm.

94 citations
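
As an illustration of a QR-factorization-based block preconditioner for conjugate gradients on a least-squares problem, the sketch below builds a block-diagonal preconditioner from the R factors of column blocks of the Jacobian and then runs CGLS on the preconditioned system. This is a dense toy version under assumptions, not the paper's method: the function name `solve_ls_block_qr_pcg` and the block layout are hypothetical, and the reduction of work per iteration reported in the abstract is not implemented.

```python
import numpy as np

def solve_ls_block_qr_pcg(J, b, blocks, iters=50, tol=1e-12):
    """Hypothetical sketch: minimize ||J x - b|| with CGLS, right-preconditioned
    by M = blockdiag(R_1, ..., R_k), where J[:, block_i] = Q_i R_i."""
    n = J.shape[1]
    M = np.zeros((n, n))
    for cols in blocks:
        # R factor of each column block (each block needs full column rank).
        M[np.ix_(cols, cols)] = np.linalg.qr(J[:, cols], mode="r")
    A = np.linalg.solve(M.T, J.T).T       # A = J @ inv(M), formed densely for clarity
    y = np.zeros(n)
    r = np.array(b, dtype=float)
    s = A.T @ r
    p = s.copy()
    gamma = s @ s
    gamma0 = gamma
    for _ in range(iters):
        if gamma <= tol**2 * gamma0:
            break                         # normal-equation residual is small enough
        q = A @ p
        alpha = gamma / (q @ q)
        y += alpha * p
        r -= alpha * q
        s = A.T @ r
        gamma_new = s @ s
        p = s + (gamma_new / gamma) * p
        gamma = gamma_new
    return np.linalg.solve(M, y)          # map back: x = inv(M) y
```

In a real bundle adjustment problem the Jacobian is sparse and the preconditioner would be applied through triangular solves rather than by forming `J @ inv(M)` explicitly.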

Journal Article

Efficient Implementations of the Generalized Lasso Dual Path Algorithm

Taylor Arnold, Ryan J. Tibshirani

09 Mar 2016-Journal of Computational and Graphical Statistics

TL;DR: This paper considers efficient implementations of the generalized lasso dual path algorithm given by Tibshirani and Taylor in 2011 and describes a generic approach that covers any penalty matrix D and any (full column rank) matrix X of predictor variables.

Abstract: We consider efficient implementations of the generalized lasso dual path algorithm given by Tibshirani and Taylor in 2011. We first describe a generic approach that covers any penalty matrix D and any (full column rank) matrix X of predictor variables. We then describe fast implementations for the special cases of trend filtering problems, fused lasso problems, and sparse fused lasso problems, both with X = I and a general matrix X. These specialized implementations offer a considerable improvement over the generic implementation, both in terms of numerical stability and efficiency of the solution path computation. These algorithms are all available for use in the genlasso R package, which can be found in the CRAN repository.

94 citations

Metrics

Papers: 3,607

Citations: 106,604

No. of papers in the topic in previous years
Year	Papers
2023	31
2022	73
2021	90
2020	132
2019	126
2018	139