scispace - formally typeset
Search or ask a question
Topic

QR decomposition

About: QR decomposition is a research topic. Over the lifetime, 3504 publications have been published within this topic receiving 100599 citations. The topic is also known as: QR factorization.


Papers
More filters
Journal ArticleDOI
TL;DR: Numerical experiments show that the new methods for numerical integration of ODEs on the orthogonal Stiefel manifold are superior to standard Lie group methods with respect to arithmetic complexity, and may be more reliable than projection methods, owing to their intrinsic nature.
Abstract: Numerical integration of ODEs on the orthogonal Stiefel manifold is considered. Points on this manifold are represented as n × k matrices with orthonormal columns, of particular interest is the case when $n\gg k$. Mainly two requirements are imposed on the integration schemes. First, they should have arithmetic complexity of order nk2. Second, they should be intrinsic in the sense that they require only the ODE vector field to be defined on the Stiefel manifold, as opposed to, for instance, projection methods. The design of the methods makes use of retractions maps. Two algorithms are proposed, one where the retraction map is based on the QR decomposition of a matrix, and one where it is based on the polar decomposition. Numerical experiments show that the new methods are superior to standard Lie group methods with respect to arithmetic complexity, and may be more reliable than projection methods, owing to their intrinsic nature.

37 citations

Book ChapterDOI
18 Aug 1996
TL;DR: An hierarchical approach for design of performance models for parallel algorithms in linear algebra based on a parallel machine model and the hierarchical structure of the ScaLAPACK library is presented.
Abstract: Performance models are important in the design and analysis of linear algebra software for scalable high performance computer systems. They can be used for estimation of the overhead in a parallel algorithm and measuring the impact of machine characteristics and block sizes on the execution time. We present an hierarchical approach for design of performance models for parallel algorithms in linear algebra based on a parallel machine model and the hierarchical structure of the ScaLAPACK library. This suggests three levels of performance models corresponding to existing ScaLAPACK routines. As a proof of the concept a performance model of the high level QR factorization routine pdgeqrf is presented. We also derive performance models of lower level ScaLAPACK building blocks such as pdgeqr2, pdlarft, pdlarfb, pdlarfg, pdlarf, pdnrm2, and pdscal, which are used in the high level model for pdgeqrf. Predicted performance results are compared to measurements on an Intel Paragon XP/S system. The accuracy of the top level model is over 90% for measured matrix and block sizes and different process grid configurations.

37 citations

30 Nov 2006
TL;DR: This report collects the flop count expressions for both real and complex kernels and also presents brief outlines of the derivations for the flip count expressions.
Abstract: : In the course of designing or evaluating signal processing algorithms, one often must determine the computational workload needed to implement the algorithms on a digital computer. The floating-point operation (flop) counts for real versions of the most common signal processing kernels are well documented. However, the flop counts for kernels operating on complex inputs are not as readily found. This report collects the flop count expressions for both real and complex kernels and also presents brief outlines of the derivations for the flop count expressions. Specifically, the following computational kernels are addressed: (1) the dimensions of the two multiplicands (m x n and n x p) for the matrix-matrix multiplication; (2) the length of the vector n for the fast Fourier transform; (3) the size of the triangular system n for forward and back substitutions; (4) the dimensions of the input matrix m x n for the Householder QR decomposition, eigenvalue decomposition, and singular value decomposition.

37 citations

Journal ArticleDOI
TL;DR: An optimal algorithm to perform the parallel QR decomposition of a dense matrix of size N is proposed and it is deduced that the complexity of such a decomposition is asymptotically 2.
Abstract: An optimal algorithm to perform the parallel QR decomposition of a dense matrix of size N is proposed. It is deduced that the complexity of such a decomposition is asymptotically 2N, when an unlimited number of processors is available.

37 citations

Journal ArticleDOI
TL;DR: An a posteriori error bound is derived that gives useful estimates of the relative accuracy of the computed singular values and it is shown that high relative accuracy in the computed SVD can be expected for matrices that are diagonal scalings of a well-conditioned matrix.

37 citations


Network Information
Related Topics (5)
Optimization problem
96.4K papers, 2.1M citations
85% related
Network packet
159.7K papers, 2.2M citations
84% related
Robustness (computer science)
94.7K papers, 1.6M citations
83% related
Wireless network
122.5K papers, 2.1M citations
83% related
Wireless sensor network
142K papers, 2.4M citations
82% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202331
202273
202190
2020132
2019126
2018139