A parameterized ordering for cache-, register- and pipeline-efficient Givens QR decomposition (Q1284142)

scientific article; zbMATH DE number 1271706

Language	Label	Description	Also known as
default for all languages	No label defined
English	A parameterized ordering for cache-, register- and pipeline-efficient Givens QR decomposition	scientific article; zbMATH DE number 1271706

Statements

instance of

scholarly article

0 references

title

A parameterized ordering for cache-, register- and pipeline-efficient Givens QR decomposition (English)

0 references

0 references

0 references

Advances in Computational Mathematics

0 references

publication date

30 March 1999

0 references

review text

A number of methods are known in the literature to calculate the QR decomposition of a matrix. The authors start with the basic Givens algorithm and construct a dependency graph such that any valid ordering through that graph yields an algorithm which is numerically identical to the original. Then they develop parametrized orderings through the graph, motivated by a simple model of a superscalar processor with a cache, registers, and a pipeline. The final algorithm simultaneously minimizes the number of loads and stores, while not stalling the floating point pipeline.

0 references

zbMATH Keywords

QR decomposition

0 references

Givens algorithm

0 references

graph

0 references

ordering

0 references

superscalar processor

0 references