A parameterized ordering for cache-, register- and pipeline-efficient Givens QR decomposition (Q1284142)
From MaRDI portal
scientific article; zbMATH DE number 1271706
Statements
30 March 1999
Several methods for computing the QR decomposition of a matrix are known in the literature. The authors start with the basic Givens algorithm and construct a dependency graph such that any valid ordering through that graph yields an algorithm that is numerically identical to the original. They then develop parameterized orderings through the graph, motivated by a simple model of a superscalar processor with a cache, registers, and a pipeline. The final algorithm minimizes the number of loads and stores without stalling the floating-point pipeline.
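For orientation, the basic Givens algorithm that the review takes as its starting point can be sketched as follows. This is a minimal illustration only, not the authors' parameterized ordering: the function name `givens_qr`, the list-of-rows matrix layout, and the fixed column-by-column, bottom-up elimination order are all assumptions made for this example.

```python
import math


def givens_qr(a):
    """Reduce a dense matrix (list of rows) to upper-triangular form.

    Hypothetical helper for illustration. Returns (r, rotations), where
    rotations lists each applied rotation as (row_p, row_q, c, s).
    """
    r = [row[:] for row in a]            # work on a copy
    m, n = len(r), len(r[0])
    rotations = []
    for j in range(min(m - 1, n)):       # columns, left to right
        for i in range(m - 1, j, -1):    # eliminate entries bottom-up
            x, y = r[i - 1][j], r[i][j]
            h = math.hypot(x, y)
            if h == 0.0:
                continue                 # nothing to eliminate
            c, s = x / h, y / h
            # Rotate rows i-1 and i; columns left of j are already zero.
            for k in range(j, n):
                u, v = r[i - 1][k], r[i][k]
                r[i - 1][k] = c * u + s * v
                r[i][k] = -s * u + c * v
            r[i][j] = 0.0                # remove rounding residue
            rotations.append((i - 1, i, c, s))
    return r, rotations


if __name__ == "__main__":
    a = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    r, rotations = givens_qr(a)
    for row in r:
        print(row)
```

Each rotation touches only two rows, so a rotation depends only on earlier rotations that wrote to those rows. That is the kind of dependency the review's graph captures; the loop order above is just one valid traversal.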
QR decomposition
Givens algorithm
graph
ordering
superscalar processor