A parameterized ordering for cache-, register- and pipeline-efficient Givens QR decomposition (Q1284142)

From MaRDI portal





scientific article; zbMATH DE number 1271706
Language Label Description Also known as
default for all languages
No label defined
    English
    A parameterized ordering for cache-, register- and pipeline-efficient Givens QR decomposition
    scientific article; zbMATH DE number 1271706

      Statements

      A parameterized ordering for cache-, register- and pipeline-efficient Givens QR decomposition (English)
      0 references
      0 references
      0 references
      30 March 1999
      0 references
      A number of methods are known in the literature to calculate the QR decomposition of a matrix. The authors start with the basic Givens algorithm and construct a dependency graph such that any valid ordering through that graph yields an algorithm which is numerically identical to the original. Then they develop parametrized orderings through the graph, motivated by a simple model of a superscalar processor with a cache, registers, and a pipeline. The final algorithm simultaneously minimizes the number of loads and stores, while not stalling the floating point pipeline.
      0 references
      QR decomposition
      0 references
      Givens algorithm
      0 references
      graph
      0 references
      ordering
      0 references
      superscalar processor
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references