The Padé iterations for the matrix sign function and their reciprocals are optimal (Q649549)

From MaRDI portal





scientific article
Language Label Description Also known as
default for all languages
No label defined
    English
    The Padé iterations for the matrix sign function and their reciprocals are optimal
    scientific article

      Statements

      The Padé iterations for the matrix sign function and their reciprocals are optimal (English)
      0 references
      0 references
      0 references
      0 references
      2 December 2011
      0 references
      A common way to compute the matrix sign function is through rational iterations of the form \(z_{k+1} = \phi(z_k)\), for some rational function \(\phi(z)\) having attractive fixed points at \(1\) and \(-1\). One family of such rational functions is the \textit{Padé family of iterations}, defined as follows. Given \(z\), a variable representing a complex number, and \(\xi = 1-z^2\), we have: \[ \text{sign}(z) = \frac{z}{(z^2)^{1/2}} = \frac{z}{(1-\xi)^{1/2}} = zf(\xi), \] where \(f(\xi) = (1-\xi)^{-1/2}\). Letting \(\frac{P_{m,n}(\xi)}{Q_{m,n}(\xi)}\) be the \((m,n)\) Padé approximant to \(f(\xi)\) (\(m,n\) non-negative integers such that \(m+n\geq 1\)), the iteration \[ z_{k+1} = \frac{z_kP_{(m,n)}(1-z_k^2)}{Q_(m,n)(1-z_k^2)} =: \phi_{2m+1,2n}(z), \] where \(P_{(m,n)}(z)\) and \(Q_{(m,n)}(z)\) are polynomials of degrees \(2m+1\) and \(2n\), respectively, has been proven to be locally convergent to \(1\) and \(-1\) with order of convergence \(m+n+1\), for \(m \geq n-1\). Iterations based on the reciprocal behave similarly. Given an integer \(s>1\), and allowing non-negative integers \(m,n\) to vary over all values such that \(m+n=2s-1\), the family \(z_{k+1} = \phi_{(m,n)}(z_k)\) is the union of the Padé family and its reciprocal family. The authors prove that the functions \(\phi_{(m,n)}\) define the unique rational iterations having the lowest sum of the degrees of the numerator and the denominator among those rational iterations converging locally to \(1\) and \(-1\) with order at least \(s\). While not guaranteeing that such iterations can be evaluated with minimal cost, this property is nonetheless important for computational efficiency since, in the generic case, using Horner's scheme, the number of arithmetic operations required to evaluate \(\frac{a(z)}{b(z)}\) is directly related to the sum of the degrees of \(a\) and \(b\).
      0 references
      rational iterations
      0 references
      matrix functions
      0 references
      matrix sign function
      0 references
      local convergence
      0 references
      Padé approximation
      0 references
      root-finding algorithm
      0 references
      Newton's method
      0 references
      Newton-Schulz iteration
      0 references
      Halley's method
      0 references
      computational efficiency
      0 references
      Horner's scheme
      0 references
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references