The Padé iterations for the matrix sign function and their reciprocals are optimal (Q649549)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | The Padé iterations for the matrix sign function and their reciprocals are optimal |
scientific article |
Statements
The Padé iterations for the matrix sign function and their reciprocals are optimal (English)
0 references
2 December 2011
0 references
A common way to compute the matrix sign function is through rational iterations of the form \(z_{k+1} = \phi(z_k)\), for some rational function \(\phi(z)\) having attractive fixed points at \(1\) and \(-1\). One family of such rational functions is the \textit{Padé family of iterations}, defined as follows. Given \(z\), a variable representing a complex number, and \(\xi = 1-z^2\), we have: \[ \text{sign}(z) = \frac{z}{(z^2)^{1/2}} = \frac{z}{(1-\xi)^{1/2}} = zf(\xi), \] where \(f(\xi) = (1-\xi)^{-1/2}\). Letting \(\frac{P_{m,n}(\xi)}{Q_{m,n}(\xi)}\) be the \((m,n)\) Padé approximant to \(f(\xi)\) (\(m,n\) non-negative integers such that \(m+n\geq 1\)), the iteration \[ z_{k+1} = \frac{z_kP_{(m,n)}(1-z_k^2)}{Q_(m,n)(1-z_k^2)} =: \phi_{2m+1,2n}(z), \] where \(P_{(m,n)}(z)\) and \(Q_{(m,n)}(z)\) are polynomials of degrees \(2m+1\) and \(2n\), respectively, has been proven to be locally convergent to \(1\) and \(-1\) with order of convergence \(m+n+1\), for \(m \geq n-1\). Iterations based on the reciprocal behave similarly. Given an integer \(s>1\), and allowing non-negative integers \(m,n\) to vary over all values such that \(m+n=2s-1\), the family \(z_{k+1} = \phi_{(m,n)}(z_k)\) is the union of the Padé family and its reciprocal family. The authors prove that the functions \(\phi_{(m,n)}\) define the unique rational iterations having the lowest sum of the degrees of the numerator and the denominator among those rational iterations converging locally to \(1\) and \(-1\) with order at least \(s\). While not guaranteeing that such iterations can be evaluated with minimal cost, this property is nonetheless important for computational efficiency since, in the generic case, using Horner's scheme, the number of arithmetic operations required to evaluate \(\frac{a(z)}{b(z)}\) is directly related to the sum of the degrees of \(a\) and \(b\).
0 references
rational iterations
0 references
matrix functions
0 references
matrix sign function
0 references
local convergence
0 references
Padé approximation
0 references
root-finding algorithm
0 references
Newton's method
0 references
Newton-Schulz iteration
0 references
Halley's method
0 references
computational efficiency
0 references
Horner's scheme
0 references