The convergence behavior of Ritz values in the presence of close eigenvalues (Q1095582)
scientific article
Statements
1987
Let \(A\) be a symmetric \(n\times n\) matrix, \(w\) a vector in \(E^n\), and \(K_i=\operatorname{span}(w,Aw,\dots,A^{i-1}w)\subset E^n\). Let \(T_i\) denote the orthogonal projection onto \(K_i\) and \(A|_{K_i}\) the restriction of \(A\) to \(K_i\). Then, as is well known, the eigenvalues \(\theta_1^{(i)},\dots,\theta_i^{(i)}\) of the matrix \(A_i=T_iA|_{K_i}\), called Ritz values of degree \(i\), approximate the corresponding eigenvalues \(\lambda_1,\dots,\lambda_n\) of \(A\) and converge to them as \(i\) increases. However, when \(A\) has almost multiple eigenvalues, the convergence behavior of the Ritz values can be "rather bizarre". The desire to analyze this phenomenon theoretically motivated the present paper.

The authors start with a numerical experiment corresponding to the case in which an almost double eigenvalue occurs at the beginning of the spectrum. One then observes that \(\theta_1^{(i)}\) first seems to converge to the cluster, stays in its immediate vicinity for quite a while, moving in very small steps, and then starts moving again, quite fast, toward \(\lambda_1\). Meanwhile, \(\theta_2^{(i)}\) first seems to converge to \(\lambda_3\) and reaches its immediate vicinity, but at about the same time that \(\theta_1^{(i)}\) resumes its march to \(\lambda_1\), \(\theta_2^{(i)}\) sets off toward \(\lambda_2\), and quite soon afterwards we have \(\theta_1^{(i)}-\lambda_1\approx \theta_2^{(i)}-\lambda_2\). As soon as \(\theta_2^{(i)}\) leaves the vicinity of \(\lambda_3\), its place is taken by \(\theta_3^{(i)}\).

Sections 7-10 are devoted to explaining these observations theoretically, to finding quantitative estimates for the local behavior of the rate of convergence, and to comparing the results with experimental observations. Since the known Kaniel-Paige-Saad a priori upper bounds do not reflect the local convergence behavior, another approach is needed. All the new results are based on the orthogonality of the so-called Ritz polynomials. The definition and properties of these polynomials and the minimum properties of Ritz values are given in Sec. 5. However, the authors' approach cannot replace that of Kaniel-Paige-Saad, since it lacks the a priori character of the latter. Finally, in Sec. 12, a theorem is given to explain the "superlinear convergence" of Ritz values, a phenomenon that occurs for large values of \(i\). Every section contains a long discussion.
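The kind of experiment described in the review can be sketched as follows. This is only an illustration under stated assumptions, not the authors' actual test problem: the diagonal matrix, its spectrum (an almost double eigenvalue \(0, 0.01\) followed by 48 well separated values), the all-ones starting vector, the ascending ordering of eigenvalues and Ritz values, and the helper name `ritz_values` are all chosen here for convenience. The Ritz values of degree \(i\) are computed as the eigenvalues of \(Q_i^T A Q_i\), where the columns of \(Q_i\) form an orthonormal basis of \(K_i\) built with full reorthogonalization.

```python
import numpy as np


def ritz_values(A, w, m):
    """Return a list whose (i-1)-th entry holds the i Ritz values (ascending)
    of the symmetric matrix A on K_i = span(w, Aw, ..., A^{i-1} w), i = 1..m.

    Assumes the Krylov subspace does not become invariant before step m
    (no breakdown), which holds for the illustrative data below.
    """
    n = A.shape[0]
    Q = np.zeros((n, m))
    Q[:, 0] = w / np.linalg.norm(w)
    out = []
    for i in range(1, m + 1):
        Qi = Q[:, :i]  # orthonormal basis of K_i
        # Matrix of the projected operator T_i A restricted to K_i.
        out.append(np.linalg.eigvalsh(Qi.T @ A @ Qi))
        if i < m:
            v = A @ Q[:, i - 1]
            # Two Gram-Schmidt passes against all previous basis vectors.
            for _ in range(2):
                v = v - Qi @ (Qi.T @ v)
            Q[:, i] = v / np.linalg.norm(v)
    return out


# Hypothetical test spectrum: almost double eigenvalue at the low end.
lam = np.concatenate(([0.0, 0.01], np.linspace(1.0, 10.0, 48)))
A = np.diag(lam)
w = np.ones(lam.size)
m = 30

thetas = ritz_values(A, w, m)

# Errors of the two smallest Ritz values against lambda_1 and lambda_2.
for i in range(2, m + 1, 4):
    t = thetas[i - 1]
    print(f"i={i:2d}  theta1-lam1={t[0] - lam[0]:.3e}  theta2-lam2={t[1] - lam[1]:.3e}")
```

Printing the two error columns side by side makes it possible to look for the plateau of \(\theta_1^{(i)}\) near the cluster and the temporary stay of \(\theta_2^{(i)}\) near \(\lambda_3\); how pronounced these stages are depends on the assumed gap and starting vector.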
close eigenvalues
symmetric matrices
Ritz values
numerical experiment
double eigenvalue
rate of convergence
Kaniel-Paige-Saad a priori upper bounds
Ritz polynomials
superlinear convergence