Extrema of sums of heterogeneous quadratic forms (Q1375540)
Revision as of 09:07, 28 May 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Extrema of sums of heterogeneous quadratic forms | scientific article | |
Statements
Extrema of sums of heterogeneous quadratic forms (English)
24 August 1998
This extensive paper addresses the following problem, which arises in various settings in multivariate statistical analysis: find the maximum of the quadratic functional \[ f_{\text{quad}}(X)=\sum^k_{i=1} x^T_iA_ix_i \] under the constraint that the vectors \(x_1, \dots, x_k\in \mathbb{R}^n\) form an orthonormal system, where \(A_1, \dots, A_k\) \((k\leq n)\) are given symmetric positive definite \(n\times n\) matrices and the \(n\times k\) matrix \(X=[x_1, \dots, x_k]\) has these vectors as its columns. The set of orthonormal \(k\)-tuples in \(\mathbb{R}^n\) is called a Stiefel manifold and is denoted by \(V_{n,k}\); one writes \(X\in V_{n,k}\) when the columns of the \(n\times k\) matrix \(X\) form an element of \(V_{n,k}\). Since \(V_{n,k}\) is a compact manifold and \(f_{\text{quad}}\) is continuous on \(V_{n,k}\), a finite global maximum exists and is attained at some point. To characterize the critical points of the functional, let \(A(X)= [A_1x_1, \dots, A_kx_k]\) denote the \(n\times k\) matrix whose \(i\)th column is \(A_ix_i\). The main result is the following theorem. Theorem 3.1. \(X\in V_{n,k}\) is a critical point of \(f_{\text{quad}}\) if and only if \(A(X)=XS\), where \(S\) is a symmetric \(k\times k\) matrix. At an arbitrary critical point the matrix \(S=X^TA(X)\) is not necessarily positive semidefinite. The next lemma shows that positive semidefiniteness is necessary at the global maximum points of \(f_{\text{quad}}\). Lemma 3.1. If \(X\in V_{n,k}\) is a global maximum of the functional \(f_{\text{quad}}\), then the corresponding matrix \(S=X^T A(X)\) is positive semidefinite. Theorem 3.1 and Lemma 3.1 together imply that at global maximum points \(A(X)=XS\), where \(S\) is a positive semidefinite matrix. In addition, an iterative computational algorithm is proposed.
Starting from an arbitrary initial orthonormal system \(X^{(0)}= [x_1^{(0)}, \dots, x_k^{(0)}]\in V_{n,k}\), a sequence \(X^{(1)}, X^{(2)}, \dots\) in \(V_{n,k}\) is constructed as follows: from the \(m\)th element \(X^{(m)} =[x_1^{(m)}, \dots, x_k^{(m)}]\in V_{n,k}\), the next one is obtained by a polar decomposition of the matrix \(A(X^{(m)})\), namely \(A(X^{(m)})= X^{(m+1)} S^{(m+1)}\) \((m=0,1, \dots)\), where \((X^{(m+1)})^T X^{(m+1)} =I_k\) and \(S^{(m+1)}\geq 0\), that is, \(S^{(m+1)}\) is symmetric positive semidefinite. This polar decomposition is unique if \(A_1x_1^{(m)}, \dots, A_kx_k^{(m)}\) are linearly independent. It is proved that the sequence \(f_{\text{quad}} (X^{(m)})\) is nondecreasing and that \(\text{dist} (X^{(m)}, {\mathcal C})\to 0\) as \(m\to\infty\), where \({\mathcal C}= \{X\in V_{n,k}: X \text{ is a critical point of } f_{\text{quad}}\}\). When \(f_{\text{quad}}\) has isolated critical points, the algorithm converges to one of them. The critical points of the quadratic functional are also considered under some assumed relations between the matrices \(A_1, \dots, A_k\). Finally, structural properties of the functionals \(f_{\text{quad}}\) and \[ f_{\text{bilin}} (X,Y)= \sum^k_{i=1} y^T_iA_ix_i, \] where \(x_1, \dots, x_k\) and \(y_1, \dots, y_k\) are orthonormal vectors, are discussed.
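The iteration described in the review can be sketched numerically as follows. This is only an illustrative sketch in Python with NumPy, not the paper's own implementation. It assumes the polar factor is computed from a thin singular value decomposition, uses randomly generated symmetric positive definite test matrices, and stops after a fixed number of steps. All function names (`a_of_x`, `f_quad`, `polar_step`, `iterate`, `critical_residual`, `random_spd`) are hypothetical and chosen for this example.

```python
import numpy as np

def a_of_x(mats, X):
    """Return A(X) = [A_1 x_1, ..., A_k x_k] as an n-by-k array."""
    return np.column_stack([A @ X[:, i] for i, A in enumerate(mats)])

def f_quad(mats, X):
    """Evaluate f_quad(X) = sum_i x_i^T A_i x_i."""
    return float(np.sum(X * a_of_x(mats, X)))

def polar_step(mats, X):
    """One step: polar decomposition A(X) = X_next @ S_next.

    Assumption of this sketch: the polar factors are taken from the
    thin SVD A(X) = U diag(s) V^T, giving X_next = U V^T and
    S_next = V diag(s) V^T.
    """
    U, s, Vt = np.linalg.svd(a_of_x(mats, X), full_matrices=False)
    X_next = U @ Vt                    # orthonormal columns
    S_next = Vt.T @ np.diag(s) @ Vt    # symmetric positive semidefinite
    return X_next, S_next

def iterate(mats, X0, steps=200):
    """Run a fixed number of steps and record f_quad along the way."""
    X = X0
    values = [f_quad(mats, X)]
    for _ in range(steps):
        X, _ = polar_step(mats, X)
        values.append(f_quad(mats, X))
    return X, values

def critical_residual(mats, X):
    """Return ||A(X) - X S|| and ||S - S^T|| for S = X^T A(X).

    Both are zero exactly when the condition of Theorem 3.1 holds.
    """
    AX = a_of_x(mats, X)
    S = X.T @ AX
    return np.linalg.norm(AX - X @ S), np.linalg.norm(S - S.T)

def random_spd(rng, n):
    """Hypothetical test data: a random symmetric positive definite matrix."""
    M = rng.standard_normal((n, n))
    return M @ M.T + n * np.eye(n)

# Small example with n = 6, k = 3 (sizes chosen arbitrarily).
rng = np.random.default_rng(0)
n, k = 6, 3
mats = [random_spd(rng, n) for _ in range(k)]
X0, _ = np.linalg.qr(rng.standard_normal((n, k)))  # random point of V_{n,k}
X, values = iterate(mats, X0)

print("f_quad at start and end:", values[0], values[-1])
print("critical-point residuals:", critical_residual(mats, X))
```

The recorded `values` should be nondecreasing up to rounding, in line with the monotonicity result quoted above. The residuals returned by `critical_residual` indicate how closely the final iterate satisfies \(A(X)=XS\) with symmetric \(S\); how small they become after a fixed number of steps depends on the test matrices.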
convergence
multivariate statistical analysis
maximum
quadratic functional
Stiefel manifold
critical points
iteration algorithm
polar decomposition