Efficient parallel factorization and solution of structured and unstructured linear systems (Q2486566)

scientific article

Language	Label	Description	Also known as
English	Efficient parallel factorization and solution of structured and unstructured linear systems	scientific article

Statements

instance of

scholarly article

0 references

title

Efficient parallel factorization and solution of structured and unstructured linear systems (English)

0 references

published in

Journal of Computer and System Sciences

0 references

publication date

5 August 2005

0 references

review text

The paper provides efficient parallel algorithms for exactly factoring some classes of \(n\)-dimensional symmetric positive definite matrices (SPD) in time \(O(\text{log}^2\,\,n)\) with a near optimal number of processors. A parallel random access machine (PRAM) model of parallel computation with unit cost arithmetic operations including division over a finite field is supposed. Prior work did not generally require unit cost division over a finite field. The considered input matrices have entries that are rational numbers given as a ratio of integers with at most a polynomial number of bits \(\beta\). Only bit precision \(O(n(\beta +\text{log n}))\) is required. It is the asymptotically optimal bit precision for \(\beta\geq\text{log}\,\,n\) since the determinant, exact \(LU\) factorization, and matrix inverse require bit precision at least \(\Omega (n\beta)\). The algorithms are randomized. They give outputs within the stated bounds with high likelihood \(\geq 1-{1 \over n^{\Omega (1)}}\) using a constant number of random variables ranging over a domain of size \((n\| A\| )^{O(1)}\). Recursive factorization (RF) of SPD matrices is computed using the Newton's iteration, the Newton-Hensel lifting, and the variable diagonal technique. These techniques are extended into the multilevel pipelined framework using the generalization of the stream contraction method by \textit{V. Pan} and \textit{J. Reif} [Inf. Process. Lett. 40, 79--83 (1991; Zbl 0748.68025)]. \(LU\) and \(QR\) factorizations for dense matrices and \(LU\) factorizations for sparse matrices which are \(s(n)\)-separable are presented. They reduce the known parallel time bounds from \(\Omega (\log^3 n)\) to \(O(\text{log}^2n)\) without increase of processors. The algorithms are further specialized to structured matrices. \(LU\) factorizations for Toeplitz matrices and matrices of bounded displacement rank in time \(O(\text{log}^2\,\,n)\) using \(P(n)\) processors are developed. Here \(P(n)\) denotes the number of arithmetic processors used to multiply two polynomials of degree \(n\) in \(O(\text{log}\,n)\) parallel time. Thus the processor reduction from \(n^2\) to \((P(n)\) is achieved. These results are applied with the same parallel time and processor bound to solve the problems of polynomial resultant, Padé approximants of rational functions, and with a factor \(O(\text{log}\,n)\) more time polynomial greatest common divisors (GCD) and extended GCD.

0 references

zbMATH Keywords

parallel algorithms

0 references

linear systems

0 references

\(LU\) factorization

0 references

dense matrices

0 references

sparse matrices

0 references