Programming matrix algorithms-by-blocks for thread-level parallelism
Publication:2989071
DOI: 10.1145/1527286.1527288; zbMath: 1364.65105; OpenAlex: W1986834688; Wikidata: Q113310536; Scholia: Q113310536; MaRDI QID: Q2989071
Ernie Chan, Gregorio Quintana-Ortí, Enrique S. Quintana-Ortí, Field G. van Zee, Robert A. van de Geijn
Publication date: 19 May 2017
Published in: ACM Transactions on Mathematical Software
Full work available at URL: http://hdl.handle.net/10234/22583
Parallel numerical computation (65Y05); Numerical linear algebra (65F99); Numerical algorithms for specific classes of architectures (65Y10)
Related Items
Algorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architectures
Exploring large macromolecular functional motions on clusters of multicore processors
Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
Solving dense generalized eigenproblems on multi-threaded architectures
Parallel Matrix Multiplication: A Systematic Journey
Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD
Solving sequences of generalized least-squares problems on multi-threaded architectures
Householder QR Factorization With Randomization for Column Pivoting (HQRRP)
Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures
BLIS: A Framework for Rapidly Instantiating BLAS Functionality
A Parallel Sparse Direct Solver via Hierarchical DAG Scheduling
Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems
Uses Software