Programming matrix algorithms-by-blocks for thread-level parallelism
From MaRDI portal
Publication:2989071
DOI: 10.1145/1527286.1527288, zbMath: 1364.65105, OpenAlex: W1986834688, Wikidata: Q113310536, Scholia: Q113310536, MaRDI QID: Q2989071
Ernie Chan, Gregorio Quintana-Ortí, Enrique S. Quintana-Ortí, Field G. van Zee, Robert A. van de Geijn
Publication date: 19 May 2017
Published in: ACM Transactions on Mathematical Software
Full work available at URL: http://hdl.handle.net/10234/22583
Parallel numerical computation (65Y05) ⋮ Numerical linear algebra (65F99) ⋮ Numerical algorithms for specific classes of architectures (65Y10)
Related Items (12)
Algorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architectures ⋮ Exploring large macromolecular functional motions on clusters of multicore processors ⋮ Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing ⋮ Solving dense generalized eigenproblems on multi-threaded architectures ⋮ Parallel Matrix Multiplication: A Systematic Journey ⋮ Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD ⋮ Solving sequences of generalized least-squares problems on multi-threaded architectures ⋮ Householder QR Factorization With Randomization for Column Pivoting (HQRRP) ⋮ Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures ⋮ BLIS: A Framework for Rapidly Instantiating BLAS Functionality ⋮ A Parallel Sparse Direct Solver via Hierarchical DAG Scheduling ⋮ Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems