Programming matrix algorithms-by-blocks for thread-level parallelism

From MaRDI portal

Publication:2989071

Jump to:navigation, search

DOI10.1145/1527286.1527288zbMath1364.65105OpenAlexW1986834688WikidataQ113310536 ScholiaQ113310536MaRDI QIDQ2989071

Ernie Chan, Gregorio Quintana-Ortí, Enrique S. Quintana-Ortí, Field G. van Zee, Robert A. van de Geijn

Publication date: 19 May 2017

Published in: ACM Transactions on Mathematical Software (Search for Journal in Brave)

Full work available at URL: http://hdl.handle.net/10234/22583

zbMATH Keywords

linear algebra high-performance computing libraries multithreaded architectures

Mathematics Subject Classification ID

Parallel numerical computation (65Y05) Numerical linear algebra (65F99) Numerical algorithms for specific classes of architectures (65Y10)

Related Items

Algorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architectures, Exploring large macromolecular functional motions on clusters of multicore processors, Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing, Solving dense generalized eigenproblems on multi-threaded architectures, Parallel Matrix Multiplication: A Systematic Journey, Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD, Solving sequences of generalized least-squares problems on multi-threaded architectures, Householder QR Factorization With Randomization for Column Pivoting (HQRRP), Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures, BLIS: A Framework for Rapidly Instantiating BLAS Functionality, A Parallel Sparse Direct Solver via Hierarchical DAG Scheduling, Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems

Uses Software

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2989071&oldid=15999655"