Algorithm 784: GEMM-based level 3 BLAS
From MaRDI portal
Publication:4256917
DOI10.1145/292395.292426zbMath0930.65048OpenAlexW1988098298WikidataQ113310177 ScholiaQ113310177MaRDI QIDQ4256917
Per Ling, Charles F. Van Loan, Bo Kågström
Publication date: 9 February 2000
Published in: ACM Transactions on Mathematical Software (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1145/292395.292426
performanceparallelizationmemory hierarchyvectorizationblocked algorithmsGEMM-based level 3 BLASmatrix-matrix kernels
Parallel numerical computation (65Y05) Complexity and performance of numerical algorithms (65Y20) Direct numerical methods for linear systems and matrix inversion (65F05) Packaged methods for numerical algorithms (65Y15)
Related Items
Towards an accurate performance modeling of parallel sparse factorization, RECSY and SCASY Library Software: Recursive Blocked and Parallel Algorithms for Sylvester-Type Matrix Equations with Some Applications, CSDP, A C library for semidefinite programming
Uses Software