Accelerating numerical dense linear algebra calculations with GPUs
DOI: 10.1007/978-3-319-06548-9_1
zbMATH Open: 1317.65078
OpenAlex: W36826159
MaRDI QID: Q5261390
FDO: Q5261390
Authors: Mark Ralph Gates, A. Haidar, Jakub Kurzak, Piotr Luszczek, Ichitaro Yamazaki, Jack Dongarra, Stanimire Tomov
Publication date: 3 July 2015
Published in: Numerical Computations with GPUs
Full work available at URL: https://doi.org/10.1007/978-3-319-06548-9_1
Recommendations
- Accelerating GPU kernels for dense linear algebra
- Towards dense linear algebra for hybrid GPU accelerated manycore systems
- Linear algebra software for large-scale accelerated multicore computing
- Efficient batch LU and QR decomposition on GPU
- Divide and conquer on hybrid GPU-accelerated multicore systems
Direct numerical methods for linear systems and matrix inversion (65F05); Parallel numerical computation (65Y05); Numerical computation of eigenvalues and eigenvectors of matrices (65F15); Numerical solutions to overdetermined systems, pseudoinverses (65F20); Numerical algorithms for specific classes of architectures (65Y10)
Cited In (15)
- Simulating Low Precision Floating-Point Arithmetic
- Divide and conquer on hybrid GPU-accelerated multicore systems
- ELSI -- an open infrastructure for electronic structure solvers
- GPU-acceleration of the ELPA2 distributed eigensolver for dense symmetric and Hermitian eigenproblems
- Nambu-covariant many-body theory. I: Perturbative approximations
- A LAPACK implementation of the dynamic mode decomposition
- GPU acceleration of all-electron electronic structure theory using localized numeric atom-centered basis functions
- Efficient batch LU and QR decomposition on GPU
- Accelerating GPU kernels for dense linear algebra
- Generating extreme-scale matrices with specified singular values or condition number
- Exploiting lower precision arithmetic in solving symmetric positive definite linear systems and least squares problems
- Redesigning triangular dense matrix computations on GPUs
- Towards dense linear algebra for hybrid GPU accelerated manycore systems
- Achieving Native GPU Performance for Out-of-Card Large Dense Matrix Multiplication
- Algorithm 1019: A Task-based Multi-shift QR/QZ Algorithm with Aggressive Early Deflation