Optimizing sparse matrix-matrix multiplication for the GPU
DOI10.1145/2699470zbMATH Open1347.65085OpenAlexW1980282429WikidataQ113310260 ScholiaQ113310260MaRDI QIDQ2828151FDOQ2828151
Authors: Steven Dalton, Luke Olson, Nathan Bell
Publication date: 24 October 2016
Published in: ACM Transactions on Mathematical Software (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1145/2699470
Recommendations
- Memory-Efficient Sparse Matrix-Matrix Multiplication by Row Merging on Many-Core Architectures
- A novel multi-GPU parallel optimization model for the sparse matrix-vector multiplication
- Sparse matrix-vector multiplication on GPGPUs
- Sparse matrix-vector multiplication on NVIDIA GPU
- GPU-accelerated sparse matrix-matrix multiplication by iterative row merging
Computational methods for sparse matrices (65F50) Parallel numerical computation (65Y05) Numerical algorithms for specific classes of architectures (65Y10)
Cites Work
- The University of Florida sparse matrix collection
- Title not available (Why is that?)
- Sparse matrix multiplication package (SMMP)
- An overview of the Trilinos project
- Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy
- Maximum matchings in general graphs through randomization
- Two Fast Algorithms for Sparse Matrices: Multiplication and Permuted Transposition
- Parallel Sparse Matrix-Matrix Multiplication and Indexing: Implementation and Experiments
- Exposing fine-grained parallelism in algebraic multigrid methods
Cited In (21)
- HPMaX: heterogeneous parallel matrix multiplication using CPUs and GPUs
- On optimizing multiplications of sparse matrices
- Accelerating Iterative SpMV for the Discrete Logarithm Problem Using GPUs
- Exploiting multiple levels of parallelism in sparse matrix-matrix multiplication
- Cache friendly sparse matrix-vector multiplication
- Memory-Efficient Sparse Matrix-Matrix Multiplication by Row Merging on Many-Core Architectures
- A new class of AMG interpolation methods based on matrix-matrix multiplications
- KBLAS: an optimized library for dense matrix-vector multiplication on GPU accelerators
- A two-scale approach for efficient on-the-fly operator assembly in massively parallel high performance multigrid codes
- Randomized GPU Algorithms for the Construction of Hierarchical Matrices from Matrix-Vector Operations
- Efficient CSR-based sparse matrix-vector multiplication on GPU
- A novel multi-GPU parallel optimization model for the sparse matrix-vector multiplication
- GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
- Title not available (Why is that?)
- Generating optimized sparse matrix vector product over finite fields
- A Communication Optimization Scheme for Basis Computation of Krylov Subspace Methods on Multi-GPUs
- Reducing communication costs for sparse matrix multiplication within algebraic multigrid
- Effective minimally-invasive GPU acceleration of distributed sparse matrix factorization
- Redesigning triangular dense matrix computations on GPUs
- GPU-accelerated sparse matrix-matrix multiplication by iterative row merging
- Achieving Native GPU Performance for Out-of-Card Large Dense Matrix Multiplication
Uses Software
This page was built for publication: Optimizing sparse matrix-matrix multiplication for the GPU
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2828151)