ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures
From MaRDI portal
Publication:2830624
DOI10.1137/15M1026419zbMath1349.65740OpenAlexW2544527082WikidataQ62599811 ScholiaQ62599811MaRDI QIDQ2830624
Karl Rupp, Florian Rudolf, Philippe Tillet, Andreas Morhammer, Josef Weinbub, Siegfried Selberherr, T. Grasser, Ansgar Jüngel
Publication date: 28 October 2016
Published in: SIAM Journal on Scientific Computing (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1137/15m1026419
Computational methods for sparse matrices (65F50) Iterative numerical methods for linear systems (65F10) Packaged methods for numerical algorithms (65Y15) Numerical algorithms for specific classes of architectures (65Y10)
Related Items
Memory-Efficient Sparse Matrix-Matrix Multiplication by Row Merging on Many-Core Architectures, Focused wave interaction with a partially-immersed rectangular box using 2-D incompressible SPH on a GPU comparing with experiment and linear theory, FEMs -- a mechanics-oriented finite element modeling software, ViennaCL, Extreme Scale FMM-Accelerated Boundary Integral Equation Solver for Wave Scattering, ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures, High-performance processing of covariance matrices using GPU computations, Preparing sparse solvers for exascale computing, Operator Splitting and Finite Difference Schemes for Solving the EMI Model, An enhanced implicit viscosity ISPH method for simulating free-surface flow coupled with solid-liquid phase change
Uses Software
Cites Work
- Unnamed Item
- NETGEN: An advancing front 2D/3D-mesh generator based on abstract rules
- A GPU accelerated aggregation algebraic multigrid method
- Efficient transitive closure of sparse matrices over closed semirings
- ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures
- Programming CUDA and OpenCL: A Case Study Using Modern C++ Libraries
- A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units
- The university of Florida sparse matrix collection
- Exposing Fine-Grained Parallelism in Algebraic Multigrid Methods
- Parallel Sparse Matrix-Matrix Multiplication and Indexing: Implementation and Experiments
- GPU-Accelerated Sparse Matrix-Matrix Multiplication by Iterative Row Merging
- Graph Clustering Via a Discrete Uncoupling Process
- Hiding Global Communication Latency in the GMRES Algorithm on Massively Parallel Machines
- Fine-Grained Parallel Incomplete LU Factorization
- Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units