CUSPARSE
From MaRDI portal
Software:19903
swMATH7887MaRDI QIDQ19903FDOQ19903
Author name not available (Why is that?)
Cited In (50)
- A hierarchical parallel implementation for heterogeneous computing. Application to algebra-based CFD simulations on hybrid supercomputers
- Using a half-implicit integration scheme for the SPH-based solution of fluid-solid interaction problems
- The deal.II library, Version 9.1
- A Dynamic Pattern Factored Sparse Approximate Inverse Preconditioner on Graphics Processing Units
- The deal.II library, version 9.0
- A consistent multiphase flow model with a generalized particle shifting scheme resolved via incompressible SPH
- Acceleration strategies for explicit finite element analysis of metal powder-based additive manufacturing processes using graphical processing units
- The \texttt{deal.II} library, Version 9.3
- The \texttt{deal.II} library, version 9.4
- Programming CUDA and OpenCL: a case study using modern C++ libraries
- Accelerating SpMV multiplication in probabilistic model checkers using GPUs
- Auto-tuned Krylov methods on cluster of graphics processing unit
- A new sparse matrix vector multiplication graphics processing unit algorithm designed for finite element problems
- Efficient \(L_0\) resampling of point sets
- A parallel cyclic reduction algorithm for pentadiagonal systems with application to a convection-dominated Heston PDE
- Memory-Efficient Sparse Matrix-Matrix Multiplication by Row Merging on Many-Core Architectures
- GPU-accelerated preconditioned GMRES method for two-dimensional Maxwell's equations
- Sparse matrix-vector product
- A Feynman-Kac based numerical method for the exit time probability of a class of transport problems
- MPI-CUDA sparse matrix-vector multiplication for the conjugate gradient method with an approximate inverse preconditioner
- The deal.II library, version 9.2
- A parallel computing method using blocked format with optimal partitioning for SpMV on GPU
- Preconditioning Sparse Matrices with Alternating and Multiplicative Operator Splittings
- Low synchronization Gram–Schmidt and generalized minimal residual algorithms
- A data-parallel ILUPACK for sparse general and symmetric indefinite linear systems
- A parallel generalized relaxation method for high-performance image segmentation on GPUs
- A parallel algorithm for solving a partial eigenvalue problem for block-diagonal bordered matrices
- GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review
- Efficient CSR-based sparse matrix-vector multiplication on GPU
- A factored sparse approximate inverse preconditioned conjugate gradient solver on graphics processing units
- Updating incomplete factorization preconditioners for model order reduction
- Improved row-grouped CSR format for storing of sparse matrices on GPU
- Matrix-free GPU implementation of a preconditioned conjugate gradient solver for anisotropic elliptic PDEs
- Graph coloring using GPUs
- Sparse matrix-vector multiplication on GPGPUs
- Finite element integration on GPGPUs
- A task-scheduling approach for efficient sparse symmetric matrix-vector multiplication on a GPU
- Analysis of a splitting approach for the parallel solution of linear systems on GPU cards
- Incompressible SPH (ISPH) with fast Poisson solver on a GPU
- Portable implementation model for CFD simulations. Application to hybrid CPU/GPU supercomputers
- A guide for implementing tridiagonal solvers on GPUs
- Towards a parallel component in a GPU-CUDA environment: a case study with the L-BFGS Harwell routine
- AmgX: a library for GPU accelerated algebraic multigrid and preconditioned iterative methods
- GPU-accelerated sparse matrix-matrix multiplication by iterative row merging
- Cucheb: a GPU implementation of the filtered Lanczos procedure
- Sparse approximate inverse preconditioners on high performance GPU platforms
- Design and implementation of adaptive SpMV library for multicore and many-core architecture
- The eigenvalues slicing library (EVSL): algorithms, implementation, and software
- High-performance statistical computing in the computing environments of the 2020s
- Manycore algorithms for batch scalar and block tridiagonal solvers
This page was built for software: CUSPARSE