MAGMA
From MaRDI portal
MAGMA Q24666
Cited in
- A parallel auxiliary grid algebraic multigrid method for graphic processing units
- Algorithm 953: Parallel library software for the multishift QR algorithm with aggressive early deflation
- Scientific computations on multi-core systems using different programming frameworks
- Evaluation of selected resource allocation and scheduling methods in heterogeneous many-core processors and graphics processing units
- Parallel hierarchical hybrid linear solvers for emerging computing platforms
- An inertia-free filter line-search algorithm for large-scale nonlinear programming
- \(\mathcal H\)-LU factorization on many-core systems
- A parallel algorithm for calculation of determinants and minors using arbitrary precision arithmetic
- Exploiting symmetry in tensors for high performance: multiplication with symmetric tensors
- Solving a large-scale thermal radiation problem using an interoperable executive library framework on petascale supercomputers
- ViennaCL: linear algebra library for multi- and many-core architectures
- Particle filtering: the need for speed
- Extending the length and time scales of Gram-Schmidt Lyapunov vector computations
- Programming the finite element method
- FLAME
- PaStiX
- ScaLAPACK
- VOLSCAT
- LAWRA
- PLAPACK
- Algorithm 826
- CALU
- RScaLAPACK
- CUBLAS
- MKL
- Elemental
- POOCLAPACK
- OpenCL
- SOLAR
- Cellss
- SBR Toolbox
- PLASMA
- clSpMV
- Algorithm 880
- CULA
- LogGOPSim
- NaSt3DGPF
- MR3-SMP
- PLASMA
- HSL_MA87
- IEL
- STREAM benchmark
- HSL_MA79
- QUARK
- SpGEMM
- StarPU
- FastFlow
- CUMP
- GPUprec
- MPIGMP
- SWARM
- Wool
- MINMOD
- Algorithm 953
- KBLAS
- Tcmalloc
- yaSpMV
- AxiSEM
- Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
- DDSCAT
- PBLAS
- Algorithm 656
- CLBlast
- CLTune
- DAGuE
- SuperMatrix
- NFM-DS
- AdELL
- BiELL
- CoAdELL
- CSR5
- moderngpu
- PDHSEQR
- PDLAQR1
- gptk
- UHM
- GFortran
- KSVD
- LibSci
- Zippy
- Multi-GPU implementation of the lattice Boltzmann method
- H2Pack
- qr_mumps
- Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems
- Hiding global communication latency in the GMRES algorithm on massively parallel machines
- A Scalable High Performant Cholesky Factorization for Multicore with GPU Accelerators
- BLIS: a framework for rapidly instantiating BLAS functionality
- High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster
- PLASMA: Parallel linear algebra software for multicore using OpenMP
- Design of a multicore sparse Cholesky factorization using DAGs
- Interoperable executive library for the simulation of biomedical processes
- SPEX Left LU
- Parallel direct methods for solving the system of linear equations with pipelining on a multicore using OpenMP
- Computing least squares condition numbers on hybrid multicore/GPU systems
- A Set of Batched Basic Linear Algebra Subprograms and LAPACK Routines
- Redesigning triangular dense matrix computations on GPUs
- Accelerating the solution of linear systems by iterative refinement in three precisions
- Divide and conquer on hybrid GPU-accelerated multicore systems
- H2Opus: a distributed-memory multi-GPU software package for non-local operators
- Exact likelihood-free Markov chain Monte Carlo for elliptically contoured distributions