MAGMA - MaRDI portal

MaRDI QIDQ24666swMATHFDO

Official website http://icl.cs.utk.edu/magma/

Cited in

(only showing first 100 items - show all)

A parallel auxiliary grid algebraic multigrid method for graphic processing units
Algorithm 953: Parallel library software for the multishift QR algorithm with aggressive early deflation
Scientific computations on multi-core systems using different programming frameworks
Evaluation of selected resource allocation and scheduling methods in heterogeneous many-core processors and graphics processing units
Parallel hierarchical hybrid linear solvers for emerging computing platforms
An inertia-free filter line-search algorithm for large-scale nonlinear programming
\(\mathcal H\)-LU factorization on many-core systems
A parallel algorithm for calculation of determinants and minors using arbitrary precision arithmetic
Exploiting symmetry in tensors for high performance: multiplication with symmetric tensors
Solving a large-scale thermal radiation problem using an interoperable executive library framework on petascale supercomputers
ViennaCL-linear algebra library for multi- and many-core architectures
Particle filtering: the need for speed
Extending the length and time scales of Gram-Schmidt Lyapunov vector computations
Programming the finite element method
FLAME
PaStiX
ScaLAPACK
VOLSCAT
LAWRA
PLAPACK
Algorithm 826
CALU
RScaLAPACK
CUBLAS
MKL
Elemental
POOCLAPACK
OpenCL
SOLAR
Cellss
SBR Toolbox
PLASMA
clSpMV
Algorithm 880
CULA
LogGOPSim
NaSt3DGPF
MR3-SMP
PLASMA
HSL_MA87
IEL
STREAM benchmark
HSL_MA79
QUARK
SpGEMM
StarPU
FastFlow
CUMP
GPUprec
MPIGMP
SWARM
Wool
MINMOD
Algorithm 953
KBLAS
Tcmalloc
yaSpMV
AxiSEM
Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
DDSCAT
PBLAS
Algorithm 656
CLBlast
CLTune
DAGuE
SuperMatrix
NFM-DS
AdELL
BiELL
CoAdELL
CSR5
moderngpu
PDHSEQR
PDLAQR1
gptk
UHM
GFortran
KSVD
LibSci
Zippy
Multi-GPU implementation of the lattice Boltzmann method
H2Pack
qr_mumps
Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems
Hiding global communication latency in the GMRES algorithm on massively parallel machines
A Scalable High Performant Cholesky Factorization for Multicore with GPU Accelerators
BLIS: a framework for rapidly instantiating BLAS functionality
High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster
PLASMA: Parallel linear algebra software for multicore using OpenMP
Design of a multicore sparse Cholesky factorization using DAGs
Interoperable executive library for the simulation of biomedical processes
SPEX Left LU
Parallel direct methods for solving the system of linear equations with pipelining on a multicore using OpenMP
Computing least squares condition numbers on hybrid multicore/GPU systems
A Set of Batched Basic Linear Algebra Subprograms and LAPACK Routines
Redesigning triangular dense matrix computations on GPUs
Accelerating the solution of linear systems by iterative refinement in three precisions
Divide and conquer on hybrid GPU-accelerated multicore systems
H2Opus: a distributed-memory multi-GPU software package for non-local operators
Exact likelihood-free Markov chain Monte Carlo for elliptically contoured distributions

This page was built for software: MAGMA