The following pages link to MAGMA (Q24666):
Displaying 48 items.
- Scientific computations on multi-core systems using different programming frameworks (Q268844)
- Evaluation of selected resource allocation and scheduling methods in heterogeneous many-core processors and graphics processing units (Q275599)
- A parallel algorithm for calculation of determinants and minors using arbitrary precision arithmetic (Q285264)
- An inertia-free filter line-search algorithm for large-scale nonlinear programming (Q288393)
- Extending the length and time scales of Gram-Schmidt Lyapunov vector computations (Q347779)
- Multi-GPU implementation of the lattice Boltzmann method (Q356465)
- Parallel hierarchical hybrid linear solvers for emerging computing platforms (Q553205)
- Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing (Q608851)
- Particle filtering: the need for speed (Q623980)
- Parallel direct methods for solving the system of linear equations with pipelining on a multicore using OpenMP (Q645724)
- Exact likelihood-free Markov chain Monte Carlo for elliptically contoured distributions (Q906230)
- \(\mathcal H\)-LU factorization on many-core systems (Q1685030)
- Redesigning triangular dense matrix computations on GPUs (Q1693226)
- Hybrid algorithms for solving the algebraic eigenvalue problem with sparse matrices (Q1699408)
- Experiments with sparse Cholesky using a sequential task-flow implementation (Q1713223)
- Efficient determination of the Markovian time-evolution towards a steady-state of a complex open quantum system (Q1737434)
- The parallel tiled WZ factorization algorithm for multicore architectures (Q2299101)
- LU factorization on heterogeneous systems: an energy-efficient approach towards high performance (Q2403146)
- Numerical analysis of parallel implementation of the reorthogonalized ABS methods (Q2418160)
- Solving a large scale radiosity problem on GPU-based parallel computers (Q2517421)
- Interoperable executive library for the simulation of biomedical processes (Q2517440)
- High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster (Q2638279)
- H2Opus: a distributed-memory multi-GPU software package for non-local operators (Q2673500)
- BLIS: A Framework for Rapidly Instantiating BLAS Functionality (Q2828133)
- Algorithm 953 (Q2828159)
- An Efficient Multicore Implementation of a Novel HSS-Structured Multifrontal Solver Using Randomized Sampling (Q2830621)
- ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures (Q2830624)
- A Distributed and Incremental SVD Algorithm for Agglomerative Data Analysis on Large Networks (Q2834696)
- A Parallel Auxiliary Grid Algebraic Multigrid Method for Graphic Processing Units (Q2847754)
- (Q2861909)
- Divide and Conquer on Hybrid GPU-Accelerated Multicore Systems (Q2904837)
- Exploiting Symmetry in Tensors for High Performance: Multiplication with Symmetric Tensors (Q2940030)
- An efficient approach to solve very large dense linear systems with verified computing on clusters (Q2948101)
- Accelerating GPU Kernels for Dense Linear Algebra (Q3081346)
- A Scalable High Performant Cholesky Factorization for Multicore with GPU Accelerators (Q3081347)
- Sparse Matrix-Vector Multiplication on GPGPUs (Q3133580)
- Computing Least Squares Condition Numbers on Hybrid Multicore/GPU Systems (Q3459696)
- Solving a Large-Scale Thermal Radiation Problem Using an Interoperable Executive Library Framework on Petascale Supercomputers (Q3459768)
- Implementing High-performance Complex Matrix Multiplication via the 3m and 4m Methods (Q4581359)
- Accelerating the Solution of Linear Systems by Iterative Refinement in Three Precisions (Q4610143)
- Hiding Global Communication Latency in the GMRES Algorithm on Massively Parallel Machines (Q4917164)
- Hierarchical algorithms on hierarchical architectures (Q4993506)
- A Set of Batched Basic Linear Algebra Subprograms and LAPACK Routines (Q5025204)
- Design of a Multicore Sparse Cholesky Factorization Using DAGs (Q5200267)
- PLASMA (Q5237428)
- KBLAS (Q5270751)
- A High Performance QDWH-SVD Solver Using Hardware Accelerators (Q5270768)
- Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems (Q5270774)