ScaLAPACK: A portable linear algebra library for distributed memory computers -- design issues and performance
DOI10.1016/0010-4655(96)00017-3zbMATH Open0926.65148OpenAlexW4239025233MaRDI QIDQ1294620FDOQ1294620
Authors: V. Pereyra
Publication date: 30 November 1999
Published in: Computer Physics Communications (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/0010-4655(96)00017-3
Recommendations
performancesoftwareparallel algorithmsparallel computersScaLAPACKdistributed memory computersdistributed linear algebra machinelinear algebra computations
Parallel numerical computation (65Y05) Packaged methods for numerical algorithms (65Y15) Symbolic computation and algebraic computation (68W30) Numerical linear algebra (65Fxx)
Cites Work
- Title not available (Why is that?)
- Algorithm 656: an extended set of basic linear algebra subprograms: model implementation and test programs
- Basic Linear Algebra Subprograms for Fortran Usage
- A set of level 3 basic linear algebra subprograms
- The NX message passing interface
- Parallel matrix transpose algorithms on distributed memory concurrent computers
- Title not available (Why is that?)
- On the correctness of some bisection-like parallel eigenvalue algorithms in floating point arithmetic
Cited In (25)
- Parallel Processing and Applied Mathematics
- Parallel multivariate slice sampling
- Considerations on the Implementation and Use of Anderson Acceleration on Distributed Memory and GPU-based Parallel Computers
- An efficient parallel high-order compact scheme for the 3D incompressible Navier–Stokes equations
- Experiencing a parallel mathematical library on a PC network
- A sparse nonsymmetric eigensolver for distributed memory architectures
- An efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures
- Key concepts for parallel out-of-core LU factorization
- An efficient approach to solve very large dense linear systems with verified computing on clusters.
- ScaLAPACK Users' Guide
- Title not available (Why is that?)
- Scaling up parallel computation of tiled QR factorizations by a distributed scheduling runtime system and analytical modeling
- Algorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architectures
- Practical task-oriented parallelism for Gaussian elimination in distributed memory
- ScaLAPACK
- Transient growth analysis of hypersonic flow over an elliptic cone
- Order \(10^4\) speedup in global linear instability analysis using matrix formation
- Title not available (Why is that?)
- Porting a sphere optimization program from LAPACK to ScaLAPACK
- Title not available (Why is that?)
- Title not available (Why is that?)
- A heterogeneous parallel LU factorization algorithm based on a basic column block uniform allocation strategy
- The impact of data distribution in accuracy and performance of parallel linear algebra subroutines
- Solving dense interval linear systems with verified computing on multicore architectures
- High performance verified computing using C-XSC
Uses Software
This page was built for publication: ScaLAPACK: A portable linear algebra library for distributed memory computers -- design issues and performance
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1294620)