DOI: 10.1145/42288.42291; zbMath: 0639.65016; DBLP: journals/toms/DongarraCHH88; OpenAlex: W2038469228; Wikidata: Q56455010; Scholia: Q56455010; MaRDI QID: Q3780347
Jeremy J. du Croz, Jack J. Dongarra, Sven J. Hammarling, Richard J. Hanson
Publication date: 1988
Published in: ACM Transactions on Mathematical Software
Full work available at URL: http://www.acm.org/pubs/contents/journals/toms/1988-14/
Towards an efficient use of the BLAS library for multilinear tensor contractions ⋮
Running air pollution models on the connection machine ⋮
PROFIL/BIAS - A fast interval library ⋮
SOLUTION OF LARGE LINEAR SYSTEMS ON PIPELINED SIMD MACHINES ⋮
Logarithmic barriers for sparse matrix cones ⋮
Finite algorithms for robust linear regression ⋮
New software for large dense symmetric generalized eigenvalue problems using secondary storage ⋮
The Singular Value Decomposition: Anatomy of Optimizing an Algorithm for Extreme Scale ⋮
The design of a parallel dense linear algebra software library: Reduction to Hessenberg, tridiagonal, and bidiagonal form ⋮
A PARALLEL BLOCK LANCZOS ALGORITHM FOR DISTRIBUTED MEMORY ARCHITECTURES ⋮
PARALLEL CFD BENCHMARKS ON CRAY COMPUTERS ⋮
The solution of linear systems by using the Sherman-Morrison formula ⋮
A Davidson program for finding a few selected extreme eigenpairs of a large, sparse, real, symmetric matrix ⋮
Parallel benchmarks of turbulence in complex geometries ⋮
Gram-Schmidt orthogonalization: 100 years and more ⋮
High-performance numerical algorithms and software for subspace-based linear multivariable system identification ⋮
The parallel iterative methods (PIM) package for the solution of systems of linear equations on parallel computers ⋮
Explicit parallel block Cholesky algorithms on the CRAY APP ⋮
High performance solution of partial differential equations discretized using a Chebyshev spectral collocation method ⋮
Solving initial value problems for ordinary differential equations by two approaches: BDF and piecewise-linearized methods ⋮
Parallel solution of almost block diagonal systems on a hypercube ⋮
A penalty continuation method for the \(\ell_\infty\) solution of overdetermined linear systems ⋮
Automatic code selection for the dense symmetric generalized eigenvalue problem using ATMathCoreLib ⋮
High-Performance Tensor Contraction without Transposition ⋮
High Performance Implementation of Binomial Option Pricing ⋮
Two-stage least squares and indirect least squares algorithms for simultaneous equations models ⋮
Enhancing Performance and Robustness of ILU Preconditioners by Blocking and Selective Transposition ⋮
A KAM theory for conformally symplectic systems: efficient algorithms and their validation ⋮
A highly efficient implementation of a backpropagation learning algorithm using matrix ISA ⋮
Codes for almost block diagonal systems ⋮
Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD ⋮
Discontinuous Galerkin methods with nodal and hybrid modal/nodal triangular, quadrilateral, and polygonal elements for nonlinear shallow water flow ⋮
Deriving dense linear algebra libraries ⋮
Real-Time Radiation Treatment Planning with Optimality Guarantees via Cluster and Bound Methods ⋮
A direct heuristic algorithm for linear programming ⋮
Vectorizing codes for studying long-range transport of air pollutants ⋮
Parallel implementation of a multilevel modelling package ⋮
Block-Cholesky for parallel processing ⋮
Solving large dense systems of linear equations on systems with virtual memory and with cache ⋮
Comparisons of Gaussian elimination algorithms on a Cray Y-MP ⋮
Benchmarking in data envelopment analysis: an approach based on genetic algorithms and parallel programming ⋮
High performance BLAS formulation of the multipole-to-local operator in the fast multipole method ⋮
A high performance tool for the simulation of the dynamic pantograph-catenary interaction ⋮
Using Level 3 BLAS in Rotation-Based Algorithms ⋮
Linear algebra software for large-scale accelerated multicore computing ⋮
From steady solutions to chaotic flows in a Rayleigh-Bénard problem at moderate Rayleigh numbers ⋮
Restructuring the Tridiagonal and Bidiagonal QR Algorithms for Performance ⋮
Lattice quantum hadrodynamics on a CRAY Y-MP ⋮
Performance of parallel Cholesky factorization algorithms using BLAS ⋮
The Mailman algorithm: a note on matrix-vector multiplication ⋮
Accelerating scientific computations with mixed precision algorithms ⋮
Key concepts for parallel out-of-core LU factorization ⋮
High performance BLAS formulation of the adaptive fast multipole method ⋮
Numeric and symbolic evaluation of the Pfaffian of general skew-symmetric matrices ⋮
Communication lower bounds and optimal algorithms for numerical linear algebra ⋮
Inexact Bregman iteration for deconvolution of superimposed extended and point sources ⋮
Almost block diagonal linear systems: sequential and parallel solution techniques, and applications ⋮
Projection onto a Polyhedron that Exploits Sparsity ⋮
Solving emission tomography problems on vector machines ⋮
Running large air pollution models on high speed computers ⋮
BLIS: A Framework for Rapidly Instantiating BLAS Functionality ⋮
Reliable Generation of High-Performance Matrix Algebra ⋮
The Eigenvalues Slicing Library (EVSL): Algorithms, Implementation, and Software ⋮
\(O(n^3)\) noniterative heuristic algorithm for linear programs with error-free implementation. ⋮
High-performance sampling of generic determinantal point processes ⋮
A high order hybridizable discontinuous Galerkin method for incompressible miscible displacement in heterogeneous media ⋮
High-performance computing -- an overview ⋮
Mathematical software: Past, present, and future ⋮
Numerical algorithm delivery mechanisms ⋮
\(QR\)-like algorithms for eigenvalue problems ⋮
Numerical linear algebra algorithms and software ⋮
The impact of high-performance computing in the solution of linear systems: Trends and problems ⋮
Valuation of Structured Financial Products by Adaptive Multiwavelet Methods in High Dimensions ⋮
Computing the Gradient in Optimization Algorithms for the CP Decomposition in Constant Memory through Tensor Blocking ⋮
A block variant of the GMRES method for unsymmetric linear systems ⋮
Parallel computing with block-iterative image reconstruction algorithms ⋮
STRFLO: A program for time-independent calculations of multiphoton processes in one-electron atomic systems. I: Quasienergy spectra and angular distributions ⋮
Analytical Modeling Is Enough for High-Performance BLIS
This page was built for publication: An extended set of FORTRAN basic linear algebra subprograms