A set of level 3 basic linear algebra subprograms

From MaRDI portal
Revision as of 23:56, 6 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:4371637

DOI10.1145/77626.79170zbMath0900.65115OpenAlexW2002257715WikidataQ56455009 ScholiaQ56455009MaRDI QIDQ4371637

Jeremy J. du Croz, Sven J. Hammarling, Jack J. Dongarra, Iain S. Duff

Publication date: 23 March 1998

Published in: ACM Transactions on Mathematical Software (Search for Journal in Brave)

Full work available at URL: http://www.acm.org/pubs/contents/journals/toms/1990-16/




Related Items (only showing first 100 items - show all)

A parallel R-matrix program PRMAT for electron-atom and electron-ion scattering calculationsTowards an efficient use of the BLAS library for multilinear tensor contractionsObject-oriented programming in control system design: A surveyA new parallel sparse direct solver: Presentation and numerical experiments in large-scale structural mechanics parallel computingInterior-point solver for large-scale quadratic programming problems with bound constraintsPROFIL/BIAS - A fast interval librarySparse Matrix Methods for Circuit Simulation ProblemsUnnamed ItemAn implicitly restarted block Lanczos bidiagonalization method using Leja shiftsStabilizing canonical-ensemble calculations in the auxiliary-field Monte Carlo methodPerformance models and workload distribution algorithms for optimizing a hybrid CPU-GPU multifrontal solverBasis selection in LOBPCGThe design of a parallel dense linear algebra software library: Reduction to Hessenberg, tridiagonal, and bidiagonal formNonlinear eigenvalue and frequency response problems in industrial practiceA block representation for products of hyperbolic Householder transformsParallel benchmarks of turbulence in complex geometriesExplicit parallel block Cholesky algorithms on the CRAY APPHigh performance solution of partial differential equations discretized using a Chebyshev spectral collocation methodParallel solution of almost block diagonal systems on a hypercubeAn efficient approach to solve very large dense linear systems with verified computing on clustersSparse matrix factorization in the implicit finite element method on petascale architectureEfficient algorithm for proper orthogonal decomposition of block-structured adaptively refined numerical simulationsA sparse nonsymmetric eigensolver for distributed memory architecturesOptimal size of the block in block GMRES on GPUs: computational model and experimentsFactorizing the factorization -- a spectral-element solver for elliptic equations with linear operation countReorthogonalized block classical Gram-SchmidtComputer algebra systems - new strategies and techniquesFull multi grid method for electric field computation in point-to-plane streamer discharge in air at atmospheric pressureA \(\mu\)-mode BLAS approach for multidimensional tensor-structured problemsEnhancing Performance and Robustness of ILU Preconditioners by Blocking and Selective TranspositionA highly efficient implementation of a backpropagation learning algorithm using matrix ISAEfficient algorithms for the discrete Gabor transform with a long FIR windowCodes for almost block diagonal systemsRank-profile revealing Gaussian elimination and the CUP matrix decompositionUpper and lower I/O bounds for pebbling \(r\)-pyramidsA multiscale method for model order reduction in PDE parameter estimationA domain-decomposing parallel sparse linear system solverFast interval matrix multiplicationSparse direct factorizations through unassembled hyper-matricesLook-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVDUnnamed ItemDeriving dense linear algebra librariesSolving sequences of generalized least-squares problems on multi-threaded architecturesCholesky and Gram-Schmidt Orthogonalization for Tall-and-Skinny QR Factorizations on Graphics ProcessorsAn efficient out-of-core multifrontal solver for large-scale unsymmetric element problemsApproximate eigenvectors as preconditionerNew parallel sparse direct solvers for multicore architecturesParallel implementation of a multilevel modelling packagePerformance evaluation of supercomputers using HPCC and IMB benchmarksBlock-Cholesky for parallel processingSolving large dense systems of linear equations on systems with virtual memory and with cacheA sparse proximal implementation of the LP dual active set algorithmDual multilevel optimizationComparisons of Gaussian elimination algorithms on a Cray Y-MPAugmented block Householder Arnoldi methodParallel solution of almost block diagonal systems on the CRAY Y-MP using level 3 BLASHigh performance BLAS formulation of the multipole-to-local operator in the fast multipole methodVBARMS: a variable block algebraic recursive multilevel solver for sparse linear systemsFast inclusion of interval matrix multiplicationFrom steady solutions to chaotic flows in a Rayleigh-Bénard problem at moderate Rayleigh numbersEfficient use of sparsity by direct solvers applied to 3D controlled-source EM problemsLattice quantum hadrodynamics on a CRAY Y-MPPerformance of parallel Cholesky factorization algorithms using BLASA massively-parallel electronic-structure calculations based on real-space density functional theoryEfficient iterative algorithms for the stochastic finite element method with application to acoustic scatteringAccelerating scientific computations with mixed precision algorithmsHigh performance BLAS formulation of the adaptive fast multipole methodSolving stable Sylvester equations via rational iterative schemesA mathematical model of the static pantograph/catenary interactionDiffusion forecasting model with basis functions from QR-decompositionSolving path problems on the GPURECSY and SCASY Library Software: Recursive Blocked and Parallel Algorithms for Sylvester-Type Matrix Equations with Some ApplicationsLAPACK-Based Condition Estimates for the Discrete-Time LQG DesignReproducibility strategies for parallel preconditioned conjugate gradientMultifrontal Computations on GPUs and Their Multi-core HostsThe parallel tiled WZ factorization algorithm for multicore architecturesUsing dual techniques to derive componentwise and mixed condition numbers for a linear function of a linear least squares solutionEfficient algorithm for simultaneous reduction to the \(m\)-Hessenberg-triangular-triangular formBLIS: A Framework for Rapidly Instantiating BLAS FunctionalityReliable Generation of High-Performance Matrix AlgebraBlock reduction of matrices to condensed forms for eigenvalue computationsDesigning linear algebra algorithms on the IBM 3090 vector multiprocessor with a hierarchical memory systemSelf-Stabilizing Prefix Tree Based Overlay NetworksGmsh: A 3-D finite element mesh generator with built-in pre- and post-processing facilitiesA parallel Davidson-type algorithm for several eigenvaluesMultifrontal parallel distributed symmetric and unsymmetric solversScaLAPACK: A portable linear algebra library for distributed memory computers -- design issues and performanceHigh-performance computing -- an overviewA review of frontal methods for solving linear systemsMathematical software: Past, present, and futureNumerical algorithm delivery mechanismsA frontal solver for the 21st centuryEvaluating recursive filters on distributed memory parallel computers\(QR\)-like algorithms for eigenvalue problemsNumerical linear algebra algorithms and softwareThe impact of high-performance computing in the solution of linear systems: Trends and problemsNodal high-order methods on unstructured grids. I: Time-domain solution of Maxwell's equationsUnnamed ItemA block varaint of the GMRES method for unsymmetric linear systemsSTRFLO: A program for time-independent calculations of multiphoton processes in one-electron atomic systems. I: Quasienergy spectra and angular distributions


Uses Software






This page was built for publication: A set of level 3 basic linear algebra subprograms