BLAS - MaRDI portal

MaRDI QID: Q15749

Official website https://www.netlib.org/blas/

Cited in

(showing the first 100 items)

On short recurrence Krylov type methods for linear systems with many right-hand sides
REVISITING MATRIX PRODUCT ON MASTER-WORKER PLATFORMS
RECSY and SCASY Library Software: Recursive Blocked and Parallel Algorithms for Sylvester-Type Matrix Equations with Some Applications
Efficient vector and parallel manipulation of tensor products
Two-dimensional block partitionings for the parallel sparse Cholesky factorization
Efficient sparse LU factorization with left-right looking strategy on shared memory multiprocessors
Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applications
Linear algebra kernels for parallel domain decomposition methods
Dual BEM for crack growth analysis on distributed-memory multiprocessors
Householder QR factorization with randomization for column pivoting (HQRRP)
A multishift, multipole rational QZ method with aggressive early deflation
Computing the singular value decomposition with high relative accuracy
The Art of High Performance Computing for Computational Science, Vol. 1
scientific article; zbMATH DE number 1424342
Parallel solution of almost block diagonal systems on the CRAY Y-MP using level 3 BLAS
LAPACK-Based Condition Estimates for the Discrete-Time LQG Design
Efficient generalized Hessenberg form and applications
Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software
An Asynchronous Parallel Supernodal Algorithm for Sparse Gaussian Elimination
Exploiting zeros on the diagonal in the direct solution of indefinite sparse symmetric linear systems
Fast inclusion of interval matrix multiplication
Using Level 3 BLAS in Rotation-Based Algorithms
A mathematical model of the static pantograph/catenary interaction
Parallel solution of almost block diagonal systems on a hypercube
Improving and estimating the accuracy of Strassen's algorithm
scientific article; zbMATH DE number 733664
Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm
Block algorithms for reordering standard and generalized Schur forms
Adaptive techniques for improving the performance of incomplete factorization preconditioning
Solving almost block diagonal systems on parallel computers
The parallel iterative methods (PIM) package for the solution of systems of linear equations on parallel computers
Parallel field computation based on coupling of differential and integral methods
Parallel profiling of water distribution networks using the Clément formula
High performance BLAS formulation of the multipole-to-local operator in the fast multipole method
An Unsymmetric-Pattern Multifrontal Method for Sparse LU Factorization
Fast Parallel Iterative Solution of Poisson’s and the Biharmonic Equations on Irregular Regions
A recursive formulation of Cholesky factorization of a matrix in packed storage
Undulant-block elimination and integer-preserving matrix inversion
Numerical linear algebra algorithms and software
Component-based derivation of a parallel stiff ODE solver implemented in a cluster of computers
Codes for almost block diagonal systems
Implementing High-performance Complex Matrix Multiplication via the 3m and 4m Methods
Solvers for the verified solution of parametric linear systems
2LEV-D2P4: a package of high-performance preconditioners for scientific and engineering applications
Parallel reduction of banded matrices to bidiagonal form
HARNESS fault tolerant MPI design, usage and performance issues
A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels
An object‐oriented interface for the dynamic memory management of sparse discrete mathematical operators in numerical scientific applications
Block-oriented J-Jacobi methods for Hermitian matrices
Optimizing a Parallel Conjugate Gradient Solver
Performance of parallel Cholesky factorization algorithms using BLAS
scientific article; zbMATH DE number 5920471
Towards an efficient use of the BLAS library for multilinear tensor contractions
Block and Parallel Versions of One-Sided Bidiagonalization
A massively-parallel electronic-structure calculations based on real-space density functional theory
Computer algebra systems - new strategies and techniques
Algorithm 784: GEMM-based level 3 BLAS
scientific article; zbMATH DE number 4055028
Using Strassen's algorithm to accelerate the solution of linear systems
scientific article; zbMATH DE number 1984687
Sparse matrix factorization in the implicit finite element method on petascale architecture
Multishift Variants of the QZ Algorithm with Aggressive Early Deflation
Computing the matrix geometric mean: Riemannian versus Euclidean conditioning, implementation techniques, and a Riemannian BFGS method.
Sparse extensions to the FORTRAN Basic Linear Algebra Subprograms
Design, implementation and testing of extended and mixed precision BLAS
Programming methodology and performance issues for advanced computer architectures
Mathematical software: Past, present, and future
New parallel sparse direct solvers for multicore architectures
Parallel Factorization of Structured Matrices Arising in Stochastic Programming
SIMD parallel MCMC sampling with applications for big-data Bayesian analytics
Strategies for parallelizing the solution of rational matrix equations
Out-of-Core Implementations of Cholesky Factorization: Loop-Based versus Recursive Algorithms
Optimizing the multipole-to-local operator in the fast multipole method for graphical processing units
Automatic translation of Fortran to JVM bytecode
Full multi grid method for electric field computation in point-to-plane streamer discharge in air at atmospheric pressure
BLIS: a framework for rapidly instantiating BLAS functionality
Fast verification of solutions of matrix equations
An inverse free parallel spectral divide and conquer algorithm for nonsymmetric eigenproblems
High-Performance Tensor Contraction without Transposition
Minimum classification error training in example based speech and pattern recognition using sparse weight matrices
A Nondeterministic Parallel Algorithm for General Unsymmetric Sparse LU Factorization
A distributed-memory package for dense hierarchically semi-separable matrix computations using randomization
On Improving Linear Solver Performance: A Block Variant of GMRES
A Parallel Implementation of the Nonsymmetric QR Algorithm for Distributed Memory Architectures
Parallel strategies for computing the orthogonal factorizations used in the estimation of econometric models
Efficient algorithms for block downdating of least squares solutions
The automatic generation of sparse primitives
A Block Orthogonalization Procedure with Constant Synchronization Requirements
Block Gram-Schmidt downdating
Numerical methods. Principles, analysis and algorithms.
scientific article; zbMATH DE number 1984695
Efficient iterative algorithms for the stochastic finite element method with application to acoustic scattering
Software Libraries for Linear Algebra Computations on High Performance Computers
The design, implementation, and evaluation of a symmetric banded linear solver for distributed-memory parallel computers
A column pre-ordering strategy for the unsymmetric-pattern multifrontal method
\(QR\)-like algorithms for eigenvalue problems
Parallel solution of hierarchical symmetric positive definite linear systems
scientific article; zbMATH DE number 1069170
A piecewise-linearized algorithm based on the Krylov subspace for solving stiff ODEs
Applications of level 2 BLAS in the NAG library
