Cited in
(only showing first 100 items)
- On short recurrence Krylov type methods for linear systems with many right-hand sides
- REVISITING MATRIX PRODUCT ON MASTER-WORKER PLATFORMS
- RECSY and SCASY Library Software: Recursive Blocked and Parallel Algorithms for Sylvester-Type Matrix Equations with Some Applications
- Efficient vector and parallel manipulation of tensor products
- Two-dimensional block partitionings for the parallel sparse Cholesky factorization
- Efficient sparse LU factorization with left-right looking strategy on shared memory multiprocessors
- Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applications
- Linear algebra kernels for parallel domain decomposition methods
- Dual BEM for crack growth analysis on distributed-memory multiprocessors
- Householder QR factorization with randomization for column pivoting (HQRRP)
- A multishift, multipole rational QZ method with aggressive early deflation
- Computing the singular value decomposition with high relative accuracy
- The Art of High Performance Computing for Computational Science, Vol. 1
- scientific article; zbMATH DE number 1424342
- Parallel solution of almost block diagonal systems on the CRAY Y-MP using level 3 BLAS
- LAPACK-Based Condition Estimates for the Discrete-Time LQG Design
- Efficient generalized Hessenberg form and applications
- Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software
- An Asynchronous Parallel Supernodal Algorithm for Sparse Gaussian Elimination
- Exploiting zeros on the diagonal in the direct solution of indefinite sparse symmetric linear systems
- Fast inclusion of interval matrix multiplication
- Using Level 3 BLAS in Rotation-Based Algorithms
- A mathematical model of the static pantograph/catenary interaction
- Parallel solution of almost block diagonal systems on a hypercube
- Improving and estimating the accuracy of Strassen's algorithm
- scientific article; zbMATH DE number 733664
- Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithm
- Block algorithms for reordering standard and generalized Schur forms
- Adaptive techniques for improving the performance of incomplete factorization preconditioning
- Solving almost block diagonal systems on parallel computers
- The parallel iterative methods (PIM) package for the solution of systems of linear equations on parallel computers
- Parallel field computation based on coupling of differential and integral methods
- Parallel profiling of water distribution networks using the Clément formula
- High performance BLAS formulation of the multipole-to-local operator in the fast multipole method
- An Unsymmetric-Pattern Multifrontal Method for Sparse LU Factorization
- Fast Parallel Iterative Solution of Poisson’s and the Biharmonic Equations on Irregular Regions
- A recursive formulation of Cholesky factorization of a matrix in packed storage
- Undulant-block elimination and integer-preserving matrix inversion
- Numerical linear algebra algorithms and software
- Component-based derivation of a parallel stiff ODE solver implemented in a cluster of computers
- Codes for almost block diagonal systems
- Implementing High-performance Complex Matrix Multiplication via the 3m and 4m Methods
- Solvers for the verified solution of parametric linear systems
- 2LEV-D2P4: a package of high-performance preconditioners for scientific and engineering applications
- Parallel reduction of banded matrices to bidiagonal form
- HARNESS fault tolerant MPI design, usage and performance issues
- A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels
- An object‐oriented interface for the dynamic memory management of sparse discrete mathematical operators in numerical scientific applications
- Block-oriented J-Jacobi methods for Hermitian matrices
- Optimizing a Parallel Conjugate Gradient Solver
- Performance of parallel Cholesky factorization algorithms using BLAS
- scientific article; zbMATH DE number 5920471
- Towards an efficient use of the BLAS library for multilinear tensor contractions
- Block and Parallel Versions of One-Sided Bidiagonalization
- A massively-parallel electronic-structure calculations based on real-space density functional theory
- Computer algebra systems - new strategies and techniques
- Algorithm 784: GEMM-based level 3 BLAS
- scientific article; zbMATH DE number 4055028
- Using Strassen's algorithm to accelerate the solution of linear systems
- scientific article; zbMATH DE number 1984687
- Sparse matrix factorization in the implicit finite element method on petascale architecture
- Multishift Variants of the QZ Algorithm with Aggressive Early Deflation
- Computing the matrix geometric mean: Riemannian versus Euclidean conditioning, implementation techniques, and a Riemannian BFGS method.
- Sparse extensions to the FORTRAN Basic Linear Algebra Subprograms
- Design, implementation and testing of extended and mixed precision BLAS
- Programming methodology and performance issues for advanced computer architectures
- Mathematical software: Past, present, and future
- New parallel sparse direct solvers for multicore architectures
- Parallel Factorization of Structured Matrices Arising in Stochastic Programming
- SIMD parallel MCMC sampling with applications for big-data Bayesian analytics
- Strategies for parallelizing the solution of rational matrix equations
- Out-of-Core Implementations of Cholesky Factorization: Loop-Based versus Recursive Algorithms
- Optimizing the multipole-to-local operator in the fast multipole method for graphical processing units
- Automatic translation of Fortran to JVM bytecode
- Full multi grid method for electric field computation in point-to-plane streamer discharge in air at atmospheric pressure
- BLIS: a framework for rapidly instantiating BLAS functionality
- Fast verification of solutions of matrix equations
- An inverse free parallel spectral divide and conquer algorithm for nonsymmetric eigenproblems
- High-Performance Tensor Contraction without Transposition
- Minimum classification error training in example based speech and pattern recognition using sparse weight matrices
- A Nondeterministic Parallel Algorithm for General Unsymmetric Sparse LU Factorization
- A distributed-memory package for dense hierarchically semi-separable matrix computations using randomization
- On Improving Linear Solver Performance: A Block Variant of GMRES
- A Parallel Implementation of the Nonsymmetric QR Algorithm for Distributed Memory Architectures
- Parallel strategies for computing the orthogonal factorizations used in the estimation of econometric models
- Efficient algorithms for block downdating of least squares solutions
- The automatic generation of sparse primitives
- A Block Orthogonalization Procedure with Constant Synchronization Requirements
- Block Gram-Schmidt downdating
- Numerical methods. Principles, analysis and algorithms.
- scientific article; zbMATH DE number 1984695
- Efficient iterative algorithms for the stochastic finite element method with application to acoustic scattering
- Software Libraries for Linear Algebra Computations on High Performance Computers
- The design, implementation, and evaluation of a symmetric banded linear solver for distributed-memory parallel computers
- A column pre-ordering strategy for the unsymmetric-pattern multifrontal method
- \(QR\)-like algorithms for eigenvalue problems
- Parallel solution of hierarchical symmetric positive definite linear systems
- scientific article; zbMATH DE number 1069170
- A piecewise-linearized algorithm based on the Krylov subspace for solving stiff ODEs
- Applications of level 2 BLAS in the NAG library
This page was built for software: BLAS