Design, implementation and testing of extended and mixed precision BLAS
From MaRDI portal
Publication:5461236
DOI10.1145/567806.567808zbMath1070.65523OpenAlexW2104281151WikidataQ113309773 ScholiaQ113309773MaRDI QIDQ5461236
No author found.
Publication date: 22 July 2005
Published in: ACM Transactions on Mathematical Software (Search for Journal in Brave)
Full work available at URL: https://digital.library.unt.edu/ark:/67531/metadc793919/
Complexity and performance of numerical algorithms (65Y20) Software, source code, etc. for problems pertaining to linear algebra (15-04) Numerical linear algebra (65Fxx)
Related Items (40)
Mixed precision algorithms in numerical linear algebra ⋮ Iterative refinement for symmetric eigenvalue decomposition ⋮ Convergence of Rump's method for inverting arbitrarily ill-conditioned matrices ⋮ Schur aggregation for linear systems and determinants ⋮ Accurate floating-point summation: a new approach ⋮ Algorithms for accurate, validated and fast polynomial evaluation ⋮ Compensated summation and dot product algorithms for floating-point vectors on parallel architectures: error bounds, implementation and application in the Krylov subspace methods ⋮ Formalization of Double-Word Arithmetic, and Comments on “Tight and Rigorous Error Bounds for Basic Building Blocks of Double-Word Arithmetic” ⋮ Super-fast validated solution of linear systems ⋮ Accurate evaluation of a polynomial and its derivative in Bernstein form ⋮ Floating-point arithmetic ⋮ On the numerical stability of algorithmic differentiation ⋮ Matrix computations and polynomial root-finding with preprocessing ⋮ Additive preconditioning and aggregation in matrix computations ⋮ Infinite-precision inner product and sparse matrix-vector multiplication using Ozaki scheme with Dot2 on manycore processors ⋮ Acceleration of iterative refinement for singular value decomposition ⋮ Radial basis function approximation methods with extended precision floating point arithmetic ⋮ Stochastic arithmetic in multiprecision ⋮ More accuracy at fixed precision. ⋮ Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applications ⋮ Accurate, validated and fast evaluation of elementary symmetric functions and its application ⋮ Accurate quotient-difference algorithm: error analysis, improvements and applications ⋮ Accurate and efficient evaluation of Chebyshev tensor product surface ⋮ Randomized preprocessing of homogeneous linear systems of equations ⋮ Approximate Calculation of Sums II: Gaussian Type Quadrature ⋮ Strassen's Algorithm for Tensor Contraction ⋮ Accurate evaluation of a polynomial in Chebyshev form ⋮ Accurate summation, dot product and polynomial evaluation in complex floating point arithmetic ⋮ Computing prime harmonic sums ⋮ Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations ⋮ Iterative refinement for singular value decomposition based on matrix multiplication ⋮ Performance and energy consumption of accurate and mixed-precision linear algebra kernels on GPUs ⋮ Efficient Calculations of Faithfully Rounded l 2 -Norms of n -Vectors ⋮ A new error-free floating-point summation algorithm ⋮ Iterative refinement for symmetric eigenvalue decomposition. II. Clustered eigenvalues ⋮ An accurate algorithm for evaluating rational functions ⋮ Accurate evaluation of polynomials in Legendre basis ⋮ PACF: a precision-adjustable computational framework for solving singular values ⋮ SIMD Parallel Sparse Matrix-Vector and Transposed-Matrix-Vector Multiplication in DD Precision ⋮ Improvement of error-free splitting for accurate matrix multiplication
Uses Software
This page was built for publication: Design, implementation and testing of extended and mixed precision BLAS