Design, implementation and testing of extended and mixed precision BLAS

From MaRDI portal
Publication:5461236

DOI10.1145/567806.567808zbMath1070.65523OpenAlexW2104281151WikidataQ113309773 ScholiaQ113309773MaRDI QIDQ5461236

No author found.

Publication date: 22 July 2005

Published in: ACM Transactions on Mathematical Software (Search for Journal in Brave)

Full work available at URL: https://digital.library.unt.edu/ark:/67531/metadc793919/




Related Items (40)

Mixed precision algorithms in numerical linear algebraIterative refinement for symmetric eigenvalue decompositionConvergence of Rump's method for inverting arbitrarily ill-conditioned matricesSchur aggregation for linear systems and determinantsAccurate floating-point summation: a new approachAlgorithms for accurate, validated and fast polynomial evaluationCompensated summation and dot product algorithms for floating-point vectors on parallel architectures: error bounds, implementation and application in the Krylov subspace methodsFormalization of Double-Word Arithmetic, and Comments on “Tight and Rigorous Error Bounds for Basic Building Blocks of Double-Word Arithmetic”Super-fast validated solution of linear systemsAccurate evaluation of a polynomial and its derivative in Bernstein formFloating-point arithmeticOn the numerical stability of algorithmic differentiationMatrix computations and polynomial root-finding with preprocessingAdditive preconditioning and aggregation in matrix computationsInfinite-precision inner product and sparse matrix-vector multiplication using Ozaki scheme with Dot2 on manycore processorsAcceleration of iterative refinement for singular value decompositionRadial basis function approximation methods with extended precision floating point arithmeticStochastic arithmetic in multiprecisionMore accuracy at fixed precision.Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applicationsAccurate, validated and fast evaluation of elementary symmetric functions and its applicationAccurate quotient-difference algorithm: error analysis, improvements and applicationsAccurate and efficient evaluation of Chebyshev tensor product surfaceRandomized preprocessing of homogeneous linear systems of equationsApproximate Calculation of Sums II: Gaussian Type QuadratureStrassen's Algorithm for Tensor ContractionAccurate evaluation of a polynomial in Chebyshev formAccurate summation, dot product and polynomial evaluation in complex floating point arithmeticComputing prime harmonic sumsPerformance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulationsIterative refinement for singular value decomposition based on matrix multiplicationPerformance and energy consumption of accurate and mixed-precision linear algebra kernels on GPUsEfficient Calculations of Faithfully Rounded l 2 -Norms of n -VectorsA new error-free floating-point summation algorithmIterative refinement for symmetric eigenvalue decomposition. II. Clustered eigenvaluesAn accurate algorithm for evaluating rational functionsAccurate evaluation of polynomials in Legendre basisPACF: a precision-adjustable computational framework for solving singular valuesSIMD Parallel Sparse Matrix-Vector and Transposed-Matrix-Vector Multiplication in DD PrecisionImprovement of error-free splitting for accurate matrix multiplication


Uses Software



This page was built for publication: Design, implementation and testing of extended and mixed precision BLAS