Cited in (35)
- Algorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architectures
- Algorithm 1022
- Deriving dense linear algebra libraries
- The BLAS API of BLASFEO: optimizing performance for small matrices
- Parallel matrix multiplication: a systematic journey
- Dominant speed factors of active set methods for fast MPC
- Families of Algorithms for Reducing a Matrix to Condensed Form
- Implementing High-performance Complex Matrix Multiplication via the 3m and 4m Methods
- FLAME
- QPC
- BLIS
- Elemental
- QSPLINE
- XMR
- AAFAC
- PUMMA
- BLIS
- cl1ck
- SBR Toolbox
- OpenBLAS
- QUARK
- BLISlab
- SuperMatrix
- ReLAPACK
- DPB
- randUTV
- iMod
- Restructuring the tridiagonal and bidiagonal QR algorithms for performance
- clSPARSE
- Solving dense generalized eigenproblems on multi-threaded architectures
- Algorithm 979: Recursive algorithms for dense linear algebra -- the ReLAPACK collection
- BLIS: a framework for rapidly instantiating BLAS functionality
- Linnea
- randUTV: a blocked randomized algorithm for computing a rank-revealing UTV factorization
- Householder QR factorization with randomization for column pivoting (HQRRP)
This page was built for software: libflame