Cited in
(81)- scientific article; zbMATH DE number 1424342 (Why is no real title available?)
- Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software
- An Asynchronous Parallel Supernodal Algorithm for Sparse Gaussian Elimination
- A recursive formulation of Cholesky factorization of a matrix in packed storage
- Combined selection of tile sizes and unroll factors using iterative compilation
- A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels
- scientific article; zbMATH DE number 2080886 (Why is no real title available?)
- scientific article; zbMATH DE number 1984687 (Why is no real title available?)
- Design, implementation and testing of extended and mixed precision BLAS
- BLIS: a framework for rapidly instantiating BLAS functionality
- Lowest common ancestors in trees and directed acyclic graphs
- scientific article; zbMATH DE number 1857504 (Why is no real title available?)
- scientific article; zbMATH DE number 1984695 (Why is no real title available?)
- scientific article; zbMATH DE number 1302622 (Why is no real title available?)
- scientific article; zbMATH DE number 2087095 (Why is no real title available?)
- scientific article; zbMATH DE number 2090650 (Why is no real title available?)
- scientific article; zbMATH DE number 2086366 (Why is no real title available?)
- scientific article; zbMATH DE number 2089163 (Why is no real title available?)
- scientific article; zbMATH DE number 2089172 (Why is no real title available?)
- scientific article; zbMATH DE number 1940308 (Why is no real title available?)
- A Supernodal Approach to Sparse Partial Pivoting
- ScaLAPACK Users' Guide
- Cache optimization for structured and unstructured grid multigrid
- Large-Scale Scientific Computing
- scientific article; zbMATH DE number 1728268 (Why is no real title available?)
- ATLAS
- FLAME
- PAG
- PSPASES
- SPIRAL
- HPCVIEW
- GEMMW
- OCEANS
- EVENODD
- GEMM
- BLAS
- UHFFT
- FFTW
- SUMMA
- PAPI
- OSKI
- PUMMA
- DynTile
- BLIS
- PLuTo
- Algorithm 679
- GotoBLAS
- BLISlab
- Emmerald
- OProfile
- Algorithm 784
- Algorithm 656
- SuperMatrix
- ADAPT
- ESSL
- PMLP
- Optimizing locality and scalability of embedded Runge-Kutta solvers using block-based pipelining
- Computational Science - ICCS 2004
- Emmerald: a fast matrix–matrix multiply using Intel's SSE instructions
- Finding least common ancestors in directed acyclic graphs
- Towards performance evaluation of high-performance computing on multiple Java platforms
- scientific article; zbMATH DE number 2080291 (Why is no real title available?)
- Formal derivation of algorithms
- Communication lower bounds for distributed-memory matrix multiplication
- Adaptive Winograd's matrix multiplications
- Optimization of algorithms with OPAL
- Reliable generation of high-performance matrix algebra
- GPTune
- When cache blocking of sparse matrix vector multiply works and why
- Automated empirical optimizations of software and the ATLAS project
- scientific article; zbMATH DE number 1728263 (Why is no real title available?)
- Analytical modeling is enough for high-performance BLIS
- scientific article; zbMATH DE number 1756010 (Why is no real title available?)
- scientific article; zbMATH DE number 1844553 (Why is no real title available?)
- An efficient time-step-based self-adaptive algorithm for predictor-corrector methods of Runge-Kutta type
- scientific article; zbMATH DE number 1729264 (Why is no real title available?)
- rchol
- Accurate Symmetric Indefinite Linear Equation Solvers
- A methodology for speeding up loop kernels by exploiting the software information and the memory architecture
- Cache-aware multigrid methods for solving Poisson's equation in two dimensions
- Distribution of a class of divide and conquer recurrences arising from the computation of the Walsh-Hadamard transform
This page was built for software: PHiPAC