Cited in
(18)- MuLOT: multi-level optimization of the canonical polyadic tensor decomposition at large-scale
- Design of a high-performance GEMM-like tensor-tensor multiplication
- P3DFFT
- MADNESS
- BLIS
- TTC
- zlib
- FFmpeg
- TAMRESH
- CFOUR
- tthresh
- CESM
- Draco
- ATC
- Zstandard
- Spin summations
- TTC: a high-performance compiler for tensor transpositions
- Spin summations: a high-performance perspective
This page was built for software: HPTT