Communication Lower Bounds and Optimal Algorithms for Multiple Tensor-Times-Matrix Computation

Publication:6154935

DOI: 10.1137/22M1510443, arXiv: 2207.10437, Wikidata: Q128541178, Scholia: Q128541178, MaRDI QID: Q6154935, FDO: Q6154935


Authors: Hussam al Daas, Grey Ballard, Laura Grigori


Publication date: 16 February 2024

Published in: SIAM Journal on Matrix Analysis and Applications

Abstract: Multiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms for computing and operating with the Tucker tensor decomposition, which is frequently used in multidimensional data analysis. We establish communication lower bounds that determine how much data movement is required to perform the Multi-TTM computation in parallel. The crux of the proof relies on analytically solving a constrained, nonlinear optimization problem. We also present a parallel algorithm to perform this computation that organizes the processors into a logical grid with twice as many modes as the input tensor. We show that with correct choices of grid dimensions, the communication cost of the algorithm attains the lower bounds and is therefore communication optimal. Finally, we show that our algorithm can significantly reduce communication compared to the straightforward approach of expressing the computation as a sequence of tensor-times-matrix operations.
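As an illustration of the computation the abstract describes, the following is a minimal sequential sketch of a 3-way Multi-TTM in plain Python. It compares an all-at-once evaluation with the "sequence of tensor-times-matrix operations" formulation. It is not the paper's parallel algorithm and models no communication. The index convention (each matrix has shape n_k x r_k and contracts mode k of the tensor) and the function names `multi_ttm_direct`, `ttm`, and `multi_ttm_sequence` are assumptions made for this example.

```python
from itertools import product


def multi_ttm_direct(X, mats):
    """All-at-once 3-way Multi-TTM (assumed convention):
    Y[a][b][c] = sum over i,j,k of X[i][j][k] * A1[i][a] * A2[j][b] * A3[k][c].
    """
    A1, A2, A3 = mats
    n1, n2, n3 = len(X), len(X[0]), len(X[0][0])
    r1, r2, r3 = len(A1[0]), len(A2[0]), len(A3[0])
    Y = [[[0 for _ in range(r3)] for _ in range(r2)] for _ in range(r1)]
    for a, b, c in product(range(r1), range(r2), range(r3)):
        Y[a][b][c] = sum(
            X[i][j][k] * A1[i][a] * A2[j][b] * A3[k][c]
            for i, j, k in product(range(n1), range(n2), range(n3))
        )
    return Y


def ttm(X, A, mode):
    """Single tensor-times-matrix on a 3-way tensor.
    Contracts mode `mode` (0, 1, or 2) of X with the rows of A (n_mode x r).
    """
    dims = [len(X), len(X[0]), len(X[0][0])]
    n, r = dims[mode], len(A[0])
    out_dims = dims[:]
    out_dims[mode] = r
    Y = [[[0 for _ in range(out_dims[2])] for _ in range(out_dims[1])]
         for _ in range(out_dims[0])]
    for idx in product(*(range(d) for d in out_dims)):
        total = 0
        for t in range(n):
            src = list(idx)
            src[mode] = t  # walk along the contracted mode of the input
            total += X[src[0]][src[1]][src[2]] * A[t][idx[mode]]
        Y[idx[0]][idx[1]][idx[2]] = total
    return Y


def multi_ttm_sequence(X, mats):
    """Multi-TTM expressed as one TTM per mode, applied in order."""
    Y = X
    for mode, A in enumerate(mats):
        Y = ttm(Y, A, mode)
    return Y


# Small hypothetical input: a 2x2x2 tensor and three 2x1 matrices.
X = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]
mats = [[[1], [1]], [[1], [0]], [[0], [1]]]
print(multi_ttm_direct(X, mats))    # [[[8]]]
print(multi_ttm_sequence(X, mats))  # [[[8]]]
```

Both formulations produce the same tensor. They differ in the intermediate data they create: the sequence approach materializes a partial result after each mode, which is the baseline the abstract compares against.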


Full work available at URL: https://arxiv.org/abs/2207.10437






