Efficient lineage for SUM aggregate queries


DOI: 10.3233/AIC-140647; zbMATH Open: 1373.68308; arXiv: 1312.2990; OpenAlex: W1700011898; MaRDI QID: Q4589119; FDO: Q4589119

Angelos Vasilakopoulos, Foto N. Afrati, Dimitris Fotakis

Publication date: 7 November 2017

Published in: AI Communications

Abstract: AI systems typically make decisions and find patterns in data by computing aggregate functions, specifically sums, expressed as queries over the data's attributes. This computation can become costly or even inefficient when the queries concern all or large parts of the data, especially when dealing with big data. New types of intelligent analytics also require an explanation of why something happened. In this paper we present a randomised algorithm that constructs a small summary of the data, called Aggregate Lineage, which can approximate well and explain all sums with large values in time that depends only on its size. The size of Aggregate Lineage is practically independent of the size of the original data. Our algorithm does not assume any knowledge of the set of sum queries to be approximated.
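To make the idea of a small, query-independent summary concrete, here is a minimal sketch in Python. It assumes value-proportional sampling with replacement as the construction, which is one standard way to build such a summary and is not confirmed by the abstract as the paper's exact method. The names `build_summary` and `estimate_sum`, the sample size `b`, and the `seed` parameter are hypothetical and introduced only for illustration.

```python
import random
from collections import Counter


def build_summary(values, b, seed=0):
    """Draw b tuple ids with replacement, each with probability
    proportional to its (non-negative) value.

    Assumption: this value-proportional sampling stands in for the
    Aggregate Lineage construction; the abstract does not spell it out.
    """
    rng = random.Random(seed)
    ids = list(values)
    weights = [values[i] for i in ids]
    total = sum(weights)
    sample = rng.choices(ids, weights=weights, k=b)
    # The summary keeps only the sampled ids (with multiplicity),
    # the grand total, and the sample size.
    return Counter(sample), total, b


def estimate_sum(summary, predicate):
    """Estimate the sum of values over the ids selected by predicate.

    Each sampled id stands for total / b units of the sum, so the cost
    depends on the summary size, not on the size of the original data.
    """
    counts, total, b = summary
    hits = sum(c for i, c in counts.items() if predicate(i))
    return total * hits / b


# Hypothetical data: 10,000 tuples with values between 1 and 10.
values = {i: float(i % 10 + 1) for i in range(10_000)}
summary = build_summary(values, b=500)
approx_even = estimate_sum(summary, lambda i: i % 2 == 0)
exact_even = sum(v for i, v in values.items() if i % 2 == 0)
```

In this sketch the sampled ids that satisfy a predicate also serve as a rough explanation of the sum, since high-value tuples are the ones most likely to be sampled. The estimate is only expected to be close for sums that make up a large share of the total, which matches the abstract's restriction to "sums with large values".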


Full work available at URL: https://arxiv.org/abs/1312.2990






