Algebraic data integration

Publication:4577809

DOI: 10.1017/S0956796817000168
zbMATH Open: 1475.68069
arXiv: 1503.03571
OpenAlex: W2964296895
MaRDI QID: Q4577809
FDO: Q4577809

Ryan Wisnesky, Patrick Schultz

Publication date: 3 August 2018

Published in: Journal of Functional Programming

Abstract: In this paper we develop an algebraic approach to data integration by combining techniques from functional programming, category theory, and database theory. In our formalism, database schemas and instances are algebraic (multi-sorted equational) theories of a certain form. Schemas denote categories, and instances denote their initial (term) algebras. The instances on a schema S form a category, S-Inst, and a morphism of schemas F : S -> T induces three adjoint data migration functors: Sigma_F : S-Inst -> T-Inst, defined by substitution along F, which has a right adjoint Delta_F : T-Inst -> S-Inst, which in turn has a right adjoint Pi_F : S-Inst -> T-Inst. We present a query language based on for/where/return syntax where each query denotes a sequence of data migration functors; a pushout-based design pattern for performing data integration using our formalism; and describe the implementation of our formalism in a tool we call AQL.
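As a rough illustration of the data migration functors mentioned in the abstract, the following Python sketch computes Delta_F on a tiny example. It is not taken from the paper or from AQL. It assumes the simpler, standard view of an instance as a set-valued functor (tables of row identifiers plus foreign-key functions) and ignores attributes and schema equations. The dictionary encoding, the function name `delta`, and the example schemas (Employee, Department, City, Person, Place) are all hypothetical. Under these assumptions, Delta_F is precomposition with F: each S-table is a copy of the T-table it maps to, and each S-foreign key follows the T-path it maps to.

```python
# Minimal sketch of Delta_F as precomposition with a schema mapping F : S -> T.
# Assumption: an instance is {"tables": {node: set_of_ids}, "fks": {edge: {id: id}}}.
# This encoding is hypothetical and is not the representation used by AQL.

def delta(f_nodes, f_edges, inst_t):
    """Pull a T-instance back along F.

    f_nodes maps each S-node to a T-node.
    f_edges maps each S-edge to (source S-node, list of T-edges forming a path).
    """
    # Each S-table is a copy of the T-table that F sends it to.
    tables = {s: set(inst_t["tables"][t]) for s, t in f_nodes.items()}
    fks = {}
    for s_edge, (s_src, t_path) in f_edges.items():
        column = {}
        for row in tables[s_src]:
            target = row
            # Follow the T-path one foreign key at a time.
            for t_edge in t_path:
                target = inst_t["fks"][t_edge][target]
            column[row] = target
        fks[s_edge] = column
    return {"tables": tables, "fks": fks}


# Hypothetical T-schema: Employee --worksIn--> Department --locatedIn--> City
inst_t = {
    "tables": {
        "Employee": {"e1", "e2"},
        "Department": {"d1"},
        "City": {"c1", "c2"},
    },
    "fks": {
        "worksIn": {"e1": "d1", "e2": "d1"},
        "locatedIn": {"d1": "c1"},
    },
}

# Hypothetical S-schema: Person --city--> Place, with F sending
# Person to Employee, Place to City, and city to the path worksIn ; locatedIn.
f_nodes = {"Person": "Employee", "Place": "City"}
f_edges = {"city": ("Person", ["worksIn", "locatedIn"])}

result = delta(f_nodes, f_edges, inst_t)
```

In this example, `result` has a `Person` table containing the two employee rows and a `city` column that sends both of them to `c1`. Sigma_F and Pi_F, the left and right adjoints of Delta_F, need more machinery (quotients and limits, respectively) and are not sketched here.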


Full work available at URL: https://arxiv.org/abs/1503.03571




