ERBlox: combining matching dependencies with machine learning for entity resolution
From MaRDI portal
(Redirected from Publication:518610)
Abstract: Entity resolution (ER), an important and common data cleaning problem, is about detecting data duplicate representations for the same external entities, and merging them into single representations. Relatively recently, declarative rules called matching dependencies (MDs) have been proposed for specifying similarity conditions under which attribute values in database records are merged. In this work we show the process and the benefits of integrating three components of ER: (a) Classifiers for duplicate/non-duplicate record pairs built using machine learning (ML) techniques, (b) MDs for supporting both the blocking phase of ML and the merge itself; and (c) The use of the declarative language LogiQL -an extended form of Datalog supported by the LogicBlox platform- for data processing, and the specification and enforcement of MDs.
Recommendations
Cites work
- scientific article; zbMATH DE number 5296741 (Why is no real title available?)
- scientific article; zbMATH DE number 6823187 (Why is no real title available?)
- scientific article; zbMATH DE number 839556 (Why is no real title available?)
- scientific article; zbMATH DE number 1391397 (Why is no real title available?)
- Bridging logic and kernel machines
- Data Quality and Record Linkage Techniques
- Data cleaning and query answering with matching dependencies and matching functions
- Foundations of Rule Learning
- Kernel methods and machine learning
- Nearest neighbor pattern classification
Cited in
(8)- Massively parallel entity matching with linear classification in low dimensional space
- Matching dependencies: semantics and query answering
- Entity resolution for probabilistic data
- Expressive power of entity-linking frameworks
- Theoretical foundations of entity resolution models
- Entity resolution oriented clustering algorithm
- ERBlox
- Data cleaning and query answering with matching dependencies and matching functions
This page was built for publication: ERBlox: combining matching dependencies with machine learning for entity resolution
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q518610)