Datasets for Content-and-Structure (CAS) Indexing





We provide the datasets used in our paper "Dynamic Interleaving of Content and Structure for Robust Indexing of Semi-Structured Hierarchical Data". There are three datasets: the ServerFarm (SF) dataset, the XMark dataset, and the Amazon dataset. We created the ServerFarm dataset ourselves. Our Amazon dataset is based on a subset of the Amazon dataset by Julian McAuley (see http://jmcauley.ucsd.edu/data/amazon/links.html). The XMark dataset is a synthetic dataset based on the XMark benchmark (see https://projects.cwi.nl/xmark/downloads.html).










