Datasets for Content-and-Structure (CAS) Indexing
DOI10.5281/zenodo.3739263Zenodo3739263MaRDI QIDQ6697222FDOQ6697222
Dataset published at Zenodo repository.
Michael H. Böhlen, Sven Helmer, Kevin Wellenzohn
Publication date: 8 April 2020
Copyright license: Creative Commons Attribution 4.0 International
We provide the datasets used in our paper Dynamic Interleaving of Content and Structure for Robust Indexing of Semi-Structured Hierarchical Data. There are three datasets: ServerFarm (SF) dataset XMark dataset Amazon dataset We created the ServerFarm dataset ourselves. Our Amazon dataset is based on a subset of the Amazon dataset byJulian McAuley(see http://jmcauley.ucsd.edu/data/amazon/links.html). The XMark dataset is a synthetic dataset based on the XMark benchmark (https://projects.cwi.nl/xmark/downloads.html)
This page was built for dataset: Datasets for Content-and-Structure (CAS) Indexing