AMiner-534K: Knowledge Graph of AMiner benchmark for Author Name Disambiguation

From MaRDI portal
(Redirected from Dataset:6709456)



DOI10.5281/zenodo.5675801Zenodo5675801MaRDI QIDQ6709456FDOQ6709456

Dataset published at Zenodo repository.

Cristian Santini, Aldo Gangemi, Mehwish Alam, Harald Sack, Silvio Peroni, Gesese. Genet Asefa

Publication date: 11 November 2021

Copyright license: Creative Commons Attribution 4.0 International



This dataset is a knowledge graph extracted from aAMiner benchmarkfor a research project on knowledge graph embeddings (KGEs)for author disambiguation. Structural triples of the knowledge graph are split into training, testing and validation for applying representation learning methods. Textual literals and numeric literals were stored separately in order to implement multimodal approaches for KGEs (seearXiv:1802.00934). For the same reason, textual literals and numeric literals are already stored into sentence embeddings and anumeric matrixrespectively in the filestextual_literals.npyandnumeric_literals.npy. The fileand_eval.jsoncontains the evaluation dataset used for evaluating our AND architecture. For the script used to gather this dataset see the GitHub repository:https://github.com/sntcristian/and-kge/tree/main/aminer.







This page was built for dataset: AMiner-534K: Knowledge Graph of AMiner benchmark for Author Name Disambiguation