OpenAIRE ScholeXplorer Service: Scholix JSON Dump

Dataset:6724756



DOI: 10.5281/zenodo.8074885, Zenodo: 8074885, MaRDI QID: Q6724756, FDO: Q6724756

Dataset published in the Zenodo repository.

Sandro la Bruzzo, Paolo Manghi

Publication date: 16 March 2022



This dataset contains the GZ-compressed dump of the Scholix links (schema Version 4) exposed by the OpenAIRE ScholeXplorer service. It consists of 417+ Mi bi-directional links (i.e. 975+ Mi directed links) of type literature-dataset and dataset-dataset, involving 24+ Mi literature objects and 37+ Mi datasets (an increase of around 160 Mi links with respect to the previous release). Links are collected from publishers (CrossRef, EventData), data centers (DataCite and data centers), institutional/thematic repositories (OpenAIRE), and life-science databases (EMBL-EBI), and are also inferred by OpenAIRE by text-mining around 14 Mi publication PDFs. The dataset is structured in 30 compressed files, each of at most ~10 GB, for a total of ~328 GB. Note that the dataset matches a new version of the schema (schema Version 4). Changes are minor and backward compatible, and concern optional fields and extensions of vocabularies. The readme.doc file includes a description of the schema changes and statistics about the dataset.
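Because each file is large, it is usually more practical to stream a dump file than to load it whole. The following is a minimal sketch, assuming each GZ file holds one JSON object per line (JSON Lines); this layout is an assumption not stated in the description above, so check it against the readme.doc file before relying on it. The function names `iter_links` and `count_top_level_keys` are hypothetical helpers introduced here for illustration, and the sketch deliberately makes no assumption about Scholix field names.

```python
import gzip
import json
from collections import Counter
from typing import Iterator


def iter_links(path: str) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a gzip file.

    Assumes a JSON Lines layout (one link record per line).
    """
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def count_top_level_keys(path: str, limit: int = 100_000) -> Counter:
    """Tally the top-level keys seen in the first `limit` records.

    Useful for a first look at which fields are present before
    writing any schema-specific processing.
    """
    counts: Counter = Counter()
    for index, record in enumerate(iter_links(path)):
        if index >= limit:
            break
        counts.update(record.keys())
    return counts
```

Calling `count_top_level_keys("part-00000.gz")` (a placeholder file name, not taken from the dataset) would return a counter of field names for the sampled records, while memory use stays bounded because records are decompressed and parsed one line at a time.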






