OpenAIRE Graph Beginner's Kit Dataset
DOI10.5281/zenodo.14891799Zenodo14891799MaRDI QIDQ6724830FDOQ6724830
Dataset published at Zenodo repository.
Michele de Bonis, Alessia Bardi, Harry Dimitropoulos, Marek Horst, Ioannis Foufoulas, Alexandros Ioannidis, Miriam Baglioni, Thanasis Vergoulis, Claudio Atzori, Gianbattista Bloisi, Andrea Mannocci, Paolo Manghi, Argiro Kokogiannaki, Antonis Lempesis, Sandro la Bruzzo, Serafeim Chatzopoulos, Katerina Iatropoulou, Michele Artini
Publication date: 19 February 2025
Copyright license: Creative Commons Attribution 4.0 International
The OpenAIRE Graph is an Open Access dataset containing metadata about research products (literature, datasets, software, etc.) linked to other entities of the research ecosystem like organisations, project grants, and data sources. The large size of the OpenAIRE Graph is a major impediment for beginners to familiarise with the underlying data model and explore its contents. Working with the Graph in its full size typically requires access to a huge distributed computing infrastructure which cannot be easily accessible to everyone. The OpenAIRE Beginners Kit aims to address this issue. It consists of two components: A subset of the OpenAIRE Graph composed of the research products published between 2024-06-01 and 2024-12-31, all the entities connected to them and the respective relationships. The subset is composed of the following parts: publication.tar: metadata records about research literature (includes types of publications listed here) dataset.tar: metadata records about research data (includes the subtypes listed here) software.tar: metadata records about research software (includes the subtypes listed here) otherresearchproduct.tar: metadata records about research products that cannot be classified as research literature, data or software (includes types of products listed here) organization.tar: metadata records about organizations involved in the research life-cycle, such as universities, research organizations, funders. datasource.tar: metadata records about data sourceswhose content is available in the OpenAIRE Graph. They includeinstitutional and thematic repositories, journals, aggregators, funders' databases. project.tar: metadata records about project grants. relation.tar: metadata records about relations between entities in the graph. communities_infrastructures.tar: metadata records about research communities and research infrastructuresEach file is a tar archive containing gz files, each with one json per line. Each json is compliant to the schema available at https://doi.org/10.5281/zenodo.14608526
This page was built for dataset: OpenAIRE Graph Beginner's Kit Dataset