Data associated with "Developing a standardized but extendable framework to increase the findability of infectious disease datasets"
DOI: 10.5281/zenodo.7530501 | Zenodo: 7530501 | MaRDI QID: Q6683991 | FDO: Q6683991
Dataset published in the Zenodo repository.
José Bento, Chunlei Wu, Luke V. Rasmussen, Qinglong Wu, Justin Starren, Xinghua Zhou, Marco A. Alvarado Cano, Tor C. Savidge, Reed S. Shabman, Liliana Brown, Ginger Tsueng, Laura D. Hughes, Lars Pache, Candice Czech, Andrew I. Su, Jiwen Xin, Mengjia Kang (Marjorie)
Publication date: 29 August 2022
Copyright license: Creative Commons Attribution 4.0 International
Data associated with "Developing a standardized but extendable framework to increase the findability of infectious disease datasets".

Includes: the NIAID Dataset schema; the NIAID ComputationalTool schema; a crosswalk between the NIAID schemas and common schemas; and a survey of Schema.org-compliant repositories.

The open access movement and scientific reproducibility concerns have led the biomedical research community to embrace efforts to make scientific datasets openly accessible. While many datasets are now available, there are still challenges in ensuring that they are Findable, Accessible, Interoperable, and Reusable (FAIR). To improve the FAIRness of datasets, we evaluated dataset repositories for compliance with Schema.org standards, a collection of standards developed to increase metadata searchability across the internet. Adoption of the Schema.org Dataset standard was highly variable in biomedical research datasets, and the standard omitted many desirable metadata fields. We customized the Schema.org Dataset standard to catalog datasets collected across a Systems Biology research consortium consisting of 15 Centers. We developed a reusable process for creating a schema that is interoperable with other standards, but still extendable and customizable to a particular context. Here, we describe our process along with the associated gains in FAIRness, and discuss ongoing challenges with dataset discoverability, the first step to ensure that the vast amount of open data published by the research community is reused to its maximum value.
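As a rough illustration of the kind of Schema.org Dataset markup the description refers to, the sketch below builds a JSON-LD style record from the metadata listed on this page. It uses only core Schema.org properties (`name`, `identifier`, `datePublished`, `license`, `creator`) and is not the NIAID Dataset schema itself. The helper names `build_dataset_record` and `missing_properties`, the choice of expected properties, and the truncation of the creator list to the first three authors are assumptions made for this example.

```python
import json


def build_dataset_record(name, doi, date_published, license_url, creators):
    """Return a Schema.org Dataset description as a JSON-LD style dict."""
    return {
        "@context": "https://schema.org",
        "@type": "Dataset",
        "name": name,
        "identifier": f"https://doi.org/{doi}",
        "datePublished": date_published,
        "license": license_url,
        "creator": [{"@type": "Person", "name": c} for c in creators],
    }


def missing_properties(record, expected):
    """List the expected properties that are absent or empty in the record.

    The expected list is supplied by the caller; it is an illustrative
    check, not an official compliance test.
    """
    return [prop for prop in expected if not record.get(prop)]


record = build_dataset_record(
    name=(
        'Data associated with "Developing a standardized but extendable '
        'framework to increase the findability of infectious disease datasets"'
    ),
    doi="10.5281/zenodo.7530501",
    date_published="2022-08-29",
    license_url="https://creativecommons.org/licenses/by/4.0/",
    # Only the first three listed authors, to keep the example short.
    creators=["José Bento", "Chunlei Wu", "Luke V. Rasmussen"],
)

print(json.dumps(record, indent=2, ensure_ascii=False))

# Hypothetical check: which of these properties still need to be filled in?
print(missing_properties(record, ["name", "description", "license"]))
```

A record like this would typically be embedded in a landing page inside a `<script type="application/ld+json">` element so that search engines and dataset indexes can read it.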