GraSCCo_PHI - Graz Synthetic Clinical text Corpus with Protected Health Information Annotations
DOI10.5281/zenodo.11502329Zenodo11502329MaRDI QIDQ6676724FDOQ6676724
Dataset published at Zenodo repository.
Udo Hahn, Martin Boeker, Rebekka Kiser, Faller Jakob, Luise Modersohn, Andrea Riedel, Frank A Meineke, Franz Matthies, Christina Lohr
Publication date: 9 September 2024
Copyright license: Creative Commons Attribution 4.0 International
GraSCCo_PHI - Graz Synthetic Clinical text Corpus with Protected Health Information Annotations GraSCCo is a collection of artificially generated semi-structured and unstructured German-language clinical summaries. These summaries are formulated as letters from the hospital to the patient's GP after in-patient or out-patient care. Details: Stefan Schulz. (2022). GraSCCo (Version v1) [Data set]. Zenodo.https://doi.org/10.5281/zenodo.6539131 Modersohn L, Schulz S, Lohr C, Hahn U. GRASCCO - The First Publicly Shareable, Multiply-Alienated German Clinical Text Corpus. Stud Health Technol Inform. 2022;296:66-72. doi:10.3233/SHTI220805 This is the GraSSCo with annotations of Proteced Health Information as an external source of Lohr C, Matthies F, Faller J, et al. De-Identifying GRASCCO - A Pilot Study for the De-Identification of the German Medical Text Project (GeMTeX) Corpus. Stud Health Technol Inform. 2024;317:171-179. doi:10.3233/SHTI240853 (https://pubmed.ncbi.nlm.nih.gov/39234720/) This repository contains the annotations in XMI and JSON exports created with the INCEpTION annotation platform (https://inception-project.github.io/), also the annotation guideline document, TypeSystem.xml and layer.json (needed for import in INCEpTION).
This page was built for dataset: GraSCCo_PHI - Graz Synthetic Clinical text Corpus with Protected Health Information Annotations