It/Fr/En-Wiki-100 datasets

From MaRDI portal
(Redirected from Dataset:6718355)




The 3 datasets derived from the Italian (ItWiki-100), French(FrWiki-100) and English(EnWiki-100) Wikipedia dumps, with articles tagged with related portals (100 most common per language). If you use this data you may cite these works: Gasparetto A, Marcuzzo M, Zangari A, Albarelli A. (2022) A Survey on Text Classification Algorithms: From Text to Predictions. Information 13, no. 2: 83. https://doi.org/10.3390/info13020083 Gasparetto A, Zangari A, Marcuzzo M, Albarelli A. (2022) A survey on text classification: Practical perspectives on the Italian language. PLOS ONE 17(7): e0270904. https://doi.org/10.1371/journal.pone.0270904











This page was built for dataset: It/Fr/En-Wiki-100 datasets