Linguistic features of Twitter rumor propagation trees on the CLNews19-20 dataset

From MaRDI portal
(Redirected from Dataset:6702716)



DOI10.5281/zenodo.10913226Zenodo10913226MaRDI QIDQ6702716FDOQ6702716

Dataset published at Zenodo repository.

Nicolás Riquelme, René Venegas, Eduardo Puraivan, Fabián Riquelme

Publication date: 3 April 2024

Copyright license: Creative Commons Attribution 4.0 International



Linguistic features applied to 140 Twitter rumor propagation trees (53 of type 1: false rumors, and 87 of type 2: true rumors) about Chilean topics collected during the Chilean social outbreak (2019-2020). These rumor propagation trees come from the CLNews19-20 dataset (DOI 10.5281/zenodo.5851204). There are 38 different linguistics features applied: number of paragraphs total number of sentences standard deviation of sentences mean words maximum number of words mean characters maximum number of characters mean characters without spaces maximum number of characters without spaces adjective idf minimum number of adpositions maximum number of adpositions mean adpositions median adpositions maximum number of auxiliaries total number of auxiliaries mean auxiliaries median auxiliaries auxiliary idf auxiliary tfidf standard deviation of numerals proper noun idf minimum number of symbols maximum number of symbols total number of symbols mean symbols median symbols symbol idf symbol tfidf number of paragraphs MDT conditionals MDT counterarguments MDT Connectors Opinion Justifiers MDT Connectors Opinion Generalizers AS VeryPositive Affin AS Negative Nrc AS Angry Nrc AS Fear Nrc







This page was built for dataset: Linguistic features of Twitter rumor propagation trees on the CLNews19-20 dataset