quanteda.textstats

CRAN: quanteda.textstats | MaRDI QID: Q57601

Textual Statistics for the Quantitative Analysis of Textual Data

Kohei Watanabe, Kenneth Benoit, Haiyan Wang, Jiong Wei Lua, Jouni Kuha

Last update: 2 November 2023

Software version identifier: 0.90, 0.91, 0.92, 0.93, 0.94, 0.94.1, 0.95, 0.96, 0.96.1, 0.96.2, 0.96.3, 0.96.4


Copyright license: GNU General Public License, version 3.0

Textual statistics functions, formerly part of the 'quanteda' package, for characterizing and comparing textual data. Includes functions for measuring term and document frequency, the co-occurrence of words, similarity and distance between features and documents, feature entropy, keyword occurrence, readability, and lexical diversity. These functions extend the 'quanteda' package and are specially designed for sparse textual data.
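
To make a few of the measures listed above concrete, the following is a minimal conceptual sketch in plain Python. It is not the package's R interface: the two toy documents, the whitespace tokenizer, and all function and variable names (`tokenize`, `type_token_ratio`, `cosine_similarity`, `term_freq`, `doc_freq`) are assumptions made for illustration only. It computes term frequency, document frequency, a simple lexical diversity measure (type-token ratio), and cosine similarity between two documents, storing only non-zero counts to mirror the sparse representation mentioned in the description.

```python
import math
from collections import Counter

# Toy corpus (hypothetical data, for illustration only).
docs = {
    "d1": "the cat sat on the mat",
    "d2": "the dog sat on the log",
}

def tokenize(text):
    """Naive lowercase whitespace tokenizer (an assumption, not the package's tokenizer)."""
    return text.lower().split()

tokens = {name: tokenize(text) for name, text in docs.items()}

# Sparse per-document counts: a Counter stores only features that occur.
counts = {name: Counter(toks) for name, toks in tokens.items()}

# Term frequency: total occurrences of each feature across the corpus.
term_freq = Counter()
for doc_counts in counts.values():
    term_freq.update(doc_counts)

# Document frequency: number of documents in which each feature occurs.
doc_freq = Counter()
for doc_counts in counts.values():
    doc_freq.update(doc_counts.keys())

def type_token_ratio(toks):
    """Lexical diversity as distinct tokens (types) divided by total tokens."""
    return len(set(toks)) / len(toks)

def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    shared = set(a) & set(b)
    dot = sum(a[t] * b[t] for t in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)

print(term_freq.most_common(3))
print(doc_freq["the"], doc_freq["cat"])
print(type_token_ratio(tokens["d1"]))
print(cosine_similarity(counts["d1"], counts["d2"]))
```

Because only the features shared by both documents contribute to the dot product, the similarity computation never touches the zero entries, which is the main reason sparse storage suits document-feature data.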