quanteda.textstats

From MaRDI portal
Revision as of 19:55, 12 March 2024 by Import240312060351 (created automatically from import240312060351)

Software:57601



CRAN: quanteda.textstats, MaRDI QID: Q57601

Textual Statistics for the Quantitative Analysis of Textual Data

Jiong Wei Lua, Kohei Watanabe, Kenneth Benoit, Haiyan Wang, Jouni Kuha

Last update: 2 November 2023

Copyright license: GNU General Public License, version 3.0

Software version identifier: 0.90, 0.91, 0.92, 0.93, 0.94, 0.94.1, 0.95, 0.96, 0.96.1, 0.96.2, 0.96.3, 0.96.4



Textual statistics functions, formerly part of the 'quanteda' package, for characterizing and comparing textual data. The package includes functions for measuring term and document frequency, word co-occurrence, similarity and distance between features and documents, feature entropy, keyword occurrence, readability, and lexical diversity. These functions extend the 'quanteda' package and are specially designed for sparse textual data.
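
To make a few of the listed measures concrete, here is a minimal, language-neutral sketch in Python. It is not the package's R API and does not call quanteda. The toy corpus, the naive whitespace tokenizer, and every function name below are assumptions made for illustration. It computes term frequency, document frequency, a type-token ratio (one simple lexical diversity measure), and cosine similarity between documents, storing only non-zero counts per document to mirror the sparse-data setting.

```python
from collections import Counter
import math

# Toy corpus (hypothetical example data, not shipped with the package).
docs = {
    "d1": "the cat sat on the mat",
    "d2": "the dog sat on the log",
    "d3": "cats and dogs",
}


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


# Per-document term counts. Only terms that occur are stored, so each
# Counter acts as one sparse row of a document-feature matrix.
counts = {name: Counter(tokenize(text)) for name, text in docs.items()}


def term_frequency(counts):
    """Total number of occurrences of each term across all documents."""
    total = Counter()
    for doc_counts in counts.values():
        total.update(doc_counts)
    return total


def document_frequency(counts):
    """Number of documents in which each term occurs at least once."""
    df = Counter()
    for doc_counts in counts.values():
        df.update(doc_counts.keys())
    return df


def type_token_ratio(doc_counts):
    """Lexical diversity: distinct terms divided by total tokens."""
    return len(doc_counts) / sum(doc_counts.values())


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


tf = term_frequency(counts)
df = document_frequency(counts)
print(tf.most_common(3))
print({name: type_token_ratio(c) for name, c in counts.items()})
print(cosine_similarity(counts["d1"], counts["d2"]))
```

In this toy corpus, "the" occurs four times in total but in only two documents, and d1 and d2 have a cosine similarity of 0.75 because they share "the", "sat", and "on". The package itself provides these kinds of statistics through its own R functions (for example `textstat_frequency`), which operate on quanteda's sparse document-feature matrices rather than on dictionaries like these.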