textrecipes
From MaRDI portal
Software:84280
CRAN: textrecipes, MaRDI QID: Q84280
Extra 'Recipes' for Text Processing
Last update: 15 November 2023
Copyright license: MIT license, File License
Software version identifier: 0.0.1, 0.0.2, 0.1.0, 0.2.0, 0.2.1, 0.2.2, 0.2.3, 0.3.0, 0.4.0, 0.4.1, 0.5.0, 0.5.1, 0.5.2, 1.0.0, 1.0.1, 1.0.2, 1.0.3, 1.0.4, 1.0.5, 1.0.6
Converting text to numerical features requires purpose-built procedures, which are implemented here as steps that follow the conventions of the 'recipes' package. These steps allow for tokenization, filtering, counting (tf and tf-idf), and feature hashing.
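
The four operations named above can be illustrated with a minimal, language-neutral sketch. The Python below is not the textrecipes API and does not reproduce the package's implementation; the toy corpus, the stop-word list, the function names, and the bucket count are all hypothetical choices made for illustration. The tf-idf weighting shown is one common variant (raw count times log(N / df)), and the package's exact weighting may differ.

```python
import hashlib
import math
import re
from collections import Counter

# Hypothetical toy corpus and stop-word list, chosen only for illustration.
DOCS = ["The cat sat on the mat", "The dog chased the cat"]
STOPWORDS = {"the", "on"}


def tokenize(text):
    """Tokenization: lowercase the text and extract alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def filter_tokens(tokens, stopwords=STOPWORDS):
    """Filtering: drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in stopwords]


def term_frequency(tokens):
    """Counting (tf): raw number of occurrences of each token."""
    return Counter(tokens)


def tfidf(token_lists):
    """Counting (tf-idf): raw count times log(N / document frequency)."""
    n_docs = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    return [
        {term: count * math.log(n_docs / doc_freq[term])
         for term, count in Counter(tokens).items()}
        for tokens in token_lists
    ]


def hash_features(tokens, n_buckets=8):
    """Feature hashing: map each token to one of n_buckets count columns."""
    vector = [0] * n_buckets
    for token in tokens:
        # md5 is used only because it is stable across runs, unlike hash().
        digest = hashlib.md5(token.encode("utf-8")).digest()
        vector[int.from_bytes(digest[:4], "big") % n_buckets] += 1
    return vector


tokens = [filter_tokens(tokenize(doc)) for doc in DOCS]
counts = [term_frequency(t) for t in tokens]
weights = tfidf(tokens)
hashed = [hash_features(t) for t in tokens]
```

In this sketch a term that occurs in every document (here "cat") receives a tf-idf weight of zero, while feature hashing yields a fixed-width vector regardless of vocabulary size, at the cost of possible collisions between tokens.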