piecemaker (Q111109)

From MaRDI portal
Tools for Preparing Text for Tokenizers
Language Label Description Also known as
English
piecemaker
Tools for Preparing Text for Tokenizers

    Statements

    0 references
    1.0.1
    3 March 2022
    0 references
    1.0.0
    6 August 2021
    0 references
    1.0.2
    2 June 2023
    0 references
    0 references
    0 references
    0 references
    2 June 2023
    0 references
    Tokenizers break text into pieces that are more usable by machine learning models. Many tokenizers share some preparation steps. This package provides those shared steps, along with a simple tokenizer.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references