piecemaker (Q111109): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
Added link to MaRDI item.
 
links / mardi / namelinks / mardi / name
 

Latest revision as of 19:56, 12 March 2024

Tools for Preparing Text for Tokenizers
Language Label Description Also known as
English
piecemaker
Tools for Preparing Text for Tokenizers

    Statements

    0 references
    1.0.1
    3 March 2022
    0 references
    1.0.0
    6 August 2021
    0 references
    1.0.2
    2 June 2023
    0 references
    0 references
    0 references
    0 references
    2 June 2023
    0 references
    Tokenizers break text into pieces that are more usable by machine learning models. Many tokenizers share some preparation steps. This package provides those shared steps, along with a simple tokenizer.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references