sentencepiece (Q89656)

From MaRDI portal





Text Tokenization using Byte Pair Encoding and Unigram Modelling
Language Label Description Also known as
default for all languages
No label defined
    English
    sentencepiece
    Text Tokenization using Byte Pair Encoding and Unigram Modelling

      Statements

      Identifiers