Concept information
Preferred term
sentencepiece
Broader concept
Definitional context(s)
- SentencePiece is a powerful and flexible method for unsupervised tokenization and subword segmentation and provides an implementation of the BPE algorithm. (Lamar & Kaya, 2023)
Example
- SentencePiece allows the user to specify special characters that cannot be crossed when constructing subword tokens both during training of the tokenizer and during tokenization of a sentence. (Soulos, Rao, Smith, Rosen, Celikyilmaz, McCoy, Jiang, Haley, Fernandez, Palangi, Gao & Smolensky, 2021)
In other languages
-
French
URI
http://data.loterre.fr/ark:/67375/8LP-RLF76QH0-3
{{label}}
{{#each values }} {{! loop through ConceptPropertyValue objects }}
{{#if prefLabel }}
{{/if}}
{{/each}}
{{#if notation }}{{ notation }} {{/if}}{{ prefLabel }}
{{#ifDifferentLabelLang lang }} ({{ lang }}){{/ifDifferentLabelLang}}
{{#if vocabName }}
{{ vocabName }}
{{/if}}