Skip to main

Vocabulary of natural language processing

Search from vocabulary

Concept information

Término preferido

byT5  

Definición

  • Pre-trained byte-level Transformer models based on the T5 architecture. (Xue et al., 2022)

Concepto genérico

Ejemplo

  • We see that byT5 alone achieves excellent BLEU scores. (Jude Ogundepo, Oladipo, Adeyemi, Ogueji & Lin, 2022)
  • Whereas byT5 only uses byte sequences instead of subwords and differs in hyperparameters Charformer uses convolution and combines character blocks to obtain latent subword representations. (Libovický, Schmid & Fraser, 2022)

En otras lenguas

URI

http://data.loterre.fr/ark:/67375/8LP-ZS8B6SL1-Z

Descargue este concepto:

RDF/XML TURTLE JSON-LD última modificación 26/4/24