Skip to main content

Vocabulary of natural language processing

Search from vocabulary

Concept information

Preferred term

n-gram  

Definition

  • Sequence of aligned words assigned with probabilities, which represent how likely the word sequences occur in a training corpus. ( https://zenodo.org/record/5646896)

Broader concept

Synonym(s)

  • n gram
  • ngram

Example

  • BLEU calculates the number of matches for each n-gram based on the maximum number of times the n-gram occurs in common with any one of the references. (Sai, Mohankumar, Arora & Khapra, 2020)
  • If the n-gram is a term the input is labelled as positive training example. (Rigouts Terryn, Hoste, Drouin & Lefever, 2020)
  • Ngrams are annotated with part-of-speech tags (e.g. in the phrase he burnt the toast burnt is a verb; in the burnt toast burnt is an adjective) and head-modifier dependencies (e.g. in the phrase the little black book little modifies book). (Lin, Michel, Aiden Lieberman, Orwant, Brockman & Petrov, 2012)
  • Shorter n-grams were not found to improve performance on development data and hence are not extracted. (Bergsma, Lin & Goebel, 2008)

In other languages

URI

http://data.loterre.fr/ark:/67375/8LP-DB86GKBS-4

Download this concept:

RDF/XML TURTLE JSON-LD Last modified 5/21/24