Vocabulary of natural language processing

Concept information

Preferred term

diacritic mark  

Broader concept

Example

  • Diacritic marks help reduce the number of possible lexical word choices assigned to a source word which leads to better quality translated sentences. (Alqahtani, Ghoneim & Diab, 2016)
  • One important diacritic mark that is often missing from digital versions is shadda (gemination) which may be used to easily resolve some lexical ambiguities. (Terner, Bar & Dershowitz, 2020)
  • Research has indicated that excessive diacritic marks can hinder skilled readers, reducing reading speed and accuracy. (ElNokrashy & AlKhamissi, 2024)
  • The absence of diacritic marks makes the Arabic language more ambiguous (Meddeb et al., 2016). (Mahmoud & Zrigui, 2017)
  • White (2013) also notes that diacritic marks are often a common source of errors due to their small size and distance from the main character and that training in a combined approach also greatly expands the larger OCR character set. (Biggs, 2015)

In other languages

URI

http://data.loterre.fr/ark:/67375/8LP-X6Z676S2-J

Last modified 6/14/24