Skip to main

Vocabulary of natural language processing

Search from vocabulary

Concept information

NLP methods and tools > signal processing > language identifier

Término preferido

language identifier  

Definición

  • A piece of software for the automatic recognition of the language of a document. (Adapted from Rajesh et al., Recognizing the languages in WebPages-A framework for NLP, 2013)

Concepto genérico

Ejemplo

  • However even the best language identifiers do not give perfect results when dealing with a large number of languages out-of-domain texts or short texts. (Jauhiainen, Lindén & Jauhiainen, 2017)
  • State of the art language identifiers obtain high rates in both recall and precision. (Jauhiainen, Lindén & Jauhiainen, 2017)
  • The language identifier created by Brown (2012) "whatlang" obtains 99.2% classification accuracy with smoothing for 65 character test strings when distinguishing between 1100 languages (Brown 2013; Brown 2014). (Jauhiainen, Lindén & Jauhiainen, 2017)
  • The language identifier itself can be utilized as a tool to pre-filter a new database in order to refine the probability table. (Vitale, 1991)

En otras lenguas

URI

http://data.loterre.fr/ark:/67375/8LP-FPHMHWZ6-4

Descargue este concepto:

RDF/XML TURTLE JSON-LD última modificación 26/4/24