Vocabulary description
Title
Vocabulary of natural language processing
Description
The vocabulary of natural language processing (NLP) is a bilingual (French-English) terminological resource. It is the result of transforming a hierarchical list of terms into SKOS. It includes more than 1,600 concepts, some of which have one or more definitions, definitional contexts and examples of use.
This vocabulary is based on:
- reusing, merging, unifying and extending classes and properties from existing resources, namely the Vocabulary of Linguistics, the Thesaurus of Text Mining, the Vocabulary of Signal Theory and Processing, and the Artes (Aide à la rédaction de textes scientifiques) dictionary, which was created by the joint research team of the UFR EILA and URP 3967 ALTAE departments of Université Paris Cité and is based on Bénard (2019);
- extracting terms from subject-based corpora (ACL Anthology Reference Corpus);
- manual identification of problematic terms during an experimental post-editing session (Bawden et al., 2024);
- automatic extraction of definitional contexts and examples of use with the Concordancer tool, which was created as part of the MaTOS ANR project.
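The transformation of a hierarchical list of terms into SKOS mentioned in the description can be illustrated with a minimal sketch. It is not the pipeline actually used to build this vocabulary: the input format (two-space indentation, `English | French` on each line), the `http://example.org/concept/` namespace, the sequential identifiers, and the helper names `parse_hierarchy` and `to_turtle` are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical namespace for illustration only; not the vocabulary's real URIs.
BASE = "http://example.org/concept/"


@dataclass
class Concept:
    ident: str
    label_en: str
    label_fr: str
    broader: Optional[str] = None


def parse_hierarchy(lines: List[str], indent: int = 2) -> List[Concept]:
    """Read 'English | French' lines whose indentation encodes depth."""
    concepts: List[Concept] = []
    stack: List[str] = []  # stack[d] holds the latest concept id at depth d
    for raw in lines:
        if not raw.strip():
            continue
        depth = (len(raw) - len(raw.lstrip(" "))) // indent
        label_en, label_fr = (part.strip() for part in raw.split("|"))
        ident = f"c{len(concepts) + 1}"
        del stack[depth:]  # drop ancestors that are not parents of this line
        broader = stack[-1] if stack else None
        concepts.append(Concept(ident, label_en, label_fr, broader))
        stack.append(ident)
    return concepts


def _escape(text: str) -> str:
    """Escape backslashes and double quotes for a Turtle string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_turtle(concepts: List[Concept]) -> str:
    """Serialize concepts as SKOS in Turtle, one block per concept."""
    out = ["@prefix skos: <http://www.w3.org/2004/02/skos/core#> .", ""]
    for c in concepts:
        props = [
            "a skos:Concept",
            f'skos:prefLabel "{_escape(c.label_en)}"@en',
            f'skos:prefLabel "{_escape(c.label_fr)}"@fr',
        ]
        if c.broader:
            props.append(f"skos:broader <{BASE}{c.broader}>")
        out.append(f"<{BASE}{c.ident}> " + " ;\n    ".join(props) + " .")
    return "\n".join(out)


# Made-up sample entries, not taken from the vocabulary itself.
SAMPLE = """\
natural language processing | traitement automatique des langues
  machine translation | traduction automatique
    post-editing | post-édition
  parsing | analyse syntaxique
"""

concepts = parse_hierarchy(SAMPLE.splitlines())
print(to_turtle(concepts))
```

Each indented line becomes a `skos:Concept` with one preferred label per language and a `skos:broader` link to the nearest less-indented line above it. A real conversion would also need to handle synonyms, definitions and persistent identifiers, which this sketch leaves out.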
Creator
Institute for Scientific and Technical Information (Inist) - CNRS/UAR76
MaTOS (Machine Translation for Open Science) project, ANR-22-CE23-0033 - F. Yvon (dir.)
Version
2.0
Creation date
Friday, April 26, 2024 00:00:00
Last modified
Friday, September 12, 2025 00:00:00
Attribution name
Institute for Scientific and Technical Information (Inist) - CNRS/UAR76
cc:attributionURL
dc:alternative
NLP vocabulary
Identifier
Description
This resource contains 1,634 terminological entries.
skosmos:shortName
NLP Vocabulary
URI
http://data.loterre.fr/ark:/67375/8LP
Number of entries by type
| Type | Count |
|---|---|
Number of terms by language
| Language | Preferred terms | Synonyms | Hidden terms |
|---|---|---|---|