Results 131 to 140 of about 4,321 (229)
RUMLEM: A Dictionary-Based Lemmatizer for Romansh
Lemmatization -- the task of mapping an inflected word form to its dictionary form -- is a crucial component of many NLP applications. In this paper, we present RUMLEM, a lemmatizer that covers the five main varieties of Romansh as well as the supra-regional standard variety Rumantsch Grischun.
Dominic P. Fischer +2 more
openaire +2 more sources
An investigation into lemmatization in Southern Sotho
Lemmatization refers to the process whereby a lexicographer assigns a specific place in a dictionary to a word which he regards as the most basic form amongst other related forms.
Makgabutlane, Kelebohile Hilda
core
This paper describes Adam Mickiewicz University's (AMU) solution for the 4th Shared Task on SlavNER. The task involves the identification, categorization, and lemmatization of named entities in Slavic languages. Our approach involved exploring the use of
Pałka, Gabriela, Nowakowski, Artur
core
Weigh your words : memory-based lemmatization for Middle Dutch
: This article deals with the lemmatization of Middle Dutch literature. This text collectionlike any other medieval corpusis characterized by an enormous spelling variation, which makes it difficult to perform a computational analysis of this kind of ...
Daelemans, Walter +2 more
core
A Simple Joint Model for Improved Contextual Neural Lemmatization
English verbs have multiple forms. For instance, talk may also appear as talks, talked or talking, depending on the context. The NLP task of lemmatization seeks to map these diverse forms back to a canonical one, known as the lemma.
Wu, Shijie +2 more
core
KPWr annotation guidelines - named entity and phrase lemmatization 2.0
Guidelines for named entity and multi-word phrase lemmatization used in in KPWr (Polish Corpus of Wrocław University of Technology)
Oleksy, Marcin, Marcińczuk, Michał
core
Evaluation Analysis of the Necessity of Stemming and Lemmatization in Text Classification
Stemming and lemmatization are text preprocessing methods that aim to convert words into their root and to the canonical or dictionary form. Some previous studies state that using stemming and lemmatization worsens the performance of text classification ...
Muku, I Dewa Made Krishna +3 more
core +1 more source
GliLem: Leveraging GliNER for Contextualized Lemmatization in Estonian
We present GliLem—a novel hybrid lemmatization system for Estonian that enhances the highly accurate rule-based morphological analyzer Vabamorf with an external disambiguation module based on GliNER—an open vocabulary NER model that is able to match text
Sirts, Kairit, Dorkin, Aleksei
core
Croatian Lemmatization Server [PDF]
The need for lemmatization in inflectionally rich languages is indisputable: it is applicable for the whole range of procedures — from textsearch, up to parsing. From two predominant approaches to lemmatization: 1) algorithmic (generally rule-based and realized with FSA) and 2) relational (generally data-driven and realized with databases), this ...
openaire +1 more source
The plague of 1720 and migration in Martigues (France) in the 17th and 18th centuries. [PDF]
Darlu P, Séguy I.
europepmc +1 more source

