Results 181 to 190 of about 14,660 (202)
Some of the next articles are maybe not open access.

Hybrid Lemmatizer for Estonian

2014
In this paper, we present a lemmatizer for the Estonian language, which employs a hybrid approach to handle both in- and out-of-vocabulary words. Our method uses only publicly available data and does not require any external tools such as a POS tagger. In the process of experimentation, we achieved the accuracy of 91%.
Tkachenko Alexander   +2 more
openaire   +1 more source

CNN-based Context Sensitive Lemmatization

Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, 2019
Morphological analysis is always considered as an important task in natural language processing (NLP). Lemmatization is a major morphological operation that finds the dictionary headword/root of a surface word. In context sensitive languages, the context of a surface word plays a key role to find its lemma.
Abhisek Chakrabarty   +2 more
openaire   +1 more source

Automatic Word Lemmatization

2002
This record contains a full paper presented at the 3rd Conference on Language Technologies (JT-2002), held in Ljubljana, Slovenia, in October 2002.
openaire   +1 more source

Automatic lemmatization of Persian words*

Journal of Quantitative Linguistics, 2006
Abstract This study presents a rather novel method for suffix and prefix stripping of Persian words. The method presented is a language independent one and mostly relies on a specially arranged corpus composed of a list of roots, word-forms, prefixes, and suffixes which has been manually compiled.
openaire   +1 more source

LIT: Rule Based Italian Lemmatizer

2019
In natural language processing applications, such as those related to question answering systems, and more specifically, to semantic role labelling, an important task to perform during the text normalization phase is lemmatization which consists in determining those two words which have the same root, despite their surface differences.
Simone Molendini   +2 more
openaire   +1 more source

Lemmatization and Headword Structure

1997
Abstract We now come to the actual structure and presentation of Palsgrave’s word list, and, as we can see from the following examples, his lexicographical method of presentation differs greatly from modem practice in bilingual English-French dictionaries.
openaire   +1 more source

The Croatian Lemmatization Server

2005
The need for lemmatization in inflectionally rich languages is indisputable: it is applicable for the whole range of procedures, from text-search up to parsing. From two predominant approaches to lemmatization (algorithmic— generally rule-based and realized with FSA— and relational— generally data-driven and realized with databases ...
openaire   +1 more source

Home - About - Disclaimer - Privacy