Results 181 to 190 of about 14,660 (202)
Some of the next articles are maybe not open access.
Hybrid Lemmatizer for Estonian
2014In this paper, we present a lemmatizer for the Estonian language, which employs a hybrid approach to handle both in- and out-of-vocabulary words. Our method uses only publicly available data and does not require any external tools such as a POS tagger. In the process of experimentation, we achieved the accuracy of 91%.
Tkachenko Alexander +2 more
openaire +1 more source
CNN-based Context Sensitive Lemmatization
Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, 2019Morphological analysis is always considered as an important task in natural language processing (NLP). Lemmatization is a major morphological operation that finds the dictionary headword/root of a surface word. In context sensitive languages, the context of a surface word plays a key role to find its lemma.
Abhisek Chakrabarty +2 more
openaire +1 more source
2002
This record contains a full paper presented at the 3rd Conference on Language Technologies (JT-2002), held in Ljubljana, Slovenia, in October 2002.
openaire +1 more source
This record contains a full paper presented at the 3rd Conference on Language Technologies (JT-2002), held in Ljubljana, Slovenia, in October 2002.
openaire +1 more source
Automatic lemmatization of Persian words*
Journal of Quantitative Linguistics, 2006Abstract This study presents a rather novel method for suffix and prefix stripping of Persian words. The method presented is a language independent one and mostly relies on a specially arranged corpus composed of a list of roots, word-forms, prefixes, and suffixes which has been manually compiled.
openaire +1 more source
LIT: Rule Based Italian Lemmatizer
2019In natural language processing applications, such as those related to question answering systems, and more specifically, to semantic role labelling, an important task to perform during the text normalization phase is lemmatization which consists in determining those two words which have the same root, despite their surface differences.
Simone Molendini +2 more
openaire +1 more source
Lemmatization and Headword Structure
1997Abstract We now come to the actual structure and presentation of Palsgrave’s word list, and, as we can see from the following examples, his lexicographical method of presentation differs greatly from modem practice in bilingual English-French dictionaries.
openaire +1 more source
The Croatian Lemmatization Server
2005The need for lemmatization in inflectionally rich languages is indisputable: it is applicable for the whole range of procedures, from text-search up to parsing. From two predominant approaches to lemmatization (algorithmic— generally rule-based and realized with FSA— and relational— generally data-driven and realized with databases ...
openaire +1 more source

