Annif Analyzer Shootout: Comparing text lemmatization methods for automated subject indexing
Automated text classification is an important function for many AI systems relevant to libraries, including automated subject indexing and classification.
Osma Suominen, Ilkka Koskenniemi
doaj
Lemmatization and parsing with TACT preprocessing programs
R G Siemens
doaj +1 more source
Joint Lemmatization and Morphological Tagging with Lemming [PDF]
We present LEMMING, a modular log-linear model that jointly models lemmatization and tagging and supports the integration of arbitrary global features. It is trainable on corpora annotated with gold standard tags and lemmata and does not rely on morphological dictionaries or analyzers.
Muller, Thomas +3 more
openaire +2 more sources
Editing Middle English Medical Manuscripts: The Case of Glasgow University Library MS Hunter 509
It has been pointed out that the editing of a scientific treatise should be “an extended and challenging exercise in judgment, requiring an earnest commitment to scholarship” (Keiser 1998: 110).
María Laura Esteban-Segura
doaj +1 more source
Processing Tools for Greek and Other Languages of the Christian Middle East [PDF]
This paper presents some computer tools and linguistic resources of the GREgORI project. These developments allow automated processing of texts written in the main languages of the Christian Middle East, such as Greek, Arabic, Syriac, Armenian and ...
Bastien Kindt
doaj +1 more source
Contextual Urdu Lemmatization Using Recurrent Neural Network Models
In the field of natural language processing, machine translation is a colossally developing research area that helps humans communicate more effectively by bridging the linguistic gap.
Rabab Hafeez +7 more
doaj +1 more source
Effect of Tuned Parameters on a LSA MCQ Answering Model [PDF]
This paper presents the current state of a work in progress, whose objective is to better understand the effects of factors that significantly influence the performance of Latent Semantic Analysis (LSA).
A. C. Graesser +18 more
core +6 more sources
A novel Arabic lemmatization algorithm [PDF]
Tokenization is a fundamental step in processing textual data preceding the tasks of information retrieval, text mining, and natural language processing. Tokenization is a language-dependent approach, including normalization, stop words removal, lemmatization and stemming. Both stemming and lemmatization share a common goal of reducing a word to its ...
Eiman Al-Shammari, Jessica Lin
openaire +1 more source
BanglaLem: A Transformer-based Bangla Lemmatizer with an Enhanced Dataset
Lemmatization plays a crucial role in various natural language processing (NLP) tasks, such as information retrieval, sentiment analysis, text summarization, and text classification.
Md Fuadul Islam +4 more
doaj +1 more source
The lemmatization of Old English Verbs from the second weak class on a lexical database
This article compiles a list of lemmas of the second class weak verbs of Old English by using the latest version of the lexical database Nerthus, which incorporates the texts of the Dictionary of Old English Corpus.
Marta Tío Sáenz
doaj +1 more source

