
Annif Analyzer Shootout: Comparing text lemmatization methods for automated subject indexing

open access: yes, Code4Lib Journal, 2022
Automated text classification is an important function for many AI systems relevant to libraries, including automated subject indexing and classification.
Osma Suominen, Ilkka Koskenniemi
doaj  

Joint Lemmatization and Morphological Tagging with Lemming [PDF]

open access: yes, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
We present LEMMING, a modular log-linear model that jointly models lemmatization and tagging and supports the integration of arbitrary global features. It is trainable on corpora annotated with gold standard tags and lemmata and does not rely on morphological dictionaries or analyzers.
Muller, Thomas   +3 more
openaire   +2 more sources

Editing Middle English Medical Manuscripts : The Case of Glasgow University Library MS Hunter 509

open access: yes, Journal of English Studies, 2011
It has been pointed out that the editing of a scientific treatise should be “an extended and challenging exercise in judgment, requiring an earnest commitment to scholarship” (Keiser 1998: 110).
María Laura Esteban-Segura
doaj   +1 more source

Processing Tools for Greek and Other Languages of the Christian Middle East [PDF]

open access: yes, Journal of Data Mining and Digital Humanities, 2018
This paper presents some computer tools and linguistic resources of the GREgORI project. These developments allow automated processing of texts written in the main languages of the Christian Middle East, such as Greek, Arabic, Syriac, Armenian and ...
Bastien Kindt
doaj   +1 more source

Contextual Urdu Lemmatization Using Recurrent Neural Network Models

open access: yes, Mathematics, 2023
In the field of natural language processing, machine translation is a colossally developing research area that helps humans communicate more effectively by bridging the linguistic gap.
Rabab Hafeez   +7 more
doaj   +1 more source

Effect of Tuned Parameters on a LSA MCQ Answering Model [PDF]

open access: yes, 2009
This paper presents the current state of a work in progress, whose objective is to better understand the effects of factors that significantly influence the performance of Latent Semantic Analysis (LSA).
A. C. Graesser   +18 more
core   +6 more sources

A novel Arabic lemmatization algorithm [PDF]

open access: yes, Proceedings of the second workshop on Analytics for noisy unstructured text data, 2008
Tokenization is a fundamental step in processing textual data preceding the tasks of information retrieval, text mining, and natural language processing. Tokenization is a language-dependent approach, including normalization, stop words removal, lemmatization and stemming. Both stemming and lemmatization share a common goal of reducing a word to its ...
Eiman Al-Shammari, Jessica Lin
openaire   +1 more source

BanglaLem: A Transformer-based Bangla Lemmatizer with an Enhanced Dataset

open access: yes, Systems and Soft Computing
Lemmatization plays a crucial role in various natural language processing (NLP) tasks, such as information retrieval, sentiment analysis, text summarization, and text classification.
Md Fuadul Islam   +4 more
doaj   +1 more source

The lemmatization of Old English Verbs from the second weak class on a lexical database

open access: yes, Journal of English Studies, 2015
This article compiles a list of lemmas of the second class weak verbs of Old English by using the latest version of the lexical database Nerthus, which incorporates the texts of the Dictionary of Old English Corpus.
Marta Tío Sáenz
doaj   +1 more source
