Results 1 to 10 of about 20,287 (174)

Contrastive Learning for Morphological Disambiguation Using Large Language Models in Low-Resource Settings [PDF]

open access: goldApplied Sciences
In this paper, a contrastive learning approach for morphological disambiguation (MD) using large language models (LLMs) is presented. A contrastive loss function is introduced for training the approach, which reduces the distance between the correct ...
Gulmira Tolegen   +2 more
doaj   +3 more sources

Minimalist Entity Disambiguation for Mid-Resource Languages [PDF]

open access: goldProceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing (SustaiNLP), 2023
For many languages and applications, even though enough data is available for training Named Entity Disambiguation (NED) systems, few off-the-shelf models are available for use in practice. This is due to both the large size of state-of-the-art models, and to the computational requirements for recreating them from scratch.
Benno Kruit
openalex   +3 more sources

An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages [PDF]

open access: green, 2018
In this paper, we present Watasense, an unsupervised system for word sense disambiguation. Given a sentence, the system chooses the most relevant sense of each input word with respect to the semantic similarity between the given sentence and the synset constituting the sense of the target word. Watasense has two modes of operation. The sparse mode uses
Dmitry Ustalov   +5 more
  +6 more sources

Exploiting a lexical resource for discourse connective disambiguation in German [PDF]

open access: goldProceedings of the 28th International Conference on Computational Linguistics, 2020
In this paper we focus on connective identification and sense classification for explicit discourse relations in German, as two individual sub-tasks of the overarching Shallow Discourse Parsing task. We successively augment a purely-empirical approach based on contextualised embeddings with linguistic knowledge encoded in a connective lexicon.
Peter Bourgonje, Manfred Stede
openalex   +2 more sources

Hybrid artificial intelligence architectures for automatic text correction in the Kazakh language [PDF]

open access: yesFrontiers in Artificial Intelligence
The Kazakh language, as an agglutinative and morphologically rich language, presents significant challenges for the development of natural language processing (NLP) tools.
Laura Baitenova   +4 more
doaj   +2 more sources

Unsupervised Named Entity Disambiguation for Low Resource Domains [PDF]

open access: greenProceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
In the ever-evolving landscape of natural language processing and information retrieval, the need for robust and domain-specific entity linking algorithms has become increasingly apparent. It is crucial in a considerable number of fields such as humanities, technical writing and biomedical sciences to enrich texts with semantics and discover more ...
D. V. Datta, Soumajit Pramanik
  +6 more sources

Web Resource Sense Disambiguation in Web of Data

open access: green, 2020
JUCS - Journal of Universal Computer Science Volume Nr.
Farzam Matinfar   +2 more
openalex   +4 more sources

Data sets for author name disambiguation: an empirical analysis and a new resource [PDF]

open access: hybridScientometrics, 2017
Data sets of publication meta data with manually disambiguated author names play an important role in current author name disambiguation (AND) research. We review the most important data sets used so far, and compare their respective advantages and shortcomings. From the results of this review, we derive a set of general requirements to future AND data
Mark-Christoph Müller   +2 more
openalex   +4 more sources

Word Sense Disambiguation Pipeline Framework for Low Resourced Morphologically Rich Languages

open access: greenSSRN Electronic Journal, 2023
Resolving ambiguity problem is a prolonged natural language processing theoretical research challenge. Sesotho sa Leboa language is an official name for Sepedi or Northern Sotho language as known to be an official language among 11 others in South Africa spoken by 4.7 million people.
Mosima Anna Masethe   +3 more
openalex   +3 more sources

Hybrid Transformer-Based Large Language Models for Word Sense Disambiguation in the Low-Resource Sesotho sa Leboa Language [PDF]

open access: goldApplied Sciences
This study addresses a lexical ambiguity issue in Sesotho sa Leboa that arises from terms with various meanings, often known as homonyms or polysemous words.
Hlaudi Daniel Masethe   +4 more
doaj   +2 more sources

Home - About - Disclaimer - Privacy