An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages
In this paper, we present Watasense, an unsupervised system for word sense disambiguation. Given a sentence, the system chooses the most relevant sense of each input word with respect to the semantic similarity between the given sentence and the synset constituting the sense of the target word. Watasense has two modes of operation. The sparse mode uses
Dmitry Ustalov+5 more
+8 more sources
A comprehensive dataset for Arabic word sense disambiguation [PDF]
This data paper introduces a comprehensive dataset tailored for word sense disambiguation tasks, explicitly focusing on a hundred polysemous words frequently employed in Modern Standard Arabic.
Sanaa Kaddoura, Reem Nassar
doaj +2 more sources
Named entity disambiguation in short texts over knowledge graphs. [PDF]
The ever-growing usage of knowledge graphs (KGs) positions named entity disambiguation (NED) at the heart of designing accurate KG-driven systems such as query answering systems (QAS).
Bouarroudj W, Boufaida Z, Bellatreche L.
europepmc +2 more sources
Sense Unveiled: Enhancing Urdu Corpus for Nuanced Word Sense Disambiguation
Ambiguity in word meanings presents a significant challenge in natural language processing, necessitating robust techniques for Word Sense Disambiguation (WSD).
Sarfraz Bibi+2 more
doaj +2 more sources
GeneToList: A Web Application to Assist with Gene Identifiers for the Non-Bioinformatics-Savvy Scientist [PDF]
The increasing incorporation of omics technologies into biomedical research and translational medicine presents challenges to end users of the large and complex datasets that are generated by these methods.
Joshua D. Breidenbach+3 more
doaj +2 more sources
Word Sense Disambiguation Pipeline Framework for Low Resourced Morphologically Rich Languages
Resolving ambiguity problem is a prolonged natural language processing theoretical research challenge. Sesotho sa Leboa language is an official name for Sepedi or Northern Sotho language as known to be an official language among 11 others in South Africa spoken by 4.7 million people.
Mosima Anna Masethe+3 more
openalex +3 more sources
PubMed Computed Authors in 2024: an open resource of disambiguated author names in biomedical literature. [PDF]
Abstract Summary Over 55% of author names in PubMed are ambiguous: the same name is shared by different individual researchers. This poses significant challenges on precise literature retrieval for author name queries, a common behavior in biomedical literature search.
Tian S+4 more
europepmc +3 more sources
Web Resource Sense Disambiguation in Web of Data
JUCS - Journal of Universal Computer Science Volume Nr.
Farzam Matinfar+2 more
openalex +3 more sources
Exploring hidden pathways to sustainable manufacturing for cyber-physical production systems [PDF]
Future manufacturing scenarios will likely be built around cyber-physical production systems. To succeed, this new manufacturing paradigm will also have to comply with the golden rule of sustainability.
Gianfranco Pedone+2 more
doaj +2 more sources