In this paper, a contrastive learning approach for morphological disambiguation (MD) using large language models (LLMs) is presented. A contrastive loss function is introduced for training the approach, which reduces the distance between the correct ...
Gulmira Tolegen +2 more
doaj +4 more sources
Hybrid artificial intelligence architectures for automatic text correction in the Kazakh language [PDF]
The Kazakh language, as an agglutinative and morphologically rich language, presents significant challenges for the development of natural language processing (NLP) tools.
Laura Baitenova +4 more
doaj +2 more sources
Leveraging large language models for rare disease named entity recognition. [PDF]
Named Entity Recognition (NER) in the rare disease domain poses unique challenges due to limited labeled data, semantic ambiguity between entity types, and long-tail distributions.
Nan Miles Xi, Yu Deng, Lin Wang
doaj +2 more sources
A comprehensive dataset for Arabic word sense disambiguation [PDF]
This data paper introduces a comprehensive dataset tailored for word sense disambiguation tasks, explicitly focusing on a hundred polysemous words frequently employed in Modern Standard Arabic.
Sanaa Kaddoura, Reem Nassar
doaj +2 more sources
AI-Driven Medical Device Risk Management: A New Paradigm Integrating Large Language Models and Prompt Engineering for Standard-Risk Knowledge Graph Construction and Application [PDF]
Wanting Zhu,1 Peiming Zhang,1 Wenke Xia,1 Ziming Gao,2 Weiqi Li,1 Ruixue Tian,3 Li Wang4 1School of Health Science and Engineering, University of Shanghai for Science and Technology, Educational Institution, Shanghai, People’s Republic of China ...
Zhu W +6 more
doaj +2 more sources
PubMed Computed Authors in 2024: an open resource of disambiguated author names in biomedical literature. [PDF]
Abstract Summary Over 55% of author names in PubMed are ambiguous: the same name is shared by different individual researchers. This poses significant challenges on precise literature retrieval for author name queries, a common behavior in biomedical literature search.
Tian S +4 more
europepmc +3 more sources
Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study [PDF]
Sentiment analysis in research involves the processing and analysis of sentiments from textual data. The sentiment analysis for high resource languages such as English and French has been carried out effectively in the past. However, its applications are
Muhammad Irzam Liaqat +4 more
doaj +2 more sources
Efficient estimation of Hindi WSD with distributed word representation in vector space
Word Sense Disambiguation (WSD) is significant for improving the accuracy of the interpretation of a Natural language text. Various supervised learning-based models and knowledge-based models have been developed in the literature for WSD of the language ...
Archana Kumari, D.K. Lobiyal
doaj +1 more source
Exploiting Semantic Role Resources for Preposition Disambiguation [PDF]
This article describes how semantic role resources can be exploited for preposition disambiguation. The main resources include the semantic role annotations provided by the Penn Treebank and FrameNet tagged corpora. The resources also include the assertions contained in the Factotum knowledge base, as well as information from Cyc and Conceptual Graphs.
Tom O'Hara, Janyce Wiebe
openaire +1 more source
The Arabic text can be translated into English using a variety of machine translation techniques. The translation of Arabic text into English still poses a number of challenges in contemporary Arabic.
Shahab Ahmad Almaaytah +1 more
doaj +1 more source

