Multi-objective evolutionary optimization for dimensionality reduction of texts represented by synsets [PDF]
Despite new developments in machine learning classification techniques, improving the accuracy of spam filtering is a difficult task due to linguistic phenomena that limit its effectiveness.
Iñaki Vélez de Mendizabal+5 more
doaj +8 more sources
Sentiment Thesaurus, Synset and Word2Vec Based Improvement in Bigram Model for Classifying Product Reviews. [PDF]
Classifying product reviews is one of the tasks in Natural Language Processing by which the sentiment of the reviewer towards a product can be identified.
Poomagal S+4 more
europepmc +4 more sources
Automatically constructing Wordnet synsets [PDF]
Manually constructing a Wordnet is a difficult task, needing years of experts' time. As a first step to automatically construct full Wordnets, we propose approaches to generate Wordnet synsets for languages both resource-rich and resource-poor, using publicly available Wordnets, a machine translator and/or a single bilingual dictionary.
Khang Nhứt Lâm+2 more
arxiv +6 more sources
Sememe Prediction for BabelNet Synsets using Multilingual and Multimodal Information [PDF]
In linguistics, a sememe is defined as the minimum semantic unit of languages. Sememe knowledge bases (KBs), which are built by manually annotating words with sememes, have been successfully applied to various NLP tasks. However, existing sememe KBs only cover a few languages, which hinders the wide utilization of sememes.
Fanchao Qi+5 more
arxiv +7 more sources
The natural selection of words: Finding the features of fitness. [PDF]
We introduce a dataset for studying the evolution of words, constructed from WordNet and the Google Books Ngram Corpus. The dataset tracks the evolution of 4,000 synonym sets (synsets), containing 9,000 English words, from 1800 AD to 2000 AD.
Peter D Turney, Saif M Mohammad
doaj +3 more sources
Towards Automatic Construction of Filipino WordNet: Word Sense Induction and Synset Induction Using Sentence Embeddings [PDF]
Wordnets are indispensable tools for various natural language processing applications. Unfortunately, wordnets get outdated, and producing or updating wordnets can be slow and costly in terms of time and resources.
Dan John Velasco+7 more
openalex +3 more sources
A Synset Relation-enhanced Framework with a Try-again Mechanism for Word Sense Disambiguation [PDF]
Contextual embeddings have proved overwhelmingly effective for the task of Word Sense Disambiguation (WSD) compared with other sense representation techniques. However, these embeddings fail to embed sense knowledge in semantic networks.
Ming Wang, Yinglin Wang
openalex +2 more sources
Fighting with the Sparsity of Synonymy Dictionaries [PDF]
Graph-based synset induction methods, such as MaxMax and Watset, induce synsets by performing a global clustering of a synonymy graph. However, such methods are sensitive to the structure of the input synonymy graph: sparseness of the input dictionary can substantially reduce the quality of the extracted synsets. In this paper, we propose two different ...
Dmitry Ustalov+3 more
arxiv +3 more sources
Toward General Scene Graph: Integration of Visual Semantic Knowledge with Entity Synset Alignment [PDF]
A scene graph is a graph representation that explicitly captures high-level semantic knowledge of an image, such as objects, attributes of objects, and relationships between objects.
Woo Suk Choi+3 more
openalex +2 more sources
Sentiment Analysis of Image with Text Caption using Deep Learning Techniques. [PDF]
As a consequence of technical improvements in this field, people now actively express their views and opinions on social media platforms through pictures and text captions rather than plain text alone. With the advent of visual media such as images, videos, and GIFs, research on sentiment analysis has ...
Chaubey PK+7 more
europepmc +2 more sources