Topic2features: a novel framework to classify noisy and sparse textual data using LDA topic distributions [PDF]
In supervised machine learning, specifically in classification tasks, selecting and analyzing the feature vector to achieve better results is one of the most important tasks.
Junaid Abdul Wahid +6 more
doaj +2 more sources
TACO: Topics in Algorithmic COde generation dataset
We introduce TACO, an open-source, large-scale code generation dataset, with a focus on the optics of algorithms, designed to provide a more challenging training dataset and evaluation benchmark in the field of code generation models. TACO includes competition-level programming questions that are more challenging, to enhance or evaluate problem ...
Rongao Li +8 more
openaire +2 more sources
Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection [PDF]
Emotion detection in dialogues is challenging as it often requires the identification of thematic topics underlying a conversation, the relevant commonsense knowledge, and the intricate transition patterns between the affective states.
Lixing Zhu +4 more
semanticscholar +1 more source
Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey [PDF]
Topic modeling is one of the most powerful techniques in text mining for data mining, latent data discovery, and finding relationships among data and text documents.
Hamed Jelodar +3 more
semanticscholar +1 more source
Quotation Recommendation for Multi-party Online Conversations Based on Semantic and Topic Fusion
Quotations are crucial for successful explanations and persuasions in interpersonal communications. However, finding what to quote in a conversation is challenging for humans. This work studies automatic quotation recommendation for online conversations.
Lingzhi Wang +2 more
semanticscholar +1 more source
TopiOCQA: Open-domain Conversational Question Answering with Topic Switching [PDF]
In a conversational question answering scenario, a questioner seeks to extract information about a topic through a series of interdependent questions and answers.
Vaibhav Adlakha +4 more
semanticscholar +1 more source
Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach [PDF]
Zero-shot text classification (0Shot-TC) is a challenging NLU problem to which little attention has been paid by the research community. 0Shot-TC aims to associate an appropriate label with a piece of text, irrespective of the text domain and the aspect (
Wenpeng Yin, Jamaal Hay, Dan Roth
semanticscholar +1 more source
Retrieval Topic Recurrent Memory Network for Remote Sensing Image Captioning
Remote sensing image (RSI) captioning aims to generate sentences to describe the content of RSIs. Generally, five sentences are used to describe the RSI in caption datasets.
Binqiang Wang +3 more
doaj +1 more source
Investigating topic bias in emotion classification [PDF]
In emotion classification, texts are assigned a conceptual emotion representation such as discrete labels or dimensions of cognitive appraisal. Emotion classifiers are typically not universally applicable, but base their classification decisions on ...
Wegge, Maximilian
core +1 more source
Recent Advances in Traffic Sign Recognition: Approaches and Datasets
Autonomous vehicles have become a topic of interest in recent times due to the rapid advancement of automobile and computer vision technology. The ability of autonomous vehicles to drive safely and efficiently relies heavily on their ability to ...
Xin Roy Lim +5 more
semanticscholar +1 more source

