Results 231 to 240 of about 34,250 (255)
Some of the next articles are maybe not open access.

Data Augmentation for Visual Question Answering

Proceedings of the 10th International Conference on Natural Language Generation, 2017
Data augmentation is widely used to train deep neural networks for image classification tasks. Simply flipping images can help learning tremendously by increasing the number of training images by a factor of two. However, little work has been done studying data augmentation in natural language processing.
Kushal Kafle   +2 more
openaire   +1 more source

Semantically Guided Visual Question Answering

2018 IEEE Winter Conference on Applications of Computer Vision (WACV), 2018
We present a novel approach to enhance the challenging task of Visual Question Answering (VQA) by incorporating and enriching semantic knowledge in a VQA model. We first apply Multiple Instance Learning (MIL) to extract a richer visual representation addressing concepts beyond objects such as actions and colors.
Handong Zhao   +3 more
openaire   +1 more source

Affective Visual Question Answering Network

2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), 2018
Visual Question Answering (VQA) has recently attracted considerable attention from researchers in the trending field of deep learning. The need to improve VQA models by focusing on local regions of images, has resulted in the development of various attention models.
Nelson Ruwa   +3 more
openaire   +2 more sources

Indic Visual Question Answering

2022 IEEE International Conference on Signal Processing and Communications (SPCOM), 2022
Aditya Chandrasekar   +2 more
openaire   +1 more source

An Answer FeedBack Network for Visual Question Answering

2023 International Joint Conference on Neural Networks (IJCNN), 2023
Weidong Tian 0001   +3 more
openaire   +1 more source

Ques-to-Visual Guided Visual Question Answering

2022 IEEE International Conference on Image Processing (ICIP), 2022
Xiangyu Wu   +3 more
openaire   +1 more source

Multimodal feature fusion by relational reasoning and attention for visual question answering

Information Fusion, 2020
Weifeng Zhang   +2 more
exaly  

CAAN: Context-Aware attention network for visual question answering

Pattern Recognition, 2022
Dezhi Han, Chin-Chen Chang
exaly  

Home - About - Disclaimer - Privacy