Visual question answering - Open Access .click

Some of the following articles may not be open access.

Data Augmentation for Visual Question Answering

Proceedings of the 10th International Conference on Natural Language Generation, 2017
Data augmentation is widely used to train deep neural networks for image classification tasks. Simply flipping images can help learning tremendously by increasing the number of training images by a factor of two. However, little work has been done studying data augmentation in natural language processing.
Kushal Kafle +2 more

Semantically Guided Visual Question Answering

2018 IEEE Winter Conference on Applications of Computer Vision (WACV), 2018
We present a novel approach to enhance the challenging task of Visual Question Answering (VQA) by incorporating and enriching semantic knowledge in a VQA model. We first apply Multiple Instance Learning (MIL) to extract a richer visual representation addressing concepts beyond objects such as actions and colors.
Handong Zhao +3 more

Affective Visual Question Answering Network

2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), 2018
Visual Question Answering (VQA) has recently attracted considerable attention from researchers in the trending field of deep learning. The need to improve VQA models by focusing on local regions of images has resulted in the development of various attention models.
Nelson Ruwa +3 more

Indic Visual Question Answering

2022 IEEE International Conference on Signal Processing and Communications (SPCOM), 2022
Aditya Chandrasekar, Amey Shimpi, Dinesh Naik +2 more

An Answer FeedBack Network for Visual Question Answering

2023 International Joint Conference on Neural Networks (IJCNN), 2023
Weidong Tian, Ruihua Tian, Zhongqiu Zhao, Quan Ren +3 more

Ques-to-Visual Guided Visual Question Answering

2022 IEEE International Conference on Image Processing (ICIP), 2022
Xiangyu Wu +3 more

Question-conditioned debiasing with focal visual context fusion for visual question answering

Knowledge-Based Systems, 2023
Fengyu Zhou, Huijuan Xu

Multimodal feature fusion by relational reasoning and attention for visual question answering

Information Fusion, 2020
Weifeng Zhang, Haiyang Hu, Zengchang Qin +2 more

CAAN: Context-Aware attention network for visual question answering

Pattern Recognition, 2022
Dezhi Han, Chin-Chen Chang

OpenViVQA: Task, dataset, and multimodal fusion models for visual question answering in Vietnamese

Information Fusion, 2023
Nghia Hieu Nguyen, Kiet Van Nguyen
