Visual question answering - Open Access .click

Results 21 to 30 of about 34,250 (255)

Pathological Visual Question Answering [PDF]

, 2020
We develop datasets and methods to perform visual question answering on pathology images.
Xuehai He +6 more
openaire +2 more sources

Survey of Multimodal Medical Question Answering

BioMedInformatics, 2023
Multimodal medical question answering (MMQA) is a vital area bridging healthcare and Artificial Intelligence (AI). This survey methodically examines the MMQA research published in recent years.
Hilmi Demirhan, Wlodek Zadrozny
doaj +1 more source

K-VQA: A visual question answering method

Journal of Hebei University of Science and Technology, 2020
The types of questions answered by the visual question answering of images and texts are roughly divided into two types. The first type is the questions that can get the answers directly from the images, and the second type is the questions that need the
Hongbin GAO, Jinying MAO, Huiyong WANG
doaj +1 more source

Localized Questions in Medical Visual Question Answering

, 2023
Visual Question Answering (VQA) models aim to answer natural language questions about given images. Due to its ability to ask questions that differ from those used when training the model, medical VQA has received substantial attention in recent years.
Sergio Tascon-Morales +2 more
openaire +2 more sources

Visual Question Answering: A Tutorial [PDF]

IEEE Signal Processing Magazine, 2017
The task of visual question answering (VQA) is receiving increasing interest from researchers in both the computer vision and natural language processing fields. Tremendous advances have been seen in the field of computer vision due to the success of deep learning, in particular on low- and midlevel tasks, such as image segmentation or object ...
Damien Teney, Qi Wu 0001, Anton van den Hengel +2 more
openaire +2 more sources

Visual Question Answering Method Based on Counterfactual Thinking [PDF]

Jisuanji kexue, 2022
Visual question answering(VQA) is a multi-modal task that combines computer vision and natural language proces-sing,which is extremely challenging.However,the current VQA model is often misled by the apparent correlation in the data,and the output of the
YUAN De-sen, LIU Xiu-jing, WU Qing-bo, LI Hong-liang, MENG Fan-man, NGAN King-ngi, XU Lin-feng
doaj +1 more source

Robust Explanations for Visual Question Answering [PDF]

2020 IEEE Winter Conference on Applications of Computer Vision (WACV), 2020
WACV-2020 (Accepted)
Badri N. Patro, Shivansh Patel, Vinay P. Namboodiri +2 more
openaire +3 more sources

Unanswerable Questions About Images and Texts

Frontiers in Artificial Intelligence, 2020
Questions about a text or an image that cannot be answered raise distinctive issues for an AI. This note discusses the problem of unanswerable questions in VQA (visual question answering), in QA (textual question answering), and in AI generally.
Ernest Davis
doaj +1 more source

Visual Question Answering

International Journal of Advanced Research in Science, Communication and Technology, 2022
We propose the task of free-form and open- ended Visual Question Answering (VQA). Given an image and a natural language question about the image, the task is to provide an accurate natural language answer. Mirroring real-world scenarios, such as helping the visually impaired, both the questions and answers are open-ended.
Qi Wu 0001 +4 more
openaire +2 more sources

Learning Answer Embeddings for Visual Question Answering [PDF]

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018
We propose a novel probabilistic model for visual question answering (Visual QA). The key idea is to infer two sets of embeddings: one for the image and the question jointly and the other for the answers. The learning objective is to learn the best parameterization of those embeddings such that the correct answer has higher likelihood among all ...
Hexiang Hu, Wei-Lun Chao, Fei Sha
openaire +2 more sources

vqa
attention mechanism
natural language processing

computer vision
deep learning
medicine

question answering