Results 21 to 30 of about 34,250 (255)
Pathological Visual Question Answering [PDF]
We develop datasets and methods to perform visual question answering on pathology images.
Xuehai He +6 more
openaire +2 more sources
Survey of Multimodal Medical Question Answering
Multimodal medical question answering (MMQA) is a vital area bridging healthcare and Artificial Intelligence (AI). This survey methodically examines the MMQA research published in recent years.
Hilmi Demirhan, Wlodek Zadrozny
doaj +1 more source
K-VQA: A visual question answering method
The types of questions answered by the visual question answering of images and texts are roughly divided into two types. The first type is the questions that can get the answers directly from the images, and the second type is the questions that need the
Hongbin GAO, Jinying MAO, Huiyong WANG
doaj +1 more source
Localized Questions in Medical Visual Question Answering
Visual Question Answering (VQA) models aim to answer natural language questions about given images. Due to its ability to ask questions that differ from those used when training the model, medical VQA has received substantial attention in recent years.
Sergio Tascon-Morales +2 more
openaire +2 more sources
Visual Question Answering: A Tutorial [PDF]
The task of visual question answering (VQA) is receiving increasing interest from researchers in both the computer vision and natural language processing fields. Tremendous advances have been seen in the field of computer vision due to the success of deep learning, in particular on low- and midlevel tasks, such as image segmentation or object ...
Damien Teney +2 more
openaire +2 more sources
Visual Question Answering Method Based on Counterfactual Thinking [PDF]
Visual question answering(VQA) is a multi-modal task that combines computer vision and natural language proces-sing,which is extremely challenging.However,the current VQA model is often misled by the apparent correlation in the data,and the output of the
YUAN De-sen, LIU Xiu-jing, WU Qing-bo, LI Hong-liang, MENG Fan-man, NGAN King-ngi, XU Lin-feng
doaj +1 more source
Robust Explanations for Visual Question Answering [PDF]
WACV-2020 (Accepted)
Badri N. Patro +2 more
openaire +3 more sources
Unanswerable Questions About Images and Texts
Questions about a text or an image that cannot be answered raise distinctive issues for an AI. This note discusses the problem of unanswerable questions in VQA (visual question answering), in QA (textual question answering), and in AI generally.
Ernest Davis
doaj +1 more source
We propose the task of free-form and open- ended Visual Question Answering (VQA). Given an image and a natural language question about the image, the task is to provide an accurate natural language answer. Mirroring real-world scenarios, such as helping the visually impaired, both the questions and answers are open-ended.
Qi Wu 0001 +4 more
openaire +2 more sources
Learning Answer Embeddings for Visual Question Answering [PDF]
We propose a novel probabilistic model for visual question answering (Visual QA). The key idea is to infer two sets of embeddings: one for the image and the question jointly and the other for the answers. The learning objective is to learn the best parameterization of those embeddings such that the correct answer has higher likelihood among all ...
Hexiang Hu, Wei-Lun Chao, Fei Sha
openaire +2 more sources

