Visual question answering - Open Access .click

Visual Question Answering Using Semantic Information from Image Descriptions

Proceedings of the International Florida Artificial Intelligence Research Society Conference, 2021
In this work, we propose a deep neural architecture that uses an attention mechanism which utilizes region-based image features, the natural language question asked, and semantic knowledge extracted from the regions of an image to produce open-ended ...
Tasmia Tasmia, Md Sultan Al Nahian, Brent Harrison +2 more
doaj +1 more source

Questioning the Stability of Visual Question Answering

CoRR
Visual Language Models (VLMs) have achieved remarkable progress, yet their reliability under small, meaning-preserving input changes remains poorly understood. We present the first large-scale, systematic study of VLM robustness to benign visual and textual perturbations: pixel-level shifts, light geometric transformations, padded rescaling ...
Amir Rosenfeld, Neta Glazer, Ethan Fetaya +2 more
openaire +2 more sources

Multimodal Encoder-Decoder Attention Networks for Visual Question Answering

IEEE Access, 2020
Visual Question Answering (VQA) is a multimodal task involving Computer Vision (CV) and Natural Language Processing (NLP); the goal is to establish a high-efficiency VQA model.
Chongqing Chen, Dezhi Han, Jun Wang
doaj +1 more source

VISALOGY: Answering Visual Analogy Questions

CoRR, 2015
To appear in NIPS ...
Fereshteh Sadeghi, C. Lawrence Zitnick, Ali Farhadi +2 more
openaire +3 more sources

The Role of Hematopoietic Cell Transplantation in Ataxia‐Telangiectasia

Pediatric Blood & Cancer, EarlyView.
ABSTRACT Background Ataxia‐telangiectasia (A‐T) is a DNA repair disorder characterized by neurodegeneration, immunodeficiency, and cancer predisposition. Hematopoietic cell transplantation (HCT) is an established therapy in related disorders such as Fanconi anemia (FA) and Nijmegen breakage syndrome (NBS), but its role in A‐T is unclear.
Laila Alkhouli +3 more
wiley +1 more source

Visual Question Answering in the Medical Domain

2023 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2023
8 pages, 7 figures, Accepted to DICTA 2023 ...
Louisa Canepa, Sonit Singh, Arcot Sowmya
openaire +2 more sources

An upstream open reading frame regulates expression of the mitochondrial protein Slm35 and mitophagy flux

FEBS Letters, EarlyView.
This study reveals how the mitochondrial protein Slm35 is regulated in Saccharomyces cerevisiae. The authors identify stress‐responsive DNA elements and two upstream open reading frames (uORFs) in the 5′ untranslated region of SLM35. One uORF restricts translation, and its mutation increases Slm35 protein levels and mitophagy.
Hernán Romo‐Casanueva +5 more
wiley +1 more source

Document Collection Visual Question Answering [PDF]

2021
Current tasks and methods in Document Understanding aim to process documents as single elements. However, documents are usually organized in collections (historical records, purchase invoices) that provide context useful for their interpretation.
Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny +2 more
openaire +2 more sources

Interrogating the immune landscape of microsatellite stable RAS‐mutated colon cancer

Molecular Oncology, EarlyView.
COLOSSUS project RAS‐mutated MSS colon cancer study explored transcriptomics and immune cell density by immunohistochemistry (IHC), Immunoscore (IS), ISIC/TuLIS scores, mutation counts, and detected different prevalences but similar microenvironment composition across immune markers with clinical relevance for future immunotherapy combination ...
Rodrigo Dienstmann +61 more
wiley +1 more source

Aesthetic Visual Question Answering of Photographs

CoRR, 2022
Aesthetic assessment of images can be categorized into two main forms: numerical assessment and language assessment. Aesthetics caption of photographs is the only task of aesthetic language assessment that has been addressed. In this paper, we propose a new task of aesthetic language assessment: aesthetic visual question and answering (AVQA) of images.
Xin Jin +6 more
openaire +2 more sources

vqa
attention mechanism
natural language processing

computer vision
deep learning
medicine

question answering