Results 51 to 60 of about 34,250 (255)
Visual Question Answering Using Semantic Information from Image Descriptions
In this work, we propose a deep neural architecture that uses an attention mechanism which utilizes region based image features, the natural language question asked, and semantic knowledge extracted from the regions of an image to produce open-ended ...
Tasmia Tasmia +2 more
doaj +1 more source
Questioning the Stability of Visual Question Answering
Visual Language Models (VLMs) have achieved remarkable progress, yet their reliability under small, meaning-preserving input changes remains poorly understood. We present the first large-scale, systematic study of VLM robustness to benign visual and textual perturbations: pixel-level shifts, light geometric transformations, padded rescaling ...
Amir Rosenfeld +2 more
openaire +2 more sources
Multimodal Encoder-Decoder Attention Networks for Visual Question Answering
Visual Question Answering (VQA) is a multimodal task involving Computer Vision (CV) and Natural Language Processing (NLP), the goal is to establish a high-efficiency VQA model.
Chongqing Chen, Dezhi Han, Jun Wang
doaj +1 more source
VISALOGY: Answering Visual Analogy Questions
To appear in NIPS ...
Fereshteh Sadeghi +2 more
openaire +3 more sources
The Role of Hematopoietic Cell Transplantation in Ataxia‐Telangiectasia
ABSTRACT Background Ataxia‐telangiectasia (A‐T) is a DNA repair disorder characterized by neurodegeneration, immunodeficiency, and cancer predisposition. Hematopoietic cell transplantation (HCT) is an established therapy in related disorders such as Fanconi anemia (FA) and Nijmegen breakage syndrome (NBS), but its role in A‐T is unclear.
Laila Alkhouli +3 more
wiley +1 more source
Visual Question Answering in the Medical Domain
8 pages, 7 figures, Accepted to DICTA 2023 ...
Louisa Canepa, Sonit Singh, Arcot Sowmya
openaire +2 more sources
This study reveals how the mitochondrial protein Slm35 is regulated in Saccharomyces cerevisiae. The authors identify stress‐responsive DNA elements and two upstream open reading frames (uORFs) in the 5′ untranslated region of SLM35. One uORF restricts translation, and its mutation increases Slm35 protein levels and mitophagy.
Hernán Romo‐Casanueva +5 more
wiley +1 more source
Document Collection Visual Question Answering [PDF]
Current tasks and methods in Document Understanding aims to process documents as single elements. However, documents are usually organized in collections (historical records, purchase invoices), that provide context useful for their interpretation.
Rubèn Tito +2 more
openaire +2 more sources
Interrogating the immune landscape of microsatellite stable RAS‐mutated colon cancer
COLOSSUS project RAS‐mutated MSS colon cancer study explored transcriptomics and immune cell density by immunohistochemistry (IHC), Immunoscore (IS), ISIC/TuLIS scores, mutation counts, and detected different prevalences but similar microenvironment composition across immune markers with clinical relevance for future immunotherapy combination ...
Rodrigo Dienstmann +61 more
wiley +1 more source
Aesthetic Visual Question Answering of Photographs
Aesthetic assessment of images can be categorized into two main forms: numerical assessment and language assessment. Aesthetics caption of photographs is the only task of aesthetic language assessment that has been addressed. In this paper, we propose a new task of aesthetic language assessment: aesthetic visual question and answering (AVQA) of images.
Xin Jin 0015 +6 more
openaire +2 more sources

