PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering
Medical Visual Question Answering (MedVQA) presents a significant opportunity to enhance diagnostic accuracy and healthcare delivery by leveraging artificial intelligence to interpret and answer questions based on medical images.
Zhao, Ziheng +6 more
core
ViDscribe: Multimodal AI for Customizing Audio Description and Question Answering in Online Videos.
Cheema MS +3 more
europepmc +1 more source
D²MNet: Difference-Aware Decoupling and Multi-Prompt Learning for Medical Difference Visual Question Answering.
Lai L, Ou W, Gou J, Liu Z.
europepmc +1 more source
Contrastive learning-based video quality assessment-jointed video vision transformer for video recognition.
Sun J, Mahoor M.
europepmc +1 more source
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models.
Lai Y +6 more
europepmc +1 more source
Uncover This Tech Term: Large Vision-Language Models in Radiology.
Faghani S, Park YW, Park JE.
europepmc +1 more source
A data-efficient 3D medical vision-language model using only a 2D encoder.
Lian Y, Xie Y, Jiang Y, Wang L, Yu H.
europepmc +1 more source
Collaborative positional attention for image to English question answering.
Li Y, Teng H.
europepmc +1 more source
A linguistic lens into vision-language models for open-ended question-answers in medical visual question answering.
Lameesa A +3 more
europepmc +1 more source
Attention re-alignment in multimodal large language models via intermediate-layer guidance.
Chen Y +5 more
europepmc +1 more source