Results 101 to 110 of about 2,605 (177)

Modularized Zero-shot VQA with Pre-trained Models

open access: yes
Large-scale pre-trained models (PTMs) show great zero-shot capabilities. In this paper, we study how to leverage them for zero-shot visual question answering (VQA). Our approach is motivated by a few observations.
Jiang, Jing, Cao, Rui
core  

Determining the Ensemble <i>N</i>-Representability of Reduced Density Matrices. [PDF]

open access: yesJ Chem Theory Comput
Oña OB   +7 more
europepmc   +1 more source

ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment

open access: yes
With the rapid growth of User-Generated Content (UGC) exchanged between users and sharing platforms, the need for video quality assessment in the wild has emerged. UGC is mostly acquired using consumer devices and undergoes multiple rounds of compression
Bull, David   +2 more
core  

Say It My Way: Exploring Control in Conversational Visual Question Answering with Blind Users. [PDF]

open access: yesProc SIGCHI Conf Hum Factor Comput Syst
Zeraati FZ   +4 more
europepmc   +1 more source

Home - About - Disclaimer - Privacy