Multimodal agent interfaces and system architectures for health and fitness companions [PDF]
Multimodal conversational spoken dialogues using physical and virtual agents provide a potential interface to motivate and support users in the domain of health and fitness.
Cavazza, Marc+9 more
core +1 more source
This section used to be called Visual Anthropology. Its new name—Multimodal Anthropologies—reflects changes in the media ecologies we engage as anthropologists, changes that have broadened our perspective to include other forms of media practice, while
S. Collins+2 more
semanticscholar +1 more source
Variational Fusion for Multimodal Sentiment Analysis [PDF]
Multimodal fusion is considered a key step in multimodal tasks such as sentiment analysis, emotion detection, question answering, and others. Most of the recent work on multimodal fusion does not guarantee the fidelity of the multimodal representation with respect to the unimodal representations.
arxiv
Challenges in Transcribing Multimodal Data: A Case Study [PDF]
Computer-mediated communication (CMC) once meant principally text-based communication mediated by computers, but rapid technological advances in recent years have heralded an era of multimodal communication with a growing emphasis on audio and ...
core
Experimental focal ischemia in cats: changes in multimodality evoked potentials as related to local cerebral blood flow and ischemic brain edema. [PDF]
Keisuke Kataoka+3 more
openalex +1 more source
Rendering techniques for multimodal data [PDF]
Many different direct volume rendering methods have been developed to visualize 3D scalar fields on uniform rectilinear grids. However, little work has been done on rendering simultaneously various properties of the same 3D region measured with different
Ferré Bergadà, Maria+2 more
core +1 more source
MuRAR: A Simple and Effective Multimodal Retrieval and Answer Refinement Framework for Multimodal Question Answering [PDF]
Recent advancements in retrieval-augmented generation (RAG) have demonstrated impressive performance in the question-answering (QA) task. However, most previous works predominantly focus on text-based answers. While some studies address multimodal data, they still fall short in generating comprehensive multimodal answers, particularly for explaining ...
arxiv
TIMING MULTIMODAL EVENTS IN PIGEONS [PDF]
Ken Cheng, William A. Roberts
openalex +1 more source