
EdgeVidCap: A Channel-Spatial Dual-Branch Lightweight Video Captioning Model for IoT Edge Cameras

open access: yes
Sensors (Basel)
Guo L   +9 more
europepmc   +1 more source

Multimodal generative AI for interpreting 3D medical images and videos

open access: yes
NPJ Digit Med
Lee JO   +4 more
europepmc   +1 more source

Learning Semantic Features for Dense Video Captioning

open access: yes
Journal of KIISE, 2019
Sujin Lee, Incheol Kim
openaire   +1 more source

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding

open access: yes
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit
Tang F   +20 more
europepmc   +1 more source
