Results 161 to 170 of about 10,385 (175)
Some of the next articles are maybe not open access.

video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models

arXiv.org
Changli Tang   +7 more
semanticscholar   +1 more source

A Multi-instance Multi-label Dual Learning Approach for Video Captioning

ACM Transactions on Multimedia Computing, Communications and Applications, 2021
Wanting Ji
exaly  

Fused GRU with semantic-temporal attention for video captioning

Neurocomputing, 2020
Lianli Gao, Jingkuan Song
exaly  

An attention-based hybrid deep learning approach for bengali video captioning

Journal of King Saud University - Computer and Information Sciences, 2023
exaly  

VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks

arXiv.org
Xinlong Chen   +9 more
semanticscholar   +1 more source

Semantic-filtered Soft-Split-Aware video captioning with audio-augmented feature

Neurocomputing, 2019
Yuecong Xu, Jianfei Yang, Kezhi Mao
exaly  

Video Captioning by Adversarial LSTM

IEEE Transactions on Image Processing, 2018
Yang Yang, Yi Bin, Alan Hanjalic
exaly  

Home - About - Disclaimer - Privacy