Results 211 to 220 of about 20,321 (258)

OSCaR: Object State Captioning and State Change Representation. [PDF]

open access: yesFind ACL NAACL
Nguyen N   +5 more
europepmc   +1 more source

State-aware Video Procedural Captioning

Proceedings of the 29th ACM International Conference on Multimedia, 2021
Video procedural captioning (VPC), which generates procedural text from instructional videos, is an essential task for scene understanding and real-world applications. The main challenge of VPC is to describe how to manipulate materials accurately. This paper focuses on this challenge by designing a new VPC task, generating a procedural text from the ...
Taichi Nishimura   +4 more
openaire   +1 more source

Multi-Perspective Video Captioning

Proceedings of the 29th ACM International Conference on Multimedia, 2021
This work targets at the problems of comprehensive video captioning and the generation of multiple descriptions from different perspectives, termed asMulti-Perspective Video Captioning. We build and release a dataset named VidOR-MPVC, the first dataset for multi-perspective video captioning, where each video is annotated with multiple descriptions from
Yi Bin   +4 more
openaire   +1 more source

Video Captioning by Adversarial LSTM

IEEE Transactions on Image Processing, 2018
In this paper, we propose a novel approach to video captioning based on adversarial learning and Long-Short Term Memory (LSTM). With this solution concept we aim at compensating for the deficiencies of LSTM-based video captioning methods that generally show potential to effectively handle temporal nature of video data when generating captions, but that
Yang Yang, Yi Bin, Alan Hanjalic
exaly   +5 more sources

Adversarial Video Captioning

2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W), 2019
In recent years, developments in the field of computer vision have allowed deep learning-based techniques to surpass human-level performance. However, these advances have also culminated in the advent of adversarial machine learning techniques, capable of launching targeted image captioning attacks that easily fool deep learning models.
Suman Kalyan Adari   +2 more
openaire   +1 more source

Home - About - Disclaimer - Privacy