Results 211 to 220 of about 20,321 (258)
PBC-Transformer: Interpreting Poultry Behavior Classification Using Image Caption Generation Techniques. [PDF]
Li J +7 more
europepmc +1 more source
Everyday challenges and solutions for individuals aging with deafness. [PDF]
Shende SA +3 more
europepmc +1 more source
An intelligent object detection and classification framework for assisting visually challenged persons using deep learning and improved crow search optimization. [PDF]
Khadidos AO, Yafoz A.
europepmc +1 more source
OSCaR: Object State Captioning and State Change Representation. [PDF]
Nguyen N +5 more
europepmc +1 more source
Some of the next articles are maybe not open access.
Related searches:
Related searches:
State-aware Video Procedural Captioning
Proceedings of the 29th ACM International Conference on Multimedia, 2021Video procedural captioning (VPC), which generates procedural text from instructional videos, is an essential task for scene understanding and real-world applications. The main challenge of VPC is to describe how to manipulate materials accurately. This paper focuses on this challenge by designing a new VPC task, generating a procedural text from the ...
Taichi Nishimura +4 more
openaire +1 more source
Multi-Perspective Video Captioning
Proceedings of the 29th ACM International Conference on Multimedia, 2021This work targets at the problems of comprehensive video captioning and the generation of multiple descriptions from different perspectives, termed asMulti-Perspective Video Captioning. We build and release a dataset named VidOR-MPVC, the first dataset for multi-perspective video captioning, where each video is annotated with multiple descriptions from
Yi Bin +4 more
openaire +1 more source
Video Captioning by Adversarial LSTM
IEEE Transactions on Image Processing, 2018In this paper, we propose a novel approach to video captioning based on adversarial learning and Long-Short Term Memory (LSTM). With this solution concept we aim at compensating for the deficiencies of LSTM-based video captioning methods that generally show potential to effectively handle temporal nature of video data when generating captions, but that
Yang Yang, Yi Bin, Alan Hanjalic
exaly +5 more sources
2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W), 2019
In recent years, developments in the field of computer vision have allowed deep learning-based techniques to surpass human-level performance. However, these advances have also culminated in the advent of adversarial machine learning techniques, capable of launching targeted image captioning attacks that easily fool deep learning models.
Suman Kalyan Adari +2 more
openaire +1 more source
In recent years, developments in the field of computer vision have allowed deep learning-based techniques to surpass human-level performance. However, these advances have also culminated in the advent of adversarial machine learning techniques, capable of launching targeted image captioning attacks that easily fool deep learning models.
Suman Kalyan Adari +2 more
openaire +1 more source

