Step by Step: A Gradual Approach for Dense Video Captioning
Dense video captioning aims to localize and describe events for storytelling in untrimmed videos. It is a conceptually very challenging task that requires concise, relevant, and coherent captioning based on high-quality event localization.
Wangyu Choi, Jiasi Chen, Jongwon Yoon
doaj +1 more source
OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts
Generating captions for images is a task that has recently received considerable attention. In this work we focus on caption generation for abstract scenes, or object layouts where the only information provided is a set of objects and their locations. We ...
Ordonez, Vicente, Yin, Xuwang
core +1 more source
Improving Image Captioning with Better Use of Caption
ACL ...
Shi, Zhan +3 more
openaire +2 more sources
Substrate Stress Relaxation Regulates Cell‐Mediated Assembly of Extracellular Matrix
Silicone‐based viscoelastic substrates with tunable stress relaxation reveal how matrix mechanics regulates cellular mechanosensing and cell‐mediated matrix remodelling in the stiff regime. High stress relaxation promotes assembly of fibronectin fibril‐like structures, increased nuclear localization of YAP and formation of β1 integrin‐enriched ...
Jonah L. Voigt +2 more
wiley +1 more source
Attend to You: Personalized Image Captioning with Context Sequence Memory Networks
We address personalization issues of image captioning, which have not been discussed yet in previous research. For a query image, we aim to generate a descriptive sentence, accounting for prior knowledge such as the user's active vocabularies in previous ...
Kim, Byeongchang +2 more
core +1 more source
Generating Video Descriptions with Topic Guidance
Generating video descriptions in natural language (a.k.a. video captioning) is a more challenging task than image captioning as the videos are intrinsically more complicated than images in two aspects.
Ba Lei Jimmy +4 more
core +1 more source
Polymorph engineering in ErMnO3 enables low‐voltage, forming‐free threshold switching with tunable negative differential resistance. Conducting orthorhombic regions embedded in an insulating hexagonal matrix provide controlled Joule‐heating‐enhanced Poole–Frenkel transport. The hexagonal phase prevents excessive heating and breakdown.
Rong Wu +8 more
wiley +1 more source
Social Image Captioning: Exploring Visual Attention and User Attention
Image captioning with a natural language has been an emerging trend. However, the social image, associated with a set of user-contributed tags, has been rarely investigated for a similar task.
Leiquan Wang +5 more
doaj +1 more source
Methodologies that utilize Deep Learning offer great potential for applications that automatically attempt to generate captions or descriptions about images and video frames.
Soheyla Amirian +3 more
doaj +1 more source
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
It is well believed that video captioning is a fundamental but challenging task in both computer vision and artificial intelligence fields. The prevalent approach is to map an input video to a variable-length output sentence in a sequence to sequence ...
Chao, Hongyang +5 more
core +1 more source

