Video captioning - Open Access .click

Weakly Supervised Dense Video Captioning

2017
This paper focuses on a novel and challenging vision task, dense video captioning, which aims to automatically describe a video clip with multiple informative and diverse caption sentences.
Chen, Yurong +6 more
core +1 more source

Soft Neural Interfaces for Circuit‐Level Analysis of Magnetogenetic Deep Brain Stimulation in Parkinson's Disease Models

Advanced Healthcare Materials, EarlyView.
Magnetogenetic deep brain stimulation (MG‐DBS) represents a wireless neuromodulation that has demonstrated long‐lasting behavioral benefits in Parkinson's disease models. However, the circuit‐level mechanisms underlying these therapeutic effects have remained uncharacterized due to limitations of conventional neural interfaces.
Jakyoung Lee +10 more
wiley +1 more source

Excitation Backprop for RNNs

2018
Deep models are state-of-the-art for many vision tasks including video action recognition and video captioning. Models are trained to caption or classify activity in videos, but little is known about the evidence used to make such decisions.
Bargal, Sarah Adel +5 more
core +1 more source

Video Captioning with Tube Features

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Visual feature plays an important role in the video captioning task. Considering that the video content is mainly composed of the activities of salient objects, it has restricted the caption quality of current approaches which just focus on global frame features while paying less attention to the salient objects.
Bin Zhao, Xuelong Li, Xiaoqiang Lu
openaire +1 more source

Assembling a True “Olympic Gel” From over 16 000 Combinatorial DNA Rings

Advanced Materials, EarlyView.
Olympic gels are an elusive class of soft matter, consisting of molecular networks held together purely by mechanically interlocked rings. Their topological structure promises unique properties and functions, but their synthesis has proven notoriously difficult.
Sarah K. Speed +9 more
wiley +1 more source

Liquids as Reinforcements for Anisotropic and Tough Soft Matter Composites

Advanced Materials, EarlyView.
Liquid metal droplets are shaped and oriented within soft elastomers to create all‐soft matter composites with programmable mechanical anisotropy. These liquid inclusions act as fiber‐like reinforcements, enabling directional stiffness, enhanced toughness, and controlled crack steering under extreme deformation, offering new routes to resilient soft ...
Gwyneth M. Schloer +5 more
wiley +1 more source

Multilevel Language and Vision Integration for Text-to-Clip Retrieval

2018
We address the problem of text-based activity retrieval in video. Given a sentence describing an activity, our task is to retrieve matching clips from an untrimmed video.
He, Kun +5 more
core +1 more source

Grounding Large Language Models for Robot Task Planning Using Closed‐Loop State Feedback

Advanced Robotics Research, EarlyView.
BrainBody‐Large Language Model (LLM) introduces a hierarchical, feedback‐driven planning framework where two LLMs coordinate high‐level reasoning and low‐level control for robotic tasks. By grounding decisions in real‐time state feedback, it reduces hallucinations and improves task reliability.
Vineet Bhat +4 more
wiley +1 more source

Hierarchical Photo-Scene Encoder for Album Storytelling

2019
In this paper, we propose a novel model with a hierarchical photo-scene encoder and a reconstructor for the task of album storytelling. The photo-scene encoder contains two sub-encoders, namely the photo and scene encoders, which are stacked together and
Jiang, Wenhao +4 more
core +1 more source

Multimodal Wearable Biosensing Meets Multidomain AI: A Pathway to Decentralized Healthcare

Advanced Science, EarlyView.
Multimodal biosensing meets multidomain AI. Wearable biosensors capture complementary biochemical and physiological signals, while cross‐device, population‐aware learning aligns noisy, heterogeneous streams. This Review distills key sensing modalities, fusion and calibration strategies, and privacy‐preserving deployment pathways that transform ...
Chenshu Liu +10 more
wiley +1 more source

deep learning
computer vision
dense video captioning

lstm
natural language processing
video description

arabic video captioning