Results 61 to 70 of about 20,321 (258)
Weakly Supervised Dense Video Captioning
This paper focuses on a novel and challenging vision task, dense video captioning, which aims to automatically describe a video clip with multiple informative and diverse caption sentences.
Chen, Yurong +6 more
core +1 more source
ABSTRACT Magnetogenetic deep brain stimulation (MG‐DBS) represents a wireless neuromodulation that has demonstrated long‐lasting behavioral benefits in Parkinson's disease models. However, the circuit‐level mechanisms underlying these therapeutic effects have remained uncharacterized due to limitations of conventional neural interfaces.
Jakyoung Lee +10 more
wiley +1 more source
Deep models are state-of-the-art for many vision tasks including video action recognition and video captioning. Models are trained to caption or classify activity in videos, but little is known about the evidence used to make such decisions.
Bargal, Sarah Adel +5 more
core +1 more source
Video Captioning with Tube Features [PDF]
Visual feature plays an important role in the video captioning task. Considering that the video content is mainly composed of the activities of salient objects, it has restricted the caption quality of current approaches which just focus on global frame features while paying less attention to the salient objects.
Bin Zhao, Xuelong Li, Xiaoqiang Lu
openaire +1 more source
Assembling a True “Olympic Gel” From over 16 000 Combinatorial DNA Rings
Olympic gels are an elusive class of soft matter, consisting of molecular networks held together purely by mechanically interlocked rings. Their topological structure promises unique properties and functions, but their synthesis has proven notoriously difficult.
Sarah K. Speed +9 more
wiley +1 more source
Liquids as Reinforcements for Anisotropic and Tough Soft Matter Composites
Liquid metal droplets are shaped and oriented within soft elastomers to create all‐soft matter composites with programmable mechanical anisotropy. These liquid inclusions act as fiber‐like reinforcements, enabling directional stiffness, enhanced toughness, and controlled crack steering under extreme deformation, offering new routes to resilient soft ...
Gwyneth M. Schloer +5 more
wiley +1 more source
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
We address the problem of text-based activity retrieval in video. Given a sentence describing an activity, our task is to retrieve matching clips from an untrimmed video.
He, Kun +5 more
core +1 more source
Grounding Large Language Models for Robot Task Planning Using Closed‐Loop State Feedback
BrainBody‐Large Language Model (LLM) introduces a hierarchical, feedback‐driven planning framework where two LLMs coordinate high‐level reasoning and low‐level control for robotic tasks. By grounding decisions in real‐time state feedback, it reduces hallucinations and improves task reliability.
Vineet Bhat +4 more
wiley +1 more source
Hierarchical Photo-Scene Encoder for Album Storytelling
In this paper, we propose a novel model with a hierarchical photo-scene encoder and a reconstructor for the task of album storytelling. The photo-scene encoder contains two sub-encoders, namely the photo and scene encoders, which are stacked together and
Jiang, Wenhao +4 more
core +1 more source
Multimodal Wearable Biosensing Meets Multidomain AI: A Pathway to Decentralized Healthcare
Multimodal biosensing meets multidomain AI. Wearable biosensors capture complementary biochemical and physiological signals, while cross‐device, population‐aware learning aligns noisy, heterogeneous streams. This Review distills key sensing modalities, fusion and calibration strategies, and privacy‐preserving deployment pathways that transform ...
Chenshu Liu +10 more
wiley +1 more source

