Target tracking in hyperspectral videos is a new research topic. This paper presents a novel method, based on a convolutional network and the Kernelized Correlation Filter (KCF) framework, for tracking objects of interest in hyperspectral videos.
J Henriques +6 more
core +1 more source
Animatable Neural Radiance Fields from Monocular RGB Videos
12 pages, 12 ...
Chen, Jianchuan +6 more
openaire +2 more sources
Learning human activities and object affordances from RGB-D videos
Understanding human activities and object affordances are two very important skills, especially for personal robots which operate in human environments. In this work, we consider the problem of extracting a descriptive labeling of the sequence of sub-activities being performed by a human, and more importantly, of their interactions with the objects in
Koppula, Hema Swetha +2 more
openaire +2 more sources
Multimodal Human–Robot Interaction Using Human Pose Estimation and Local Large Language Models
A multimodal human–robot interaction framework integrates human pose estimation (HPE) and a large language model (LLM) for gesture‐ and voice‐based robot control. Speech‐to‐text (STT) enables voice command interpretation, while a safety‐aware arbitration mechanism prioritizes gesture input for rapid intervention.
Nasiru Aboki +2 more
wiley +1 more source
Spectral video construction from RGB video: Application to Image Guided Neurosurgery
Spectral imaging has received enormous interest in the field of medical imaging modalities. It provides a powerful tool for the analysis of different organs and non-invasive tissues. Therefore, a significant amount of research has been conducted to explore the possibility of using spectral imaging in biomedical applications.
Hasnat, Md. Abul +2 more
openaire +2 more sources
Miniaturized Magnetic Tip Design for Endoluminal Vine Robot Navigation
A magnetic tip mount is designed for a miniaturized 7 mm soft‐growing vine robot to enable wireless magnetic steering and onboard imaging while preserving a 3 mm working channel. The internal–external ring‐magnet design balances magnetic attachment with low eversion pressure. Experiments demonstrate ±90° steering, 34 mm bending radius, and successful
Andrea Yanez Trujillo +4 more
wiley +1 more source
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding
Research on depth-based human activity analysis achieved outstanding performance and demonstrated the effectiveness of 3D representation for action recognition.
Duan, Ling-Yu +5 more
core +1 more source
Skeleton‐oriented object segmentation (SKOOTS) introduces a new strategy for 3D mitochondrial instance segmentation by predicting explicit skeletons rather than relying on boundary cues. This approach enables robust analysis of densely packed organelles in large FIB‐SEM datasets.
Christopher J. Buswinka +3 more
wiley +1 more source
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Despite the steady progress in video analysis led by the adoption of convolutional neural networks (CNNs), the relative improvement has been less drastic than that in 2D static image classification.
Huang, Jonathan +4 more
core +1 more source
Reconstructing Articulated Rigged Models from RGB-D Videos
Although commercial and open-source software exists to reconstruct a static object from a sequence recorded with an RGB-D sensor, there is a lack of tools that build rigged models of articulated objects that deform realistically and can be used for tracking or animation.
Tzionas, D., Gall, J.
openaire +3 more sources

