Rgb video - Open Access .click

Results 81 to 90 of about 103,551 (264)

Object Tracking in Hyperspectral Videos with Convolutional Features and Kernelized Correlation Filter

, 2018
Target tracking in hyperspectral videos is a new research topic. In this paper, a novel method based on convolutional network and Kernelized Correlation Filter (KCF) framework is presented for tracking objects of interest in hyperspectral videos.
J Henriques +6 more
core +1 more source

Animatable Neural Radiance Fields from Monocular RGB Videos

, 2021
12 pages, 12 ...
Chen, Jianchuan +6 more
openaire +2 more sources

Learning human activities and object affordances from RGB-D videos [PDF]

The International Journal of Robotics Research, 2013
Understanding human activities and object affordances are two very important skills, especially for personal robots which operate in human environments. In this work, we consider the problem of extracting a descriptive labeling of the sequence of sub-activities being performed by a human, and more importantly, of their interactions with the objects in
Koppula, Hema Swetha, Gupta, Rudhir, Saxena, Ashutosh +2 more
openaire +2 more sources

Multimodal Human–Robot Interaction Using Human Pose Estimation and Local Large Language Models

Advanced Robotics Research, EarlyView.
A multimodal human–robot interaction framework integrates human pose estimation (HPE) and a large language model (LLM) for gesture‐ and voice‐based robot control. Speech‐to‐text (STT) enables voice command interpretation, while a safety‐aware arbitration mechanism prioritizes gesture input for rapid intervention.
Nasiru Aboki, Ilche Georgievski, Marco Aiello +2 more
wiley +1 more source

Spectral video construction from RGB video: Application to Image Guided Neurosurgery

, 2016
Spectral imaging has received enormous interest in the field of medical imaging modalities. It provides a powerful tool for the analysis of different organs and non-invasive tissues. Therefore, significant amount of research has been conducted to explore the possibility of using spectral imaging in biomedical applications.
Hasnat, Md. Abul +2 more
openaire +2 more sources

Miniaturized Magnetic Tip Design for Endoluminal Vine Robot Navigation

Advanced Robotics Research, EarlyView.
A magnetic tip mount is designed for a miniaturized 7 mm soft‐growing vine robot to enable wireless magnetic steering and onboard imaging, while preserving a 3 mm working channel. The internal–external ring magnets design balances magnetic attachment with low eversion pressure. Experiments demonstrate ±90° steering, 34 mm bending radius, and successful
Andrea Yanez Trujillo +4 more
wiley +1 more source

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding

, 2019
Research on depth-based human activity analysis achieved outstanding performance and demonstrated the effectiveness of 3D representation for action recognition.
Duan, Ling-Yu +5 more
core +1 more source

SKOOTS: Skeleton‐Oriented Object Segmentation for Mitochondria in High‐Resolution Cochlear EM Datasets

Advanced Science, EarlyView.
Skeleton‐oriented object segmentation (SKOOTS) introduces a new strategy for 3D mitochondrial instance segmentation by predicting explicit skeletons rather than relying on boundary cues. This approach enables robust analysis of densely packed organelles in large FIB‐SEM datasets.
Christopher J. Buswinka +3 more
wiley +1 more source

Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification

, 2018
Despite the steady progress in video analysis led by the adoption of convolutional neural networks (CNNs), the relative improvement has been less drastic as that in 2D static image classification.
Huang, Jonathan +4 more
core +1 more source

Reconstructing Articulated Rigged Models from RGB-D Videos [PDF]

, 2016
Although commercial and open-source software exist to reconstruct a static object from a sequence recorded with an RGB-D sensor, there is a lack of tools that build rigged models of articulated objects that deform realistically and can be used for tracking or animation.
Tzionas, D., Gall, J.
openaire +3 more sources

fos: computer and information sciences
computer vision and pattern recognition cs.cv
deep learning

human action recognition
action recognition
video coding

machine learning
artificial intelligence
computer vision