Temporally Consistent Motion Segmentation From RGB‐D Video [PDF]
AbstractTemporally consistent motion segmentation from RGB‐D videos is challenging because of the limitations of current RGB‐D sensors. We formulate segmentation as a motion assignment problem, where a motion is a sequence of rigid transformations through all frames of the input.
Peter Bertholet +2 more
openalex +4 more sources
Towards Unconstrained Joint Hand-Object Reconstruction From RGB Videos [PDF]
Project website: https://hassony2.github.io/homan ...
Yana Hasson +3 more
openalex +5 more sources
Viewpoint Invariant Action Recognition Using RGB-D Videos
In video-based action recognition, viewpoint variations often pose major challenges because the same actions can appear different from different views. We use the complementary RGB and Depth information from the RGB-D cameras to address this problem. The proposed technique capitalizes on the spatio-temporal information available in the two data streams
Jian Liu, Naveed Akhtar, Ajmal Mian
openalex +4 more sources
Recognizing American Sign Language Manual Signs from RGB-D Videos [PDF]
In this paper, we propose a 3D Convolutional Neural Network (3DCNN) based multi-stream framework to recognize American Sign Language (ASL) manual signs (consisting of movements of the hands, as well as non-manual face movements in some cases) in real-time from RGB-D videos, by fusing multimodality features including hand gestures, facial expressions ...
Longlong Jing +3 more
openalex +4 more sources
Action recognition based on 2D skeletons extracted from RGB videos [PDF]
In this paper a methodology to recognize actions based on RGB videos is proposed which takes advantages of the recent breakthrough made in deep learning.
Aubry Sophie +3 more
doaj +3 more sources
Learning 3D-Gaussian Simulators from RGB Videos [PDF]
Realistic simulation is critical for applications ranging from robotics to animation. Learned simulators have emerged as a possibility to capture real world physics directly from video data, but very often require privileged information such as depth information, particle tracks and hand-engineered features to maintain spatial and temporal consistency.
Mikel Zhobro +2 more
openalex +3 more sources
Mask R-CNN and Centroid Tracking Algorithm to Process UAV Based Thermal-RGB Video for Drylot Cattle Heat Stress Monitoring [PDF]
This study developed and evaluated an algorithm for processing thermal-RGB video feeds captured by an unmanned aerial vehicle (UAV) to automate heat stress monitoring in cattle housed in the drylots.
Keshawa M. Dadallage +3 more
doaj +2 more sources
Generation of Moire-Like Videos from RGB-D Videos Window [PDF]
Sho Enomoto, Toru Hiraoka
openalex +2 more sources
Video Anomaly Detection Based on Improved Time Segmentation Network [PDF]
Video anomaly detection is an important research topic in the field of computer vision, that is widely used in road monitoring and abnormal event monitoring.Considering the obvious differences between the appearance and motion characteristics of abnormal
HUANG Tao, WU Kaijun, WANG Dicong, BAI Chenshuai, TAO Xiaomiao
doaj +1 more source
Using Motion History Images With 3D Convolutional Networks in Isolated Sign Language Recognition
Sign language recognition using computational models is a challenging problem that requires simultaneous spatio-temporal modeling of the multiple sources, i.e. faces, hands, body, etc. In this paper, we propose an isolated sign language recognition model
Ozge Mercanoglu Sincan +1 more
doaj +1 more source

