Results 1 to 10 of about 1,365,020 (307)

Video Anomaly Detection Based on Improved Time Segmentation Network [PDF]

open access: yesJisuanji gongcheng, 2022
Video anomaly detection is an important research topic in the field of computer vision, that is widely used in road monitoring and abnormal event monitoring.Considering the obvious differences between the appearance and motion characteristics of abnormal
HUANG Tao, WU Kaijun, WANG Dicong, BAI Chenshuai, TAO Xiaomiao
doaj   +1 more source

HARP: Personalized Hand Reconstruction from a Monocular RGB Video [PDF]

open access: yesComputer Vision and Pattern Recognition, 2022
We present HARP (HAnd Reconstruction and Personalization), a personalized hand avatar creation approach that takes a short monocular RGB video of a human hand as input and reconstructs a faithful hand avatar exhibiting a high-fidelity appearance and ...
Korrawe Karunratanakul   +3 more
semanticscholar   +1 more source

ODAM: Object Detection, Association, and Mapping using Posed RGB Video [PDF]

open access: yesIEEE International Conference on Computer Vision, 2021
Localizing objects and estimating their extent in 3D is an important step towards high-level 3D scene understanding, which has many applications in Augmented Reality and Robotics.
Kejie Li   +8 more
semanticscholar   +1 more source

MonoClothCap: Towards Temporally Coherent Clothing Capture from Monocular RGB Video [PDF]

open access: yesInternational Conference on 3D Vision, 2020
We present a method to capture temporally coherent dynamic clothing deformation from a monocular RGB video input. In contrast to the existing literature, our method does not require a pre-scanned personalized mesh template, and thus can be applied to in ...
Donglai Xiang   +3 more
semanticscholar   +1 more source

Using Motion History Images With 3D Convolutional Networks in Isolated Sign Language Recognition

open access: yesIEEE Access, 2022
Sign language recognition using computational models is a challenging problem that requires simultaneous spatio-temporal modeling of the multiple sources, i.e. faces, hands, body, etc. In this paper, we propose an isolated sign language recognition model
Ozge Mercanoglu Sincan   +1 more
doaj   +1 more source

ViCo-MoCo-DL: Video Coding and Motion Compensation Solutions for Human Activity Recognition Using Deep Learning

open access: yesIEEE Access, 2023
This paper proposes three novel feature extraction approaches for human activity recognition in videos. The proposed solutions are based on video coding concepts including motion compensations and coding based feature variables.
Tamer Shanableh
doaj   +1 more source

Isolated Sign Recognition from RGB Video using Pose Flow and Self-Attention

open access: yes2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2021
Automatic sign language recognition lies at the intersection of natural language processing (NLP) and computer vision. The highly successful transformer architectures, based on multi-head attention, originate from the field of NLP.
Mathieu De Coster   +2 more
semanticscholar   +1 more source

Singular Spectrum Analysis for Background Initialization with Spatio-Temporal RGB Color Channel Data

open access: yesEntropy, 2021
In video processing, background initialization aims to obtain a scene without foreground objects. Recently, the background initialization problem has attracted the attention of researchers because of its real-world applications, such as video ...
Huy D. Le   +3 more
doaj   +1 more source

Motion Feature Retrieval in Basketball Match Video Based on Multisource Motion Feature Fusion

open access: yesAdvances in Mathematical Physics, 2022
Both the human body and its motion are three-dimensional information, while the traditional feature description method of two-person interaction based on RGB video has a low degree of discrimination due to the lack of depth information.
Biao Ma, Minghui Ji
doaj   +1 more source

PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence [PDF]

open access: yesComputer Vision and Pattern Recognition, 2022
We present a novel method to learn Personalized Implicit Neural Avatars (PINA) from a short RGB-D sequence. This allows non-expert users to create a detailed and personal-ized virtual copy of themselves, which can be animated with realistic clothing ...
Zijian Dong   +5 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy