Results 21 to 30 of about 103,607 (290)
Machine learning with different digital images classification in laparoscopic surgery
The evaluation of the effectiveness of the automatic computer diagnostic (ACD) systems developed based on two classifiers – HAAR features cascade and AdaBoost for the laparoscopic diagnostics of appendicitis and ovarian cysts in women with chronic ...
M. Bayazitov +4 more
doaj +1 more source
Neural Head Avatars from Monocular RGB Videos
We present Neural Head Avatars, a novel neural representation that explicitly models the surface geometry and appearance of an animatable human avatar that can be used for teleconferencing in AR/VR or other applications in the movie or games industry that rely on a digital human.
Grassal, P. +5 more
openaire +3 more sources
Unsupervised Learning of Long-Term Motion Dynamics for Videos [PDF]
We present an unsupervised representation learning approach that compactly encodes the motion dependencies in videos. Given a pair of images from a video clip, our framework learns to predict the long-term 3D motions.
Alahi, Alexandre +4 more
core +2 more sources
Body and Hand–Object ROI-Based Behavior Recognition Using Deep Learning
Behavior recognition has applications in automatic crime monitoring, automatic sports video analysis, and context awareness of so-called silver robots. In this study, we employ deep learning to recognize behavior based on body and hand–object interaction
Yeong-Hyeon Byeon +3 more
doaj +1 more source
RGB-D datasets using microsoft kinect or similar sensors: a survey [PDF]
RGB-D data has turned out to be a very useful representation of an indoor scene for solving fundamental computer vision problems. It takes the advantages of the color image that provides appearance information of an object and also the depth image that ...
Galili +7 more
core +2 more sources
Radar and RGB-depth sensors for fall detection: a review [PDF]
This paper reviews recent works in the literature on the use of systems based on radar and RGB-Depth (RGB-D) sensors for fall detection, and discusses outstanding research challenges and trends related to this research field.
Cippitelli, Enea +3 more
core +1 more source
Action tube extraction based 3D-CNN for RGB-D action recognition [PDF]
In this paper we propose a novel action tube extractor for RGB-D action recognition in trimmed videos. The action tube extractor takes as input a video and outputs an action tube.
Morros Rubió, Josep Ramon +2 more
core +1 more source
Most of the existing deep learning based end-to-end image/video coding (DLEC) architectures are designed for non-subsampled RGB color format. However, in order to achieve a superior coding performance, many state-of-the-art block-based compression ...
Hilmi Egilmez +7 more
doaj +1 more source
Multimodal Spatiotemporal Networks for Sign Language Recognition
Different from other human behaviors, sign language has the characteristics of limited local motion of upper limb and meticulous hand action. Some sign language gestures are ambiguous in RGB video due to the influence of lighting and background color ...
Shujun Zhang +3 more
doaj +1 more source
Play and Learn: Using Video Games to Train Computer Vision Models [PDF]
Video games are a compelling source of annotated data as they can readily provide fine-grained groundtruth for diverse tasks. However, it is not clear whether the synthetically generated data has enough resemblance to the real-world images to improve the
Little, James J. +2 more
core +1 more source

