Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition [PDF]
Graph convolutional networks have been widely used for skeleton-based action recognition due to their excellent modeling ability of non-Euclidean data. As the graph convolution is a local operation, it can only utilize the short-range joint dependencies ...
Zhan Chen +4 more
semanticscholar +1 more source
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition [PDF]
Dynamics of human body skeletons convey significant information for human action recognition. Conventional approaches for modeling skeletons usually rely on hand-crafted parts or traversal rules, thus resulting in limited expressive power and ...
Sijie Yan, Yuanjun Xiong, Dahua Lin
semanticscholar +1 more source
Action Capsules: Human skeleton action recognition
11 pages, 11 ...
Ali Farajzadeh Bavil +2 more
openaire +2 more sources
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [PDF]
The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it difficult to identify good video architectures, as most methods obtain similar performance on existing small-scale benchmarks. This paper re-evaluates state-
João Carreira, Andrew Zisserman
semanticscholar +1 more source
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition [PDF]
Spatial-temporal graphs have been widely used by skeleton-based action recognition algorithms to model human action dynamics. To capture robust movement patterns from these graphs, long-range and multi-scale context aggregation and spatial-temporal ...
Ken Ziyu Liu +4 more
semanticscholar +1 more source
PYSKL: Towards Good Practices for Skeleton Action Recognition [PDF]
We present PYSKL: an open-source toolbox for skeleton-based action recognition based on PyTorch. The toolbox supports a wide variety of skeleton action recognition algorithms, including approaches based on GCN and CNN. In contrast to existing open-source
Haodong Duan +3 more
semanticscholar +1 more source
A Closer Look at Spatiotemporal Convolutions for Action Recognition [PDF]
In this paper we discuss several forms of spatiotemporal convolutions for video analysis and study their effects on action recognition. Our motivation stems from the observation that 2D CNNs applied to individual frames of the video have remained solid ...
Du Tran +5 more
semanticscholar +1 more source
Temporal-Relational CrossTransformers for Few-Shot Action Recognition [PDF]
We propose a novel approach to few-shot action recognition, finding temporally-corresponding frame tuples between the query and videos in the support set.
Toby Perrett +4 more
semanticscholar +1 more source
Learning Discriminative Representations for Skeleton Based Action Recognition [PDF]
Human action recognition aims at classifying the category of human action from a segment of a video. Recently, people have dived into designing GCN-based models to extract features from skeletons for performing this task, because skeleton representations
Huanyu Zhou, Qingjie Liu, Yunhong Wang
semanticscholar +1 more source
Convolutional Two-Stream Network Fusion for Video Action Recognition [PDF]
Recent applications of Convolutional Neural Networks (ConvNets) for human action recognition in videos have proposed different solutions for incorporating the appearance and motion information.
Christoph Feichtenhofer +2 more
semanticscholar +1 more source

