Results 1 to 10 of about 8,410,900 (380)
Action recognition using Natural Action Structures [PDF]
Humans can detect, recognize, and classify natural actions in a very short time. How this is achieved by the visual system and how to make machines understand human actions have been the focus of neuro-scientific studies and computational modeling in the last several decades.
Zhu Xiaoyuan, Yang Zhiyong, Tsien Joe Z
doaj +3 more sources
Action Recognition by Hierarchical Mid-level Action Elements [PDF]
Realistic videos of human actions exhibit rich spatiotemporal structures at multiple levels of granularity: an action can always be decomposed into multiple finer-grained elements in both space and time. To capture this intuition, we propose to represent
Lan, Tian +3 more
core +2 more sources
Fusion Attention for Action Recognition: Integrating Sparse-Dense and Global Attention for Video Action Recognition [PDF]
Conventional approaches to video action recognition perform global attention over the entire video patches, which may be ineffective due to the temporal redundancy of video frames. Recent works on masked video modeling adopt a high-ratio tube masking and
Hyun-Woo Kim, Yong-Suk Choi
doaj +2 more sources
Convolutional Block Attention Module–Multimodal Feature-Fusion Action Recognition: Enabling Miner Unsafe Action Recognition [PDF]
The unsafe action of miners is one of the main causes of mine accidents. Research on underground miner unsafe action recognition based on computer vision enables relatively accurate real-time recognition of unsafe action among underground miners.
Yu Wang +3 more
doaj +2 more sources
Perceptual Perspective Taking and Action Recognition
Robots that operate in social environments need to be able to recognise and understand the actions of other robots, and humans, in order to facilitate learning through imitation and collaboration.
Matthew Johnson, Yiannis Demiris
doaj +3 more sources
Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition [PDF]
Graph convolutional networks (GCNs) have been widely used and achieved remarkable results in skeleton-based action recognition. In GCNs, graph topology dominates feature aggregation and therefore is the key to extracting representative features.
Yuxin Chen +5 more
semanticscholar +1 more source
Recognition in planning seeks to find agent intentions, goals or activities given a set of observations and a knowledge library (e.g. goal states, plans or domain theories). In this work we introduce the problem of Online Action Recognition. It consists in recognizing, in an open world, the planning action that best explains a partially observable ...
Suárez Hernández, Alejandro +3 more
openaire +7 more sources
AIM: Adapting Image Models for Efficient Video Action Recognition [PDF]
Recent vision transformer based video models mostly follow the ``image pre-training then finetuning"paradigm and have achieved great success on multiple video benchmarks.
Taojiannan Yang +5 more
semanticscholar +1 more source
Revisiting Skeleton-based Action Recognition [PDF]
Human skeleton, as a compact representation of human action, has received increasing attention in recent years. Many skeleton-based action recognition methods adopt GCNs to extract features on top of human skeletons. Despite the positive results shown in
Haodong Duan +5 more
semanticscholar +1 more source
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning for Action Recognition [PDF]
Capitalizing on large pre-trained models for various downstream tasks of interest have recently emerged with promising performance. Due to the ever-growing model size, the standard full fine-tuning based task adaptation strategy becomes prohibitively ...
Junting Pan +4 more
semanticscholar +1 more source

