Results 71 to 80 of about 96,008 (322)
ViPLO: Vision Transformer based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection [PDF]
Jeeseung Park +2 more
openalex +1 more source
Interpretability-Aware Vision Transformer
Vision Transformers (ViTs) have become prominent models for solving various vision tasks. However, the interpretability of ViTs has not kept pace with their promising performance. While there has been a surge of interest in developing {\it post hoc} solutions to explain ViTs' outputs, these methods do not generalize to different downstream tasks and ...
Yao Qiang +3 more
openaire +2 more sources
Liquid‐phase transmission electron microscopy enables direct observation of nucleation and growth processes in solution. This review is dedicated to the remembrance of Helmut Cölfen and highlights recent studies on complex materials—oxides, biominerals, organic–inorganic crystals—which were central to his research activity. It summarizes key milestones,
Charles Sidhoum +5 more
wiley +1 more source
Modeling Image Virality with Pairwise Spatial Transformer Networks
The study of virality and information diffusion online is a topic gaining traction rapidly in the computational social sciences. Computer vision and social network analysis research have also focused on understanding the impact of content and information
Agarwal, Sumeet, Dubey, Abhimanyu
core +1 more source
SGDViT: Saliency-Guided Dynamic Vision Transformer for UAV Tracking [PDF]
Liangliang Yao +4 more
openalex +1 more source
Automat optical inspection (AOI) techniques in semiconductor fabrication can be leveraged in battery manufacturing, enabling scalable detection and analysis of electrode‐ and cell‐level imperfections through AI‐driven analytics and a digital‐twin framework.
Jianyu Li, Ertao Hu, Wei Wei, Feifei Shi
wiley +1 more source
Vision Transformers and Transfer Learning Approaches for Arabic Sign Language Recognition
Sign languages are complex, but there are ongoing research efforts in engineering and data science to recognize, understand, and utilize them in real-time applications.
Nojood M. Alharthi, Salha M. Alzahrani
doaj +1 more source
Sparse Attentive Backtracking: Temporal CreditAssignment Through Reminding
Learning long-term dependencies in extended temporal sequences requires credit assignment to events far back in the past. The most common method for training recurrent neural networks, back-propagation through time (BPTT), requires credit information to ...
Bengio, Yoshua +6 more
core +1 more source
Vision Transformers (ViTs) have recently become the state-of-the-art across many computer vision tasks. In contrast to convolutional networks (CNNs), ViTs enable global information sharing even within shallow layers of a network, i.e., among high-resolution features. However, this perk was later overlooked with the success of pyramid architectures such
Jongwoo Park 0003 +5 more
openaire +2 more sources
We present ultrathin flexible transparent electrodes through iCVD‐enabled molecular control of 10 nm gold films on poly(dimethylaminomethylstyrene). In vivo validation demonstrated photoelectric artifact reduction vs. opaque electrodes and preservation of natural neural dynamics.
Tae Jin Mun +11 more
wiley +1 more source

