Quantum Vision Transformers
In this work, quantum transformers are designed and analysed in detail by extending the state-of-the-art classical transformer neural network architectures known to be very performant in natural language processing and image analysis.
El Amine Cherrat +5 more
doaj +3 more sources
Rosette Trajectory MRI Reconstruction with Vision Transformers
Introduction: An efficient pipeline for rosette trajectory magnetic resonance imaging reconstruction is proposed, combining the inverse Fourier transform with a vision transformer (ViT) network enhanced with a convolutional layer.
Muhammed Fikret Yalcinbas +4 more
doaj +2 more sources
ViTT: Vision Transformer Tracker
This paper presents a new transformer-based model for multi-object tracking (MOT). MOT is a spatiotemporal correlation task among objects of interest and one of the crucial technologies for multi-unmanned aerial vehicle (Multi-UAV) systems. The transformer is a self-attentional codec architecture that has been successfully used in natural language processing and
Zhu, Xiaoning +4 more
openaire +3 more sources
Person Re-Identification is an essential task in computer vision, particularly in surveillance applications. The aim is to identify a person based on an input image from surveillance photographs in various scenarios.
Muhammad Tahir, Saeed Anwar
doaj +1 more source
Prior works have proposed several strategies to reduce the computational cost of the self-attention mechanism. Many of these works decompose the self-attention procedure into regional and local feature extraction procedures, each of which incurs a much smaller computational complexity.
Ting Yao +5 more
openaire +3 more sources
Multiscale Vision Transformers
Technical ...
Fan, Haoqi +6 more
openaire +2 more sources
code: https://github.com/OpenNLPLab/Vicinity-Vision ...
Weixuan Sun +9 more
openaire +3 more sources
Self-attention in vision transformers performs perceptual grouping, not attention
Recently, a considerable number of studies in computer vision involve deep neural architectures called vision transformers. Visual processing in these models incorporates computational models that are claimed to implement attention mechanisms. Despite an
Paria Mehrani, John K. Tsotsos
doaj +1 more source
ViTFER: Facial Emotion Recognition with Vision Transformers
Automated emotion recognition has proven to be a powerful tool in several fields. The main objective of facial emotion recognition (FER) is to map different facial expressions to their respective emotional states.
Aayushi Chaudhari +3 more
doaj +1 more source
Multi-Manifold Attention for Vision Transformers
Vision Transformers are very popular nowadays due to their state-of-the-art performance in several computer vision tasks, such as image classification and action recognition. Although their performance has been greatly enhanced through highly descriptive
Dimitrios Konstantinidis +3 more
doaj +1 more source

