Results 1 to 10 of about 172,556 (157)

ViTT: Vision Transformer Tracker [PDF]

open access: yesSensors, 2021
This paper presents a new model for multi-object tracking (MOT) with a transformer. MOT is a spatiotemporal correlation task among interest objects and one of the crucial technologies of multi-unmanned aerial vehicles (Multi-UAV).
Xiaoning Zhu   +4 more
doaj   +3 more sources

Review of Transformer in Computer Vision [PDF]

open access: yesJisuanji kexue, 2023
Transformer is an attention-based encoder-decoder architecture.Due to its long-range sequence modeling and parallel computing capability,Transformer have made a significant breakthrough in natural language processing and is gradually expanding to ...
CHEN Luoxuan, LIN Chengchuang, ZHENG Zhaoliang, MO Zefeng, HUANG Xinyi, ZHAO Gansen
doaj   +1 more source

Dual Vision Transformer

open access: yesIEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
Prior works have proposed several strategies to reduce the computational cost of self-attention mechanism. Many of these works consider decomposing the self-attention procedure into regional and local feature extraction procedures that each incurs a much smaller computational complexity.
Ting Yao   +5 more
openaire   +3 more sources

Multiscale Vision Transformers [PDF]

open access: yes2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021
Technical ...
Fan, Haoqi   +6 more
openaire   +2 more sources

Vicinity Vision Transformer

open access: yesIEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
code: https://github.com/OpenNLPLab/Vicinity-Vision ...
Weixuan Sun   +9 more
openaire   +3 more sources

Evaluation and Comparison of Semantic Segmentation Networks for Rice Identification Based on Sentinel-2 Imagery

open access: yesRemote Sensing, 2023
Efficient and accurate rice identification based on high spatial and temporal resolution remote sensing imagery is essential for achieving precision agriculture and ensuring food security.
Huiyao Xu, Jia Song, Yunqiang Zhu
doaj   +1 more source

CSiT: A Multiscale Vision Transformer for Hyperspectral Image Classification

open access: yesIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2022
The hyperspectral image (HSI) has nearly continuous spectral information; thus, the target of interest can be accurately identified by the subtle details of spectral properties.
Wenxuan He   +4 more
doaj   +1 more source

Transformer-Based Semantic Segmentation for Extraction of Building Footprints from Very-High-Resolution Images

open access: yesSensors, 2023
Semantic segmentation with deep learning networks has become an important approach to the extraction of objects from very high-resolution remote sensing images.
Jia Song, A-Xing Zhu, Yunqiang Zhu
doaj   +1 more source

A Review of Transformer-Based Approaches for Image Captioning

open access: yesApplied Sciences, 2023
Visual understanding is a research area that bridges the gap between computer vision and natural language processing. Image captioning is a visual understanding task in which natural language descriptions of images are automatically generated using ...
Oscar Ondeng, Heywood Ouma, Peter Akuon
doaj   +1 more source

STHarDNet: Swin Transformer with HarDNet for MRI Segmentation

open access: yesApplied Sciences, 2022
In magnetic resonance imaging (MRI) segmentation, conventional approaches utilize U-Net models with encoder–decoder structures, segmentation models using vision transformers, or models that combine a vision transformer with an encoder–decoder model ...
Yeonghyeon Gu   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy