Results 11 to 20 of about 14,679 (231)
S-Swin Transformer: simplified Swin Transformer model for offline handwritten Chinese character recognition [PDF]
The Transformer shows good prospects in computer vision. However, the Swin Transformer model has the disadvantage of a large number of parameters and high computational effort.
Yongping Dan +3 more
doaj +4 more sources
Swin transformer for fast MRI [PDF]
Magnetic resonance imaging (MRI) is an important non-invasive clinical tool that can produce high-resolution and reproducible images. However, a long scanning time is required for high-quality MR images, which leads to exhaustion and discomfort of ...
Fang, Yingying +8 more
core +9 more sources
SparseSwin: Swin Transformer with Sparse Transformer Block
Advancements in computer vision research have put transformer architecture as the state of the art in computer vision tasks. One of the known drawbacks of the transformer architecture is the high number of parameters, this can lead to a more complex and ...
Irsal, Riyandi Banovbi Putera +4 more
core +2 more sources
SPT-Swin: A Shifted Patch Tokenization Swin Transformer for Image Classification
Recently, the transformer-based model e.g., the vision transformer (ViT) has been extensively used in computer vision tasks. The superior performance of the ViT leads to the requirement of an enormous dataset and the complexity of calculating self ...
Gazi Jannatul Ferdous +3 more
doaj +2 more sources
SwiFT: Swin 4D fMRI Transformer
Modeling spatiotemporal brain dynamics from high-dimensional data, such as functional Magnetic Resonance Imaging (fMRI), is a formidable task in neuroscience. Existing approaches for fMRI analysis utilize hand-crafted features, but the process of feature
Bae, Sangyoon +8 more
core +2 more sources
DarSwin: Distortion Aware Radial Swin Transformer
Wide-angle lenses are commonly used in perception tasks requiring a large field of view. Unfortunately, these lenses produce significant distortions making conventional models that ignore the distortion effects unable to adapt to wide-angle images.
Afrasiyabi, Arman +5 more
core +2 more sources
The vision community is witnessing a modeling shift from CNNs to Transformers, where pure Transformer architectures have attained top accuracy on the major video recognition benchmarks. These video models are all built on Transformer layers that globally connect patches across the spatial and temporal dimensions.
Liu, Ze +6 more
openaire +2 more sources
Swin-Pose: Swin Transformer Based Human Pose Estimation
Convolutional neural networks (CNNs) have been widely utilized in many computer vision tasks. However, CNNs have a fixed reception field and lack the ability of long-range perception, which is crucial to human pose estimation. Due to its capability to capture long-range dependencies between pixels, transformer architecture has been adopted to computer ...
Xiong, Zinan +4 more
openaire +2 more sources
SQ-Swin: a Pretrained Siamese Quadratic Swin Transformer for Lettuce Browning Prediction
Packaged fresh-cut lettuce is widely consumed as a major component of vegetable salad owing to its high nutrition, freshness, and convenience. However, enzymatic browning discoloration on lettuce cut edges significantly reduces product quality and shelf ...
Luo, Yaguang +4 more
core +2 more sources
Asymmetric convolution Swin transformer for medical image super-resolution
Medical Image Super-Resolution plays a pivotal role in enhancing diagnostic accuracy. Transformer-based methods, such as Image Restoration Using Swin Transformer (SwinIR) and Swin transformer for fast Magnetic Resonance Imaging (SwinMR), have shown ...
Weijia Lu +9 more
doaj +1 more source

