Results 51 to 60 of about 14,679 (231)
Multi-Focus Microscopy Image Fusion Based on Swin Transformer Architecture
In this study, we introduce the U-Swin fusion model, an effective and efficient transformer-based architecture designed for the fusion of multi-focus microscope images.
Han Hank Xia +4 more
doaj +1 more source
Multi-dimension unified Swin Transformer for 3D Lesion Segmentation in Multiple Anatomical Locations
In oncology research, accurate 3D segmentation of lesions from CT scans is essential for the modeling of lesion growth kinetics. However, following the RECIST criteria, radiologists routinely only delineate each lesion on the axial slice showing the ...
Baumgartner, Richard +8 more
core
This study introduces a hydraulically steerable catheter with a soft tip in vascular procedures. The steering soft tip achieves a minimal diameter of 2.6 mm and supports a 180° bend. Real‐time shape and position tracking, facilitated by segmentation and endpoint detection techniques, improves navigation.
Jingyi Kang +9 more
wiley +1 more source
SwinOCSR: end-to-end optical chemical structure recognition using a Swin Transformer
Optical chemical structure recognition from scientific publications is essential for rediscovering a chemical structure. It is an extremely challenging problem, and current rule-based and deep-learning methods cannot achieve satisfactory recognition ...
Zhanpeng Xu +4 more
doaj +1 more source
Semantic-Aware Local-Global Vision Transformer
Vision Transformers have achieved remarkable progresses, among which Swin Transformer has demonstrated the tremendous potential of Transformer for vision tasks.
Chen, Fanglin +4 more
core
Swin-FER: Swin Transformer for Facial Expression Recognition
The ability of transformers to capture global context information is highly beneficial for recognizing subtle differences in facial expressions. However, compared to convolutional neural networks, transformers require the computation of dependencies between each element and all other elements, leading to high computational complexity. Additionally, the
Mei Bie +4 more
openaire +2 more sources
Source Microphone Identification Using Swin Transformer
Microphone identification is a crucial challenge in the field of digital audio forensics. The ability to accurately identify the type of microphone used to record a piece of audio can provide important information for forensic analysis and crime investigations.
Mustafa Qamhan +2 more
openaire +2 more sources
A loss‐based ensemble generative adversarial network (GAN) framework is proposed to address mode collapse in sperm morphology classification. By integrating spatial augmentation and multiple GAN models, the study enhances synthetic data quality. The Shifted Window Transformer achieves 95.37% accuracy on the HuSHeM dataset, outperforming previous ...
Berke Cansiz +2 more
wiley +1 more source
Pattern Attention Transformer with Doughnut Kernel
We present in this paper a new architecture, the Pattern Attention Transformer (PAT), that is composed of the new doughnut kernel. Compared with tokens in the NLP field, Transformer in computer vision has the problem of handling the high resolution of ...
Sheng, WenYuan
core
BMPCQA: Bioinspired Metaverse Point Cloud Quality Assessment Based on Large Multimodal Models
This study presents a bioinspired metaverse point cloud quality assessment metric, which simulates the human visual evaluation process to perform the point cloud quality assessment task. It first extracts rendering projection video features, normal image features, and point cloud patch features, which are then fed into a large multimodal model to ...
Huiyu Duan +7 more
wiley +1 more source

