LLVMs4Protest: Harnessing the Power of Large Language and Vision Models for Deciphering Protests in the News

open access: yes, 2023
Large language and vision models have transformed how social movements scholars identify protest and extract key protest attributes from multi-modal data such as texts, images, and videos.
Zhang, Yongjun
core  

Swin Transformer-Based Dynamic Semantic Communication for Multi-User with Different Computing Capacity

open access: green, 2023
Loc X. Nguyen   +6 more
openalex   +1 more source

SE‐Swin: An improved Swin‐Transfomer network of self‐ensemble feature extraction framework for image retrieval

open access: yes, IET Image Processing
The Swin‐Transformer is a variant of the Vision Transformer, which constructs a hierarchical Transformer that computes representations with shifted windows and window multi‐head self‐attention.
Yixuan Xu   +3 more
doaj   +1 more source

Transformers meet CNNs for insights into breast mass classification from histopathological images

open access: yes, Frontiers in Artificial Intelligence
Introduction: Breast cancer remains one of the leading causes of cancer-related deaths among women worldwide, highlighting the critical need for accurate histopathological diagnosis and reliable decision-support systems to improve diagnostic sensitivity ...
Vatsala Anand, Ajay Khajuria
doaj   +1 more source

YotoR-You Only Transform One Representation

open access: yes
This paper introduces YotoR (You Only Transform One Representation), a novel deep learning model for object detection that combines Swin Transformers and YoloR architectures.
Loncomilla, Patricio   +2 more
core  

SwinV2DNet: Pyramid and Self-Supervision Compounded Feature Learning for Remote Sensing Images Change Detection

open access: yes, 2023
Among the current mainstream change detection networks, transformer is deficient in the ability to capture accurate low-level details, while convolutional neural network (CNN) is wanting in the capacity to understand global information and establish ...
Liu, Jia   +3 more
core  

Swin Transformer With Spatial and Local Context Augmentation for Enhanced Semantic Segmentation of Remote Sensing Images

open access: yes, IEEE Open Journal of Signal Processing
Semantic segmentation of remote sensing images is extensively used in crop cover and type analysis, and environmental monitoring. In the semantic segmentation of remote sensing images, owing to the specificity of remote sensing images, not only the ...
Rong-Xing Ding   +4 more
doaj   +1 more source

Efficient Wheat Disease Identification Using Hybrid Swin-SHARP Vision Model

open access: yes, IEEE Access
Accurate identification of wheat diseases is an essential component for increasing crop yields and guaranteeing global food security. However, subjective opinions, errors, and laborious procedures frequently limit traditional approaches, which are based ...
Waqar Khalid   +3 more
doaj   +1 more source

McSTRA: A multi-branch cascaded swin transformer for point spread function-guided robust MRI reconstruction

open access: hybrid, 2023
M.M. Ekanayake   +4 more
openalex   +1 more source

B-Cos Aligned Transformers Learn Human-Interpretable Features

open access: yes
Vision Transformers (ViTs) and Swin Transformers (Swin) are currently state-of-the-art in computational pathology. However, domain experts are still reluctant to use these models due to their lack of interpretability.
Boxberg, Melanie   +9 more
core  
