Learning Hierarchically Consistent Disentanglement with Multi-Channel Augmentation for Public Security-Oriented Sketch Person Re-Identification [PDF]
Sketch re-identification (Re-ID) aims to retrieve pedestrian photographs in the gallery dataset by a query sketch image drawn by professionals, which is crucial for criminal investigations and missing person searches in the field of public security.
Yu Ye, Zhihong Sun, Jun Chen
doaj +2 more sources
AFCLNet: An Attention and Feature-Consistency-Loss-Based Multi-Task Learning Network for Affective Matching Prediction in Music–Video Clips [PDF]
Emotion matching prediction between music and video segments is essential for intelligent mobile sensing systems, where multimodal affective cues collected from smart devices must be jointly analyzed for context-aware media understanding.
Zhibin Su +4 more
doaj +2 more sources
Improvement of deep cross-modal retrieval by generating real-valued representation [PDF]
The cross-modal retrieval (CMR) has attracted much attention in the research community due to flexible and comprehensive retrieval. The core challenge in CMR is the heterogeneity gap, which is generated due to different statistical properties of multi ...
Nikita Bhatt, Amit Ganatra
doaj +2 more sources
Closing the Domain Gap for Cross-modal Visible-Infrared Vehicle Re-identification
Traditional vehicle re-identification (ReID) approaches, based on visible spectrum data achieve high performance, but have limited capability of real-life applications, as they perform poorly under occluded visibility conditions, such as night-time and bad weather.
Kamenou, Eleni +3 more
openaire +4 more sources
Cross Task Modality Alignment Network for Sketch Face Recognition
The task of sketch face recognition refers to matching cross-modality facial images from sketch to photo, which is widely applied in the criminal investigation area.
Yanan Guo +5 more
doaj +1 more source
DA-GAN: Dual Attention Generative Adversarial Network for Cross-Modal Retrieval
Cross-modal retrieval aims to search samples of one modality via queries of other modalities, which is a hot issue in the community of multimedia. However, two main challenges, i.e., heterogeneity gap and semantic interaction across different modalities,
Liewu Cai +3 more
doaj +1 more source
Mind the Gap: Alleviating Local Imbalance for Unsupervised Cross-Modality Medical Image Segmentation
Unsupervised cross-modality medical image adaptation aims to alleviate the severe domain gap between different imaging modalities without using the target domain label. A key in this campaign relies upon aligning the distributions of source and target domain.
Zixian Su +6 more
openaire +3 more sources
Data gap decomposed by auxiliary modality for NIR‐VIS heterogeneous face recognition
In the dark scene at night, the face images captured by ordinary visible light (VIS) are generally poor quality and very dim, while the near‐infrared (NIR) can capture high definition and recognizable face images at night.
Rui Sun +3 more
doaj +1 more source
Cross‐modality person re‐identification using hybrid mutual learning
Cross‐modality person re‐identification (Re‐ID) aims to retrieve a query identity from red, green, blue (RGB) images or infrared (IR) images. Many approaches have been proposed to reduce the distribution gap between RGB modality and IR modality. However,
Zhong Zhang +5 more
doaj +1 more source
Cross Modal Facial Image Synthesis Using a Collaborative Bidirectional Style Transfer Network
In this paper, we present a novel collaborative bidirectional style transfer network based on generative adversarial network (GAN) for cross modal facial image synthesis, possibly with large modality gap.
Nizam Ud Din +4 more
doaj +1 more source

