Some of the following articles may not be open access.
MV-CLAM: Multi-View Molecular Interpretation with Cross-Modal Projection via Language Model
Conference on Empirical Methods in Natural Language Processing
Human expertise in chemistry and biomedicine relies on contextual molecular understanding, a capability that large language models (LLMs) can extend through fine-grained alignment between molecular structures and text. Recent multimodal learning advances
Sumin Ha +3 more
semanticscholar +1 more source
Multimedia Feature Mapping and Correlation Learning for Cross-Modal Retrieval
International Journal of Grid and High Performance Computing, 2018
This article describes how, with the rapid increase of multimedia content on the Internet, the need for effective cross-modal retrieval has attracted much attention recently. Many related works ignore the latent semantic correlations of modalities in the non-linear space and the extraction of high-level modality features, which only focuses on the ...
Xu Yuan +4 more
openaire +1 more source
2025 IEEE 17th International Conference on Computer Research and Development (ICCRD)
With the rapid development of related technologies, face forgery techniques have become increasingly sophisticated. Meanwhile, it brings some potential risks.
Jia Meng +3 more
semanticscholar +1 more source
Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval
ACM Multimedia, 2018
This paper learns semantic embeddings for multi-label cross-modal retrieval. Our method exploits the structure in semantics represented by label vectors to guide the learning of embeddings.
Yiling Wu, Shuhui Wang, Qingming Huang
semanticscholar +1 more source
Cross-modal association between vowels and colours: A cross-linguistic perspective.
Journal of the Acoustical Society of America, 2019
Previous studies showed similar mappings between sounds and colours for synaesthetes and non-synaesthetes alike, and proposed that common mechanisms underlie such cross-modal association.
P. Mok +4 more
semanticscholar +1 more source
Masking-Based Cross-Modal Remote Sensing Image–Text Retrieval via Dynamic Contrastive Learning
IEEE Transactions on Geoscience and Remote Sensing
Cross-modal remote sensing image-text retrieval (CMRSITR) aims to extract comprehensive information from diverse modalities. The primary challenge in this field is developing effective mappings between visual and textual modalities to a shared latent ...
Zuopeng Zhao +7 more
semanticscholar +1 more source
Syncgan: Synchronize the Latent Spaces of Cross-Modal Generative Adversarial Networks
IEEE International Conference on Multimedia and Expo, 2018
Generative adversarial network (GAN) has achieved impressive success on cross-domain generation, but it faces difficulty in cross-modal generation due to the lack of a common distribution between heterogeneous data.
Wen-Cheng Chen +2 more
semanticscholar +1 more source
GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency
Computer Vision and Pattern Recognition
Identifying affordance regions on 3D objects from semantic cues is essential for robotics and human-machine interaction. However, existing 3D affordance learning methods struggle with generalization and robustness due to limited annotated data and a ...
Dongyue Lu +3 more
semanticscholar +1 more source
Workshop on Cognitive Modeling and Computational Linguistics
Humans have clear cross-modal preferences when matching certain novel words to visual shapes. Evidence suggests that these preferences play a prominent role in our linguistic processing, language learning, and the origins of signal-meaning mappings. With
T. Verhoef +2 more
semanticscholar +1 more source
Robust Cross-modal Medical Image Translation via Diffusion Model and Knowledge Distillation
IEEE International Joint Conference on Neural Network
Medical image translation holds significant value, but its difficulty is amplified due to variations in noise patterns and the requisite anatomical invariance of image content.
Yuehan Xia +3 more
semanticscholar +1 more source

