Results 251 to 260 of about 153,605 (299)
Some of the next articles are maybe not open access.

MV-CLAM: Multi-View Molecular Interpretation with Cross-Modal Projection via Language Model

Conference on Empirical Methods in Natural Language Processing
Human expertise in chemistry and biomedicine relies on contextual molecular understanding, a capability that large language models (LLMs) can extend through fine-grained alignment between molecular structures and text. Recent multimodal learning advances
Sumin Ha   +3 more
semanticscholar   +1 more source

Multimedia Feature Mapping and Correlation Learning for Cross-Modal Retrieval

International Journal of Grid and High Performance Computing, 2018
This article describes how with the rapid increasing of multimedia content on the Internet, the need for effective cross-modal retrieval has attracted much attention recently. Many related works ignore the latent semantic correlations of modalities in the non-linear space and the extraction of high-level modality features, which only focuses on the ...
Xu Yuan   +4 more
openaire   +1 more source

The Application of Cross-modal Consistency Enhancement and Feature Fusion in High-frequency Feature-driven Face Forgery Detection

2025 IEEE 17th International Conference on Computer Research and Development (ICCRD)
With the rapid development of related technologies, face forgery techniques have become increasingly sophisticated. Meanwhile, it brings some potential risks.
Jia Meng   +3 more
semanticscholar   +1 more source

Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval

ACM Multimedia, 2018
This paper learns semantic embeddings for multi-label cross-modal retrieval. Our method exploits the structure in semantics represented by label vectors to guide the learning of embeddings.
Yiling Wu, Shuhui Wang, Qingming Huang
semanticscholar   +1 more source

Cross-modal association between vowels and colours: A cross-linguistic perspective.

Journal of the Acoustical Society of America, 2019
Previous studies showed similar mappings between sounds and colours for synaesthetes and non-synaesthetes alike, and proposed that common mechanisms underlie such cross-modal association.
P. Mok   +4 more
semanticscholar   +1 more source

Masking-Based Cross-Modal Remote Sensing Image–Text Retrieval via Dynamic Contrastive Learning

IEEE Transactions on Geoscience and Remote Sensing
Cross-modal remote sensing image-text retrieval (CMRSITR) aims to extract comprehensive information from diverse modalities. The primary challenge in this field is developing effective mappings between visual and textual modalities to a shared latent ...
Zuopeng Zhao   +7 more
semanticscholar   +1 more source

Syncgan: Synchronize the Latent Spaces of Cross-Modal Generative Adversarial Networks

IEEE International Conference on Multimedia and Expo, 2018
Generative adversarial network (GAN) has achieved impressive success on cross-domain generation, but it faces difficulty in cross-modal generation due to the lack of a common distribution between heterogeneous data.
Wen-Cheng Chen   +2 more
semanticscholar   +1 more source

GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency

Computer Vision and Pattern Recognition
Identifying affordance regions on 3D objects from semantic cues is essential for robotics and human-machine interaction. However, existing 3D affordance learning methods struggle with generalization and robustness due to limited annotated data and a ...
Dongyue Lu   +3 more
semanticscholar   +1 more source

What does Kiki look like? Cross-modal associations between speech sounds and visual shapes in vision-and-language models

Workshop on Cognitive Modeling and Computational Linguistics
Humans have clear cross-modal preferences when matching certain novel words to visual shapes. Evidence suggests that these preferences play a prominent role in our linguistic processing, language learning, and the origins of signal-meaning mappings. With
T. Verhoef   +2 more
semanticscholar   +1 more source

Robust Cross-modal Medical Image Translation via Diffusion Model and Knowledge Distillation

IEEE International Joint Conference on Neural Network
Medical image translation holds significant value, but its difficulty is amplified due to variations in noise patterns and the requisite anatomical invariance of image content.
Yuehan Xia   +3 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy