Results 51 to 60 of about 51,006 (167)
A text classification method based on multimodal fusion enhancement
Although multimodal text classification techniques have potential when applied to specific scenarios, there are still some limitations.Existing multimodal fusion models require modal alignment in the input data, resulting in a large amount of incomplete ...
Dezhi LIU +3 more
doaj
Multimodal Transformer for Comics Text-Cloze
This work explores a closure task in comics, a medium where visual and textual elements are intricately intertwined. Specifically, Text-cloze refers to the task of selecting the correct text to use in a comic panel, given its neighboring panels. Traditional methods based on recurrent neural networks have struggled with this task due to limited OCR ...
Emanuele Vivoli +3 more
openaire +2 more sources
Perception of Violence in Multimodal Text: Cross-Disciplinary Approach
The article deals with the linguistic and psychological concepts explaining the perception of multimodal texts that imply information about violence. The importance of the subject is verified by the task of countering the spread of ideologies of violence,
Maria B. Voroshilova +2 more
doaj +1 more source
A social media geolocation prediction method based on multimodal fusion
Geographical information extracted from social media text reveals underlying spatial correlations.A geographical location prediction method for social media text based on multimodal fusion was proposed.By utilizing images associated with the text as ...
Shiduo HUANG, Yongchang XU, Haojun AI
doaj
Multimodal Text Sets to Use Literature and Engage All Learners in the Science Classroom. [PDF]
Lannin A +7 more
europepmc +1 more source
CONCEPTUAL INTEGRATION AS A MECHANISM OF DEMOTIVATOR INTERPRETATION
The demotivator as a multimodal text which synthesizes verbal and visual components and is characterized by multidimensionality, comic sense is analyzed.
Lyudmila Vladimirovna Babina +1 more
doaj
Why So Meme? A Comparative and Explainable Analysis of Multimodal Hateful Meme Detection
The rise of toxic content, particularly in the form of hateful memes, poses a significant challenge to social media platforms. This paper presents an empirical comparative study of unimodal and multimodal architectures for toxic content detection. Rather
Nor Saiful Azam Bin Nor Azmi +3 more
doaj +1 more source
Visual-Language Pre-training (VLP) Models demonstrate exceptional capability in understanding the interactions between images and text, yet they remain vulnerable to multimodal adversarial examples.
Xujie Ren +4 more
doaj +1 more source
A survey of multimodal composite editing and retrieval
In the real world, where information is abundant and diverse across different modalities, understanding and utilizing various data types to improve retrieval systems is a key focus of research. Multimodal composite retrieval integrates diverse modalities
Suyan Li, Fuxiang Huang, Lei Zhang
doaj +1 more source

