VT-MFLV: Vision–Text Multimodal Feature Learning V Network for Medical Image Segmentation
Currently, existing multimodal segmentation methods face limitations in effectively leveraging medical text to guide visual feature learning. They often suffer from insufficient multimodal fusion and inadequate accuracy in fine-grained lesion ...
Wenju Wang +5 more
doaj +1 more source
Survey of research on multimodal semantic communication
With the cross-integration of artificial intelligence and communications, technologies for processing multimodal data such as text, image, audio, and video are booming, the shared dimension of modal semantics is being explored in depth, and the characteristics ...
Zhijin QIN +3 more
doaj +2 more sources
Multimodal Event Classification for Social Media Based on Text-Image-Caption Assisted Alignment
The vast amount and diverse forms of information (such as text, images, etc.) provide people with rich data. How to effectively obtain and utilize multimodal data has gradually become a research hotspot in the field of artificial intelligence.
Yuanting Wang
doaj +1 more source
FACTMS: Enhancing Multimodal Factual Consistency in Multimodal Summarization
Multimodal summarization (MS) generates text summaries from multimedia articles with textual and visual content. Therefore, MS can suffer from the multimodal factual inconsistency problem, where the generated summaries may distort or deviate from both ...
Mai Zhang, Hao Yan, Chaozhuo Li
doaj +1 more source
Voice and Touch Based Error-tolerant Multimodal Text Editing and Correction for Smartphones.
Zhao M +4 more
europepmc +1 more source
This chapter begins with the issues surrounding large-scale analyses of the modal ensemble through a case study that focuses on one form of contemporary written communication, the Instagram post, and demonstrates an analytical approach that takes into account the whole text, including non-verbal elements. It employs corpus-assisted multimodal discourse
openaire +2 more sources
The article considers a new video format that has become extremely popular due to its perfect alignment with mosaic thinking. Although these videos are widespread on social media, they remain understudied from a linguistic perspective.
M. A. Goncharova
doaj +1 more source
Multimodal Diversity of Postmodernist Fiction Text
The article is devoted to the analysis of structural and functional manifestations of multimodal diversity in postmodernist fiction texts. Multimodality is defined as the coexistence of more than one semiotic mode within a certain context.
U. I. Tykha
doaj +1 more source
Multimodal Sentiment Analysis Based on Cross-Modal Semantic Information Enhancement
With the development of social networks, people express their emotions through multiple modalities, including text, vision, and speech. In response to the failure of previous multimodal sentiment analysis methods to effectively obtain multimodal ...
LI Mengyun, ZHANG Jing, ZHANG Huanxiang, ZHANG Xiaolin, LIU Luyao
doaj +1 more source
Brian Bilston’s multimodal poetic practices: interactions between the digital and the analogue
This article examines texts by the modern British poet Brian Bilston from the perspective of their semantic and syntactic organisation and the lines of the author's investigation into paralinguistic, i.e. visual, elements.
Sabina G. Busareva
doaj +1 more source

