RefCap: image captioning with referent objects attributes [PDF]
In recent years, significant progress has been made in visual-linguistic multi-modality research, leading to advancements in visual comprehension and its applications in computer vision tasks.
Seokmok Park, Joonki Paik
doaj +2 more sources
Image Captioning Using Motion-CNN with Object Detection [PDF]
Automatic image captioning has many important applications, such as the depiction of visual contents for visually impaired people or the indexing of images on the internet.
Kiyohiko Iwamura +4 more
doaj +2 more sources
Social Image Captioning: Exploring Visual Attention and User Attention [PDF]
Image captioning in natural language has been an emerging trend. However, social images, which come with a set of user-contributed tags, have rarely been investigated for this task.
Leiquan Wang +5 more
doaj +2 more sources
An innovative multi-head attention mechanism-driven recurrent neural network model with feature representation fusion for enhanced image captioning to assist individuals with visual impairments [PDF]
Developments in image captioning technologies have played a crucial role in improving the quality of life for individuals with visual impairments, advancing social inclusivity.
Mashael M. Asiri +3 more
doaj +2 more sources
Image Captioning Based on Semantic Scenes [PDF]
With the development of artificial intelligence and deep learning technologies, image captioning has become an important research direction at the intersection of computer vision and natural language processing.
Fengzhi Zhao +3 more
doaj +2 more sources
Novel concept-based image captioning models using LSTM and multi-encoder transformer architecture [PDF]
Captioning an image involves using a combination of vision and language models to describe the image in an expressive and concise sentence. A successful captioning task requires extracting as much information as possible from the corresponding image.
Asmaa A. E. Osman +3 more
doaj +2 more sources
Image Captioning Optimization Strategy Based on Deep Learning [PDF]
Image captioning aims to automatically generate grammatically correct sentences that describe image content. It involves computer vision and natural language processing and is a classic task in the multimodal field. In recent years, a ...
ZHOU Ziyi, XIONG Hailing
doaj +1 more source
Pre-trained CNNs as Feature-Extraction Modules for Image Captioning
In this work, we present a thorough experimental study about feature extraction using Convolutional Neural Networks (CNNs) for the task of image captioning in the context of deep learning.
Muhammad Abdelhadie Al-Malla +3 more
doaj +1 more source
A Scientometric Visualization Analysis of Image Captioning Research From 2010 to 2020
Image captioning has gradually gained attention in the field of artificial intelligence and become an interesting and challenging task for image understanding.
Wenxuan Liu +4 more
doaj +1 more source
Stylized Image Captioning Model Based on Disentangle-Retrieve-Generate [PDF]
Image captioning aims to generate text that accurately describes the content of an input image. Stylized image captioning goes a step further and also takes language style into account. It ...
CHEN Zhang-hui, XIONG Yun
doaj +1 more source

