DIC-Transformer: interpretation of plant disease classification results using image caption generation technology [PDF]
Disease image classification systems play a crucial role in identifying disease categories in the field of agricultural diseases. However, current plant disease image classification methods can only predict the disease category and do not offer ...
Qingtian Zeng, Jian Sun, Shansong Wang
doaj +2 more sources
Enhancing image caption generation through context-aware attention mechanism [PDF]
Image captioning, the process of generating natural language descriptions based on image content, has garnered attention in AI research for its implications in scene understanding and human-computer interaction.
Ahatesham Bhuiyan +3 more
doaj +2 more sources
VSAM-Based Visual Keyword Generation for Image Caption
Image caption is to understand and describe the visual content, which is expected to be applied in automatic news reporting in future. In recent years, there has been an increasing interest in an Encoder-Decoder framework for image caption: the encoder ...
Suya Zhang +3 more
doaj +3 more sources
PBC-Transformer: Interpreting Poultry Behavior Classification Using Image Caption Generation Techniques [PDF]
Accurate classification of poultry behavior is critical for assessing welfare and health, yet most existing methods predict behavior categories without providing explanations for the image content. This study introduces the PBC-Transformer model, a novel
Jun Li +7 more
doaj +2 more sources
An Overview of Image Caption Generation Methods [PDF]
In recent years, with the rapid development of artificial intelligence, image caption has gradually attracted the attention of many researchers in the field of artificial intelligence and has become an interesting and arduous task. Image caption, automatically generating natural language descriptions according to the content observed in an image, is an
Haoran Wang, Yue Zhang, Xiaosheng Yu
openaire +3 more sources
Cross-Lingual Image Caption Generation Based on Visual Attention Model
As an interesting and challenging problem, generating image caption automatically has attracted increasingly attention in natural language processing and computer vision communities.
Bin Wang +5 more
doaj +3 more sources
Fine-Tuning a Small Vision Language Model Using Synthetic Data for Explaining Bacterial Skin Disease Images [PDF]
Background/Objectives: Vision language models (VLMs) show strong potential for medical image understanding, but their large scale often limits practical deployment. This study investigates whether a compact VLM can be effectively adapted for dermatology,
Shiwan Zhang +3 more
doaj +2 more sources
TVPRNN for image caption generation
Image caption generation has attracted considerable interest in computer vision and natural language processes. However, existing methods usually use convolution neural network (CNN) for extracting image feature and recurrent NN (RNN) to predict next produced word, which may make the obtained features unadaptable to the word generated at current time ...
Haifeng Hu
exaly +2 more sources
Content moderation assistance through image caption generation
The rapid growth in digital media creation has led to an increased challenge in content moderation. Manual and automated moderation are susceptible to risks associated with a slower response time and false positives arising from unpredictable user inputs
Liam Kearns
doaj +2 more sources
Review of Image Captioning Methods Based on Encoding-Decoding Technology [PDF]
In recent years, image caption generation, as a multimodal task in the field of artificial intelligence, integrates the related research of computer vision and natural language processing, and can realize the modal conversion from image to text. It plays
GENG Yaogang, MEI Hongyan, ZHANG Xing, LI Xiaohui
doaj +1 more source

