Results 1 to 10 of about 69,316 (271)

DIC-Transformer: interpretation of plant disease classification results using image caption generation technology [PDF]

open access: yesFrontiers in Plant Science, 2023
Disease image classification systems play a crucial role in identifying disease categories in the field of agricultural diseases. However, current plant disease image classification methods can only predict the disease category and do not offer ...
Qingtian Zeng, Jian Sun, Shansong Wang
doaj   +2 more sources

Enhancing image caption generation through context-aware attention mechanism [PDF]

open access: yesHeliyon
Image captioning, the process of generating natural language descriptions based on image content, has garnered attention in AI research for its implications in scene understanding and human-computer interaction.
Ahatesham Bhuiyan   +3 more
doaj   +2 more sources

VSAM-Based Visual Keyword Generation for Image Caption

open access: yesIEEE Access, 2021
Image caption is to understand and describe the visual content, which is expected to be applied in automatic news reporting in future. In recent years, there has been an increasing interest in an Encoder-Decoder framework for image caption: the encoder ...
Suya Zhang   +3 more
doaj   +3 more sources

PBC-Transformer: Interpreting Poultry Behavior Classification Using Image Caption Generation Techniques [PDF]

open access: yesAnimals
Accurate classification of poultry behavior is critical for assessing welfare and health, yet most existing methods predict behavior categories without providing explanations for the image content. This study introduces the PBC-Transformer model, a novel
Jun Li   +7 more
doaj   +2 more sources

An Overview of Image Caption Generation Methods [PDF]

open access: yesComputational Intelligence and Neuroscience, 2020
In recent years, with the rapid development of artificial intelligence, image caption has gradually attracted the attention of many researchers in the field of artificial intelligence and has become an interesting and arduous task. Image caption, automatically generating natural language descriptions according to the content observed in an image, is an
Haoran Wang, Yue Zhang, Xiaosheng Yu
openaire   +3 more sources

Cross-Lingual Image Caption Generation Based on Visual Attention Model

open access: yesIEEE Access, 2020
As an interesting and challenging problem, generating image caption automatically has attracted increasingly attention in natural language processing and computer vision communities.
Bin Wang   +5 more
doaj   +3 more sources

Fine-Tuning a Small Vision Language Model Using Synthetic Data for Explaining Bacterial Skin Disease Images [PDF]

open access: yesDiagnostics
Background/Objectives: Vision language models (VLMs) show strong potential for medical image understanding, but their large scale often limits practical deployment. This study investigates whether a compact VLM can be effectively adapted for dermatology,
Shiwan Zhang   +3 more
doaj   +2 more sources

TVPRNN for image caption generation

open access: yesElectronics Letters, 2017
Image caption generation has attracted considerable interest in computer vision and natural language processes. However, existing methods usually use convolution neural network (CNN) for extracting image feature and recurrent NN (RNN) to predict next produced word, which may make the obtained features unadaptable to the word generated at current time ...
Haifeng Hu
exaly   +2 more sources

Content moderation assistance through image caption generation

open access: yesIntelligent Systems with Applications
The rapid growth in digital media creation has led to an increased challenge in content moderation. Manual and automated moderation are susceptible to risks associated with a slower response time and false positives arising from unpredictable user inputs
Liam Kearns
doaj   +2 more sources

Review of Image Captioning Methods Based on Encoding-Decoding Technology [PDF]

open access: yesJisuanji kexue yu tansuo, 2022
In recent years, image caption generation, as a multimodal task in the field of artificial intelligence, integrates the related research of computer vision and natural language processing, and can realize the modal conversion from image to text. It plays
GENG Yaogang, MEI Hongyan, ZHANG Xing, LI Xiaohui
doaj   +1 more source

Home - About - Disclaimer - Privacy