Comparative Study of Deep Learning Algorithms For Image Caption Generation
- DOI
- 10.2991/978-94-6463-738-0_14How to use a DOI?
- Keywords
- Image Captioning; Deep Learning; Inception V3; ResNet 50; BiLSTM; LSTM
- Abstract
In the recent times, image captioning had achieved widespread interest in the field of natural language processing. With the booming advancements in deep learning, there is a huge scope for the development in image captioning using deep learning. The proposed research addresses the challenges faced by researchers like generalization and variable embedding sizes by holding a comparative analysis of deep learning algorithms for image caption generation. In the proposed research work, transfer learning and Recurrent Neural Networks (RNN) have been used for autonomous generation of image captions. Pre-trained Inception V3 and RESNET 50 models were used to extract image feature vectors and GloVe 6B 200D was used to create embedding of the captions. A merge model had been proposed to integrate the intricacies of captions and images. The proposed model is trained on Flick 8k dataset and a combo of Inception V3 and BiLSTM had achieved a BLEU score of 0.576470.
- Copyright
- © 2025 The Author(s)
- Open Access
- Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF AU - Valavala S. S. S. R. Manikumar AU - G. Bharathi Mohan PY - 2025 DA - 2025/06/22 TI - Comparative Study of Deep Learning Algorithms For Image Caption Generation BT - Proceedings of the International Conference on Advances and Applications in Artificial Intelligence (ICAAAI 2025) PB - Atlantis Press SP - 160 EP - 178 SN - 1951-6851 UR - https://doi.org/10.2991/978-94-6463-738-0_14 DO - 10.2991/978-94-6463-738-0_14 ID - Manikumar2025 ER -