Taylor African vulture optimization algorithm with hybrid deep convolution neural network for image captioning system

Chengamma Chitteti, K. Reddy Madhavi
DOI: https://doi.org/10.1007/s11042-023-18080-0
IF: 2.577
2024-01-24
Multimedia Tools and Applications
Abstract: Image captioning using deep learning is useful in various applications, including aiding visually impaired individuals, improving content indexing and retrieval, and enhancing user experiences in fields like e-commerce and entertainment. Recently, deep learning models, particularly Convolutional Neural Networks (CNNs), have been used to generate effective text descriptions of input images. This study therefore designs a novel Taylor African vulture optimization algorithm with hybrid deep learning for image captioning system (TAVOHDL-ICS) technique. The purpose of the proposed technique is to exploit deep learning models to generate textual captions for input images. To accomplish this, the technique applies BERT word embedding, which captures the semantics of words and supports the generation of good captions. The Inception ResNetv2 model is employed to derive feature vectors from the input image. Moreover, a hybrid attention bidirectional gated recurrent unit model is used to generate the image captions, and its hyperparameters are tuned by the TAVO algorithm. The proposed technique is evaluated by simulation on the Flickr400 dataset, and the outcomes are inspected under several measures. The comparative analysis demonstrates better performance of the proposed model over existing algorithms, with METEOR, CIDEr, and Rouge-L scores of 33, 183, and 60, respectively.
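As an illustration of one of the reported measures, the following is a minimal sketch of sentence-level Rouge-L, which scores a generated caption against a reference caption through their longest common subsequence (LCS). It is not taken from the paper: the function names, the whitespace tokenization, and the weighting `beta=1.2` are assumptions made for this example, and the paper's evaluation code may differ. The sketch returns a value between 0 and 1.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference, beta=1.2):
    """Sentence-level Rouge-L F-score (hypothetical helper).

    beta weights recall against precision; 1.2 is an assumed value,
    not one stated in the abstract.
    """
    cand = candidate.lower().split()  # assumed: plain whitespace tokens
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)
    recall = lcs / len(ref)
    return (1 + beta**2) * precision * recall / (recall + beta**2 * precision)
```

For example, `rouge_l("a dog runs fast", "a dog runs")` has an LCS of three tokens, so recall is 1 and precision is 0.75; the extra word in the candidate lowers the score below 1.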
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering