An Attention Based Automatic Image Description Generation

R. Lakshmi Tulasi
DOI: https://doi.org/10.1007/978-981-16-3660-8_24
2021-01-01
Abstract:With the recent advancements in object detection and machine translation techniques, we proposed an attention based model that automatically performs the image caption generation. We have used two kinds of attention mechanisms named hard stochastic attention and soft deterministic attention. We can train the model using backpropagation techniques and our model concentrates on the important objects present in the image. By considering these salient objects it can generate the corresponding captions word by word as the output sequence of LSTM. We validated our model on the standard MSCOCO dataset.
What problem does this paper attempt to address?