Recurrent convolutional video captioning with global and local attention.

Tao Jin, Yingming Li, Zhongfei Zhang
DOI: https://doi.org/10.1016/j.neucom.2019.08.042
IF: 6
2019-01-01
Neurocomputing
Abstract:
• We propose a novel video captioning model with global-local attention.
• We combine LSTM and 1D CNN in the decoder.
• Our model outperforms the state-of-the-art on MSVD and MSR-VTT.
What problem does this paper attempt to address?