Focus Your Attention: A Focal Attention for Multimodal Learning

Chunxiao Liu, Zhendong Mao, Tianzhu Zhang, Anan Liu, Bin Wang, Yongdong Zhang
DOI: https://doi.org/10.1109/TMM.2020.3046855
IF: 7.3
2022-01-01
IEEE Transactions on Multimedia
Abstract: The key point in multimodal learning is to learn semantic alignment, which finds the correspondence between sub-elements of instances from different modalities. The attention mechanism has shown its power in semantic alignment learning because it densely associates sub-elements across modalities. However, for each sub-element, existing attention aligns it with all the sub-elements from...