Robust Multimodal Representation Learning With Evolutionary Adversarial Attention Networks

Feiran Huang, Alireza Jolfaei, Ali Kashif Bashir
DOI: https://doi.org/10.1109/TEVC.2021.3066285
IF: 16.497
2021-01-01
IEEE Transactions on Evolutionary Computation
Abstract: Multimodal representation learning is beneficial for many multimedia-oriented applications, such as social image recognition and visual question answering. The different modalities of the same instance (e.g., a social image and its corresponding description) are usually correlational and complementary. Most existing approaches for multimodal representation learning are not effective to model the d...