Abstract:With the development of the Mobile Internet, more and more people create and release multi-modal posts on social media platforms. Fake news detection has become an increasingly challenging task. Although many current works focus on constructing models extracting abstract features from the content of each post, they neglect the intrinsic semantic architecture such as latent topics, etc. These models only learn patterns in content coupled with certain specific latent topics on the training set to distinguish real and fake posts, which will suffer generalization and discriminating ability decline, especially when posts are associated with rare or new topics. Moreover, most existing works using deep schemes to extract and integrate textual and visual representation in post have not effectively modeled and sufficiently utilized the complementary and noisy multi-modal information containing semantic concepts and entities to complement and enhance each modal. In this paper, to deal with the above problems, we propose a novel end-to-end Multi-modal Topic Memory Network (MTMN), which obtains and combines post representations shared across latent topics together with global features of latent topics while modeling intra-modality and inter-modality information in a unified framework. (1) To tackle real scenarios where newly arriving posts with different topic distribution from the training data, our method incorporates a topic memory module to explicitly characterize final representation as post feature shared across topics and global features of latent topics. These two kinds of features are jointly learned and then combined to generate robust representation. (2) To effectively integrate multi-modality information in posts, we propose a novel blended attention module for multi-modal fusion, which can simultaneously exploit the intra-modality relation within each modal and the inter-modality relation between text words and image regions to complement and enhance each other fo- high-quality representation. Extensive experiments on two public real-world datasets demonstrate the superior performance of MTMN compared with other state-of-the-art algorithms.

End-to-End Deep Memory Network for Visual-Textual Sentiment Analysis

Sentiment Analysis Using Deep Robust Complementary Fusion of Multi-Features and Multi-Modalities.

Image-Text Multimodal Emotion Classification via Multi-View Attentional Network

Aspect Level Sentiment Classification with Deep Memory Network

Multi-Interactive Memory Network for Aspect Based Multimodal Sentiment Analysis

MultiSentiNet: A Deep Semantic Network for Multimodal Sentiment Analysis

A Multimodal Sentiment Analysis Approach Based on a Joint Chained Interactive Attention Mechanism

Convolutional multi-head self-attention on memory for aspect sentiment classification

Attention-Based Modality-Gated Networks for Image-Text Sentiment Analysis

A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis

Visual-Textual Sentiment Analysis Enhanced by Hierarchical Cross-Modality Interaction

Multimodal Memory Modelling for Video Captioning

MFSC: A Multimodal Aspect-Level Sentiment Classification Framework with Multi-Image Gate and Fusion Networks

Fake News Detection via Multi-Modal Topic Memory Network

ModalNet: an aspect-level sentiment classification model by exploring multimodal data with fusion discriminant attentional network

[Spinal cord injuries. An intact nerve can be enough for a successful phrenic nerve stimulation].

Multi-level textual-visual alignment and fusion network for multimodal aspect-based sentiment analysis

Multi-Task Multi-Head Attention Memory Network for Fine-Grained Sentiment Analysis.

Text-oriented Modality Reinforcement Network for Multimodal Sentiment Analysis from Unaligned Multimodal Sequences