Abstract:Learning hidden topics from data streams has become absolutely necessary but posed challenging problems such as concept drift as well as short and noisy data. Using prior knowledge to enrich a topic model is one of potential solutions to cope with these challenges. Prior knowledge that is derived from human knowledge (e.g. Wordnet) or a pre-trained model (e.g. Word2vec) is very valuable and useful to help topic models work better. However, in a streaming environment where data arrives continually and infinitely, existing studies are limited to exploiting these resources effectively. Especially, a knowledge graph, that contains meaningful word relations, is ignored. In this paper, to aim at exploiting a knowledge graph effectively, we propose a novel graph convolutional topic model (GCTM) which integrates graph convolutional networks (GCN) into a topic model and a learning method which learns the networks and the topic model simultaneously for data streams. In each minibatch, our method not only can exploit an external knowledge graph but also can balance the external and old knowledge to perform well on new data. We conduct extensive experiments to evaluate our method with both a human knowledge graph (Wordnet) and a graph built from pre-trained word embeddings (Word2vec). The experimental results show that our method achieves significantly better performances than state-of-the-art baselines in terms of probabilistic predictive measure and topic coherence. In particular, our method can work well when dealing with short texts as well as concept drift. The implementation of GCTM is available at \url{<a class="link-external link-https" href="https://github.com/bachtranxuan/GCTM.git" rel="external noopener nofollow">this https URL</a>}.

Speech Topic Classification Based on Pre-trained and Graph Networks.

Speech Topic Classification Based on Multi-Scale and Graph Attention Networks

End-to-end Speech Topic Classification Based on Pre-Trained Model Wavlm

Topic Classification on Spoken Documents Using Deep Acoustic and Linguistic Features

Graph-Based Audio Classification Using Pre-Trained Models and Graph Neural Networks

Simplified Graph Learning for Inductive Short Text Classification

Linguistic Steganalysis with Graph Neural Networks

An End-to-End Speech Enhancement Framework Using Stacked Multi-scale Blocks.

Graph Fusion Network for Text Classification

Graph LSTM with Context-Gated Mechanism for Spoken Language Understanding.

Graph Structural-topic Neural Network

End-To-End Topic Classification Without Asr

Time-frequency Network for Robust Speaker Recognition

Speech Emotion Recognition Based on Temporal-Spatial Learnable Graph Convolutional Neural Network

Tensor Graph Convolutional Networks for Text Classification

Graph Neural Network Backend for Speaker Recognition

A Study on Graph Embedding for Speaker Recognition.

Predicting the Silent Majority on Graphs: Knowledge Transferable Graph Neural Network

STAGE: Simplified Text-Attributed Graph Embeddings Using Pre-trained LLMs

A Graph Convolutional Topic Model for Short and Noisy Text Streams

Speech Recognition for Air Traffic Control Via Feature Learning and End-to-end Training