Weight attention layer‐based document classification incorporating information gain

Min Seok Lee,Seok Woo Yang,Hong Joo Lee
DOI: https://doi.org/10.1111/exsy.12833
IF: 3.3
2021-09-27
Expert Systems
Abstract:The performance of document classifiers largely depends on their internal representations of text data. Recent studies have been conducted to identify areas of focus and find latent data spaces to increase the representativeness and the performance of classifiers. In this study, we propose a weight attention layer (WAL) that uses an additional feature of words when computing their attention weights for deep learning models based on attention mechanisms. In the WAL, the attention distribution is calculated through the dot product of the attention weight matrix and a word weight matrix. We utilized information gain, which is one of the feature selection algorithms for the additional feature. To evaluate the proposed method, datasets of helpful reviews, sentiment reviews, and fake reviews were used. These datasets were applied to two deep learning models based on attention mechanisms, including an attention-based bidirectional long short-term memory (LSTM) and a hierarchical attention network. As a result of 10-fold cross validation, the improved performance of the models in terms of accuracy and F1-score when using WAL is demonstrated.
computer science, artificial intelligence, theory & methods
What problem does this paper attempt to address?