Multi-Label Classification of Chinese Books with LSTM Model

Deng Sanhong, Fu Yuyangzi, Wang Hao
DOI: https://doi.org/10.11925/infotech.2096-3467.2017.0484
2017-01-01
Abstract: [Objective] This paper proposes a new method for automatically cataloguing Chinese books based on an LSTM model, aiming to solve the issues facing single- and multi-label classification. [Methods] First, we introduced deep learning algorithms to construct a new classification system with a character embedding technique. Then, we trained the LSTM model with strings consisting of titles and keywords. Finally, we constructed multiple binary classifiers, which were examined with bibliographic data from three universities. [Results] The proposed model performed well and had practical value. [Limitations] We only analyzed five categories of Chinese bibliographies, and the granularity of classification was coarse. [Conclusions] The proposed Chinese book classification system based on the LSTM model could preprocess data and learn incrementally, and it could be transferred to other fields.
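The data preparation implied by the Methods can be sketched as follows: each title is concatenated with its keywords, encoded character by character into integer ids (the input an embedding layer would consume), and the multi-label annotations are split into one 0/1 target list per category so that a separate binary classifier can be trained for each. This is a minimal sketch, not the authors' implementation: the LSTM itself is omitted, and the function names, the sample records, the category codes, and `max_len` are all hypothetical.

```python
from typing import Dict, List, Sequence

PAD, UNK = 0, 1  # reserved ids: padding and unknown character (assumed convention)


def build_char_vocab(texts: Sequence[str]) -> Dict[str, int]:
    """Map every distinct character to an integer id, starting at 2."""
    vocab: Dict[str, int] = {}
    for text in texts:
        for ch in text:
            if ch not in vocab:
                vocab[ch] = len(vocab) + 2
    return vocab


def encode(text: str, vocab: Dict[str, int], max_len: int) -> List[int]:
    """Turn a string into a fixed-length list of character ids (truncate, then pad)."""
    ids = [vocab.get(ch, UNK) for ch in text[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


def binary_targets(
    label_sets: Sequence[Sequence[str]], categories: Sequence[str]
) -> Dict[str, List[int]]:
    """Split multi-label annotations into one 0/1 target list per category."""
    return {
        c: [1 if c in labels else 0 for labels in label_sets]
        for c in categories
    }


# Hypothetical records: (title, keywords, category labels).
records = [
    ("深度学习导论", "神经网络 机器学习", ["TP"]),
    ("信息经济学", "信息 经济", ["F", "G"]),
]

# Input strings are title + keywords, as described in the abstract.
texts = [title + keywords for title, keywords, _ in records]
vocab = build_char_vocab(texts)
encoded = [encode(t, vocab, max_len=16) for t in texts]

# One binary target vector per category, one entry per record.
targets = binary_targets([labels for _, _, labels in records], ["F", "G", "TP"])
```

Under these assumptions, each row of `encoded` would be fed to an embedding layer followed by an LSTM, and each entry of `targets` would supervise its own binary classifier. A book is then assigned every category whose classifier fires, which is what allows more than one label per book.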