HE-HMTC: A hybrid embedding-based text representation for Hierarchical multi-label text classification

Xiaofeng Liu,Huili Liu,Yinglong Ma
DOI: https://doi.org/10.1016/j.simpa.2022.100397
2022-11-01
Software Impacts
Abstract:Hierarchical multi-label text classification (HMTC) has become rather challenging when it requires handling large sets of closely related categories. We present a novel software that provides a hybrid embedding-based text representation for HMTC, shortened as HE-HMTC. It made full use of categories’ structure and their labels semantics to enrich the text representation, and therefore attempted to improve the classification performance of the text. In addition, HE-HMTC facilitates improving the accuracy of HMTC tasks. Moreover, our HE-HMTC can easily be generalized in other hierarchical classification tasks and achieve superior performance.
What problem does this paper attempt to address?