Classification of Sensitive Web Documents

Hui Gao,Yan Fu,Jian-Ping Li
DOI: https://doi.org/10.1109/icacia.2008.4770027
2008-01-01
Abstract:Web document classification is the process of grouping web documents into one or more predefined categories based on their content. It is an important component of web monitor system that can assist people to reduce the dissemination of harmful information. This paper proposes a combined approach for building a decision tree with the multilayer neural network as its categorically value function, and presented a complete approach for automated news categorization. The experimental evaluation demonstrates that this approach provides better classification accuracy than single traditional text categorization methods.
What problem does this paper attempt to address?