Optimization of Internet Content Filtering---combined with KNN and OCAT Algorithms

Tianze Guo, Lingjing Wu, Jiaming Liu
DOI: https://doi.org/10.1063/1.5033805
2018-01-01
AIP Conference Proceedings
Abstract: With illegal content rampant on the Internet, the traditional ways of filtering information, namely keyword recognition and manual screening, are producing increasingly poor results. To address this, the paper uses the OCAT algorithm nested with the KNN classification algorithm to construct a corpus training library that can learn and update dynamically. The filter corpus can thus keep pace with the constantly changing illegal content on the network, including text and pictures, so that illegal content and its source can be filtered and investigated more effectively. Future research will focus on simplifying the updating of the recognition and comparison algorithms and on optimizing the learning ability of the corpus, in order to improve filtering efficiency and save time and resources.
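As a rough illustration of the idea of a KNN classifier backed by a corpus that can be extended at any time, the following minimal Python sketch may help. It is not the authors' implementation: the OCAT component is not shown, only text is handled, and the class name `UpdatableKnnFilter`, the helper functions `vectorize` and `cosine`, the labels "blocked" and "allowed", and the sample sentences are all assumptions made for this example.

```python
import math
from collections import Counter


def vectorize(text):
    """Turn text into a bag-of-words term-count vector (hypothetical helper)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class UpdatableKnnFilter:
    """KNN text classifier whose labeled corpus can grow over time.

    Illustrative only: a plain majority vote over the k most similar
    corpus entries, with no OCAT step.
    """

    def __init__(self, k=3):
        self.k = k
        self.corpus = []  # list of (term-count vector, label) pairs

    def add(self, text, label):
        """Add a newly labeled example so later queries can use it."""
        self.corpus.append((vectorize(text), label))

    def classify(self, text):
        """Return the majority label among the k nearest corpus entries."""
        if not self.corpus:
            raise ValueError("corpus is empty")
        query = vectorize(text)
        ranked = sorted(
            self.corpus,
            key=lambda item: cosine(query, item[0]),
            reverse=True,
        )
        votes = Counter(label for _, label in ranked[: self.k])
        return votes.most_common(1)[0][0]


if __name__ == "__main__":
    knn_filter = UpdatableKnnFilter(k=3)
    knn_filter.add("buy cheap pills online now", "blocked")
    knn_filter.add("cheap pills discount offer", "blocked")
    knn_filter.add("weather forecast for the weekend", "allowed")
    knn_filter.add("local library opening hours", "allowed")
    knn_filter.add("weekend football match results", "allowed")
    print(knn_filter.classify("cheap pills offer"))
    print(knn_filter.classify("weekend weather"))
```

In this sketch the corpus is the only state, so "learning" amounts to calling `add` with newly labeled samples; a new kind of unwanted content becomes recognizable as soon as a few examples of it have been added, without retraining anything else.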