Varying Naive Bayes Models with Applications to Classification of Chinese Text Documents

Guoyu Guan, Jianhua Guo, Hansheng Wang
DOI: https://doi.org/10.1080/07350015.2014.903086
2015-01-01
Abstract: Document classification is an area of great importance for which many classification methods have been developed. However, most of these methods cannot generate time-dependent classification rules, so they are not the best choices for problems with time-varying structures. To address this problem, we propose a varying naive Bayes model, a natural extension of the naive Bayes model that allows for time-dependent classification rules. Kernel smoothing is used for parameter estimation, and a BIC-type criterion is proposed for feature selection. Asymptotic theory is developed and numerical studies are conducted. Finally, the proposed method is demonstrated on a real dataset generated by the Mayor Public Hotline of Changchun, the capital city of Jilin Province in Northeast China.
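To make the idea of a time-dependent classification rule concrete, the following is a minimal sketch, not the authors' estimator. It assumes binary word-presence features, a scalar index variable u (for example, time rescaled to [0, 1]), a Gaussian kernel, and a Nadaraya-Watson style weighted average for each class-conditional probability. The function names, the bandwidth h, the clamping constant eps, and the toy data are all hypothetical, and the BIC-type feature selection step is not covered.

```python
import math


def gaussian_kernel(t):
    # Unnormalized Gaussian kernel; the constant cancels in the ratios below.
    return math.exp(-0.5 * t * t)


def smoothed_feature_prob(X, y, u, label, j, u0, h):
    """Kernel-weighted estimate of P(X_j = 1 | Y = label, U = u0)."""
    num = den = 0.0
    for x_i, y_i, u_i in zip(X, y, u):
        if y_i != label:
            continue
        w = gaussian_kernel((u_i - u0) / h)
        num += w * x_i[j]
        den += w
    # Fall back to 0.5 if no weight reaches this class (assumed convention).
    return num / den if den > 0 else 0.5


def smoothed_class_prob(y, u, label, u0, h):
    """Kernel-weighted estimate of P(Y = label | U = u0)."""
    num = den = 0.0
    for y_i, u_i in zip(y, u):
        w = gaussian_kernel((u_i - u0) / h)
        num += w * (1.0 if y_i == label else 0.0)
        den += w
    return num / den if den > 0 else 0.0


def classify(X, y, u, x_new, u0, h, eps=1e-6):
    """Pick the label with the largest naive Bayes log-score at index u0."""
    best_label, best_score = None, -math.inf
    for label in sorted(set(y)):
        score = math.log(max(smoothed_class_prob(y, u, label, u0, h), eps))
        for j, x in enumerate(x_new):
            p = smoothed_feature_prob(X, y, u, label, j, u0, h)
            p = min(max(p, eps), 1.0 - eps)  # keep the logarithms finite
            score += math.log(p) if x else math.log(1.0 - p)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy data (hypothetical): one word that signals class "A" early on
# (u near 0) and class "B" later (u near 1).
X = [[1], [1], [0], [0], [0], [0], [1], [1]]
y = ["A", "A", "B", "B", "A", "A", "B", "B"]
u = [0.0, 0.1, 0.0, 0.1, 0.9, 1.0, 0.9, 1.0]

early_label = classify(X, y, u, [1], u0=0.05, h=0.1)
late_label = classify(X, y, u, [1], u0=0.95, h=0.1)
```

In this toy setting the same document, containing the word, is assigned to class "A" near u = 0.05 and to class "B" near u = 0.95, which is the kind of behavior a classifier with fixed parameters cannot reproduce.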