Naive Bayes Based Criminal Text Classification of Unbalanced Classes

程春惠,何钦铭
DOI: https://doi.org/10.3778/j.issn.1002-8331.2009.35.038
2009-01-01
Abstract:According to the feature of case text,this paper explores the special text preprocessing method and compares two effective feature selection methods.An improved model based on multi-variate Bernoulli model is proposed,due to the unbalanced distribution of criminal case categories.The experiment indicates that the improved Naive Bayes method performs better in the case text classification.
What problem does this paper attempt to address?