Abstract:Despite its unrealistic independence assumption, the Naive Bayes classifier is remarkably successful in practice. In the Naive Bayes classifier, all variables are assumed to be nominal variables, it means that each variable has a finite number of values. But in large databases, the variables (or fields) often take continuous values or have a large number of numerical values. So many researchers discussed the discretization (or crisp partitioning) of the domain of the continuous variables. We generalize the Naive Bayes classifier to the situation in which the fuzzy partition of the variable domains instead of discretization is taken. Therefore each variable in the Fuzzy Naive Bayes classifier can take a linguistic value or fuzzy set. From the observed data set one method of estimating the conditional probabilities in the Fuzzy Naive Bayes classifier is proposed in this paper. For each numeric input the method to predict its class label using the fuzzy Naive Bayes classifier is presented. In the training phase of the classifier, the training data (just including the feature variables without class labels) is first clustered in an unsupervised way by fuzzy c-means or a similar algorithm. Then the optimal cluster centers of training data are used to determine the fuzzy partition of the feature variables space. This generalization can decrease the complexity of learning optimal discretization which the classical Naive Bayes Classifier often faces, reduce the loss of information because of the discretization and increase the power of dealing with imprecise data and the large databases. Some well-known classification problems in the machine learning field have been tested in this paper, the results show that the Fuzzy Naive Bayes classifier is an alternative and effective tool to deal with the classification problem which has continuous variables.

Kind of Efficient Relational Naive Bayesian Classification Algorithm

An efficient multi-relational Naïve Bayesian classifier based on semantic relationship graph

Graph-NB:an Efficient and Accurate Multi-relational Naive Bayesian Classifier

A PERFORMANCE PREDICTION METHOD BASED ON NAIVE BAYES CLASSIFICATION

Multi-relational Bayesian Classification Algorithm with Rough Set

Evolutionary Lazy Learning for Naive Bayes Classification

Comparison and Research of Bayesian Classification Algorithm in Relational Learning

Fuzzy Naive Bayes Classifier Based on Fuzzy Clustering

A Novel Text Classification Algorithm Based on Naïve Bayes and KL-Divergence

Exploring Optimization of Semantic Relationship Graph for Multi-Relational Bayesian Classification

Learning Nt Bayesian Classifier Based On Canonical Cover Analysis Of Relational Database

A Scalable Classification Algorithm Exploring Database Technology

Learning Class-Level Bayes Nets for Relational Data

Naïve Bayes Classification in R

Naive Bayesian Classification Algorithm Based on Attribute Clustering under Different Classification

Naive Bayes Classification Algorithm Based on Optimized Training Data

Attribute Weighted Naive Bayesian Classification Algorithm

A Weighted Relational Classification Algorithm Based on Rough Set

Selecting Effective Features and Relations for Efficient Multi-Relational Classification.

A Fast Scalable Classifier Tightly Integrated with RDBMS

A General Multi-relational Classification Approach Using Feature Generation and Selection.