Abstract:Over recent decades, the rapid growth in data makes ever more urgent the quest for highly scalable Bayesian networks that have better classification performance and expressivity (that is, capacity to respectively describe dependence relationships between attributes in different situations). To reduce the search space of possible attribute orders, k-dependence Bayesian classifier (KDB) simply applies mutual information to sort attributes. This sorting strategy is very efficient but it neglects the conditional dependencies between attributes and is sub-optimal. In this paper, we propose a novel sorting strategy and extend KDB from a single restricted network to unrestricted ensemble networks, i.e., unrestricted Bayesian classifier (UKDB), in terms of Markov blanket analysis and target learning. Target learning is a framework that takes each unlabeled testing instance P as a target and builds a specific Bayesian model Bayesian network classifiers (BNC) P to complement BNC T learned from training data T . UKDB respectively introduced UKDB P and UKDB T to flexibly describe the change in dependence relationships for different testing instances and the robust dependence relationships implicated in training data. They both use UKDB as the base classifier by applying the same learning strategy while modeling different parts of the data space, thus they are complementary in nature. The extensive experimental results on the Wisconsin breast cancer database for case study and other 10 datasets by involving classifiers with different structure complexities, such as Naive Bayes (0-dependence), Tree augmented Naive Bayes (1-dependence) and KDB (arbitrary k-dependence), prove the effectiveness and robustness of the proposed approach.

Learning Bayesian Multinets from Labeled and Unlabeled Data for Knowledge Representation.

Locally Weighted Learning: How and when Does It Work in Bayesian Networks?

A Bayesian Network nearest k-labels method for Multi-label classification

Semi-supervised Learning for K-Dependence Bayesian Classifiers

Exploiting the Implicit Independence Assumption for Learning Directed Graphical Models.

Efficient Heuristics for Learning Bayesian Network from Labeled and Unlabeled Data

Efficient heuristics for learning scalable Bayesian network classifier from labeled and unlabeled data

Probability Knowledge Acquisition from Unlabeled Instance Based on Dual Learning

Learning Balanced Bayesian Classifiers from Labeled and Unlabeled Data

Hierarchical Independence Thresholding for Learning Bayesian Network Classifiers.

Bagging K-Dependence Bayesian Network Classifiers

Discriminative Structure Learning of Bayesian Network Classifiers from Training Dataset and Testing Instance.

Exploring Complex Multivariate Probability Distributions with Simple and Robust Bayesian Network Topology for Classification

A Novel Approach to Fully Representing the Diversity in Conditional Dependencies for Learning Bayesian Network Classifier

Learning High-Dependence Bayesian Network Classifier with Robust Topology

Discriminatory Target Learning: Mining Significant Dependence Relationships from Labeled and Unlabeled Data.

Stochastic Optimization for Bayesian Network Classifiers

Label-Driven Learning Framework: Towards More Accurate Bayesian Network Classifiers Through Discrimination of High-Confidence Labels

Exploring the Non-Trivial Knowledge Implicit in Test Instance to Fully Represent Unrestricted Bayesian Classifier.

From Undirected Dependence to Directed Causality: A Novel Bayesian Learning Approach

Structure Learning of Bayesian Network Based on Adaptive Thresholding