Abstract:The usefulness of the results produced by data mining methods can be critically impaired by several factors such as (1) low quality of data, including errors due to contamination, or incompleteness due to limited bandwidth for data acquisition, and (2) inadequacy of the data model for capturing complex probabilistic relationships in data. Fortunately, a wide spectrum of applications exhibit strong dependencies between data samples. For example, the readings of nearby sensors are generally correlated, and proteins interact with each other when performing crucial functions. Therefore, dependencies among data can be successfully exploited to remedy the problems mentioned above. In this paper, we propose a unified approach to improving mining quality using Markov networks as the data model to exploit local dependencies. Belief propagation is used to efficiently compute the marginal or maximum posterior probabilities, so as to clean the data, to infer missing values, or to improve the mining results from a model that ignores these dependencies. To illustrate the benefits and great generality of the technique, we present its application to three challenging problems: (i) cost-efficient sensor probing, (ii) enhancing protein function predictions, and (iii) sequence data denoising.

Algorithms of Nonmonotonic Data Mining Based on Concept Hierarchy and Layered Mining

Fuzzy Clustering-Based Quantitative Association Rules Mining in Multidimensional Data Set

Association Rules Mining Based on the Discriminative Concept Lattice

Dtgc-Tree: A New Strategy Of Association Rules Mining

Density-Based Mining of Quantitative Association Rules

Data Mining In Multisensor System Based On Rough Set Theory

Mining Hierarchical Decision Rules from Hybrid Data with Categorical and Continuous Valued Attributes

A Novel Feature Decomposition Method To Develop Multi-Hierarchy Model

Visual Analysis of User-Driven Association Rule Mining

A Method of Failure Diagnosis Based on Association Rules and Study

Mining of Multi-Relational Association Rules

A Dynamic Approach Based on Apriorilike Algorithm for Mining Association Rules

Mining positive and negative rules via one-sided fuzzy three-way concept lattices

An Optimized Method for Mining Biological Data Multilevel Association Rules

Develop Multi-hierarchy Classification Model: Rough Set Based Feature Decomposition Method

Mining Generalized Association Rules with Fuzzy Taxonomic Structures

Mining Positive and Negative Fuzzy Association Rules.

Improving Mining Quality by Exploiting Data Dependency

NeuroRule: A Connectionist Approach to Data Mining

Method of Association Rules Mining Based on Genetic Algorithms

Mining A Complete Set Of Both Positive And Negative Association Rules From Large Databases