Perception based data mining and decision support systems

Guoqing Chen, Qiang Wei
2008-01-01
Abstract: The massive amount of data currently being accumulated and stored has necessitated the development of algorithmic techniques to analyze large databases in order to synthesize useful information for decision analysis and forecasting. Over the past fifteen years, the field of data mining has produced techniques for extracting information that have been successfully employed in a number of application areas, including economics, finance, business, telecommunications, and e-business. When humans are involved in the process, either by providing the input data or by utilizing the results of the data analysis, perception plays a role in the representation and the interpretation of the information. The methodology for computing with words and perceptions developed by Zadeh provides the basis for a fuzzy semantics of words and for reasoning with fuzzy perceptions [1–3]. The formal analysis of perceptions utilizes a linguistic term set consisting of words and modifiers. The underlying interpretation of the terms is given by fuzzy membership functions, and the modifiers are modeled by fuzzy set operations. Because of the ability to represent information in linguistic terms, the incorporation of fuzzy methodologies has been identified as one of the research areas with the potential to have a significant impact on the next generation of machine learning and data mining systems [4]. Fuzzy sets have been used in data mining since the mid-1990s (see [5] for an overview of early applications of fuzzy set theory in data mining). The primary motivation for fuzzy sets in this work was to avoid unnatural boundaries in partitioning attribute domains in quantitative association rules and to facilitate the interpretation of the resulting rules. In representing perceptions linguistically and computing with words, the focus is more on the term set itself than on the underlying membership functions.
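The idea of terms interpreted as membership functions and modifiers modeled as fuzzy set operations can be illustrated with a minimal sketch. It assumes piecewise-linear membership functions over a hypothetical attribute domain [0, 100] and Zadeh's classical power hedges (squaring for "very", square root for "roughly"). The function names and the domain are illustrative assumptions and are not taken from the papers in this issue, which may model hedges differently.

```python
def small(x: float, lo: float = 0.0, hi: float = 100.0) -> float:
    """Degree to which x is 'small': 1 at lo, falling linearly to 0 at the midpoint.

    The shape and the [lo, hi] domain are assumptions made for illustration.
    """
    mid = (lo + hi) / 2
    if x <= lo:
        return 1.0
    if x >= mid:
        return 0.0
    return (mid - x) / (mid - lo)


def big(x: float, lo: float = 0.0, hi: float = 100.0) -> float:
    """Degree to which x is 'big': 0 up to the midpoint, rising linearly to 1 at hi."""
    mid = (lo + hi) / 2
    if x <= mid:
        return 0.0
    if x >= hi:
        return 1.0
    return (x - mid) / (hi - mid)


def very(mu: float) -> float:
    """Hedge 'very' as concentration: squaring lowers partial memberships."""
    return mu ** 2


def roughly(mu: float) -> float:
    """Hedge 'roughly' as dilation: the square root raises partial memberships."""
    return mu ** 0.5


def crisp_small(x: float, threshold: float = 25.0) -> float:
    """A crisp partition for comparison: membership jumps from 1 to 0 at the threshold."""
    return 1.0 if x <= threshold else 0.0


if __name__ == "__main__":
    for x in (24.0, 25.0, 26.0):
        # Fuzzy membership changes gradually around 25, the crisp one flips abruptly.
        print(x, small(x), very(small(x)), crisp_small(x))
```

In this sketch the values 24, 25, and 26 receive nearly equal degrees of "small", whereas the crisp partition assigns 26 to a different class than 25. This is the kind of unnatural boundary that fuzzy partitions of attribute domains are meant to avoid.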
The papers in this special issue are concerned with the generation and refinement of linguistic terms from data and the assessment of data represented as linguistic terms. This volume contains five papers that consider different aspects of perception based data mining and decision analysis. In all of the papers, fuzzy sets provide the underlying representation of linguistic and perceptual information. The first four papers consider techniques for identifying linguistically represented associations in numeric data, temporal data, and web browsing data. The final paper focuses on the analysis of linguistically represented assessment information in a decision support system. The first two papers describe general methods for the construction of linguistically describable associations and use logical properties to reduce the size of the resulting sets of fuzzy rules. The paper “Mining Pure Linguistic Associations from Numerical Data” by Vilém Novák, Irina Perfilieva, Antonín Dvořák, Guoqing Chen, Qiang Wei and Peng Yan presents a method for a direct search for linguistic associations in numerical data. The support for the associations is determined using the theory of evaluating linguistic expressions and predications. Linguistic association rules discovered from data are composed of linguistic expressions such as “small” and “big”, modified by fuzzy hedges such as “extremely”, “significantly”, “very”, and “quite roughly”. Real-valued data is evaluated using the corresponding linguistic expressions and standard data-mining techniques