Abstract:Data mining, or knowledge discovery in databases (KDD), is an interdisciplinary field that integrates techniques from several research areas including machine learning, statistics, database systems, and pattern recognition, for the analysis of large volumes of possibly complex, highly-distributed and poorly-organized data. The prosperity of the data mining field may attribute to two essential reasons. Firstly, a huge amount of data is collected and stored everyday. On the one hand, along with the continuing development of advanced technologies in many domains, data is generated at enormous speeds. For examples, purchases data at department/grocery stores, bank/credit card transaction data, e-commerce data, Internet traffic data that describes the browsing history of Web users, remote sensor data from agricultural satellites, and gene expression data from microarray technology. On the other hand, the progress made in hardware technology allows today’s computer systems to store very large amounts of data. Secondly, with these large volumes of data at hand, the data owners have an imminent intent to turn them into useful knowledge. From a commercial viewpoint, the ultimate goal of the data owners is to gain more and pay less for their business activities. Under the competition pressure, they want to enhance their services, develop cost-effective strategies, and target the right group of potential customers. From a scientific viewpoint, when traditional techniques are infeasible in dealing with the raw data, data mining may help scientists in many ways, such as classifying and segmenting data. By applying the knowledge extracted from data mining, the business analyst may rate customers by their propensity to respond to an offer, the doctor may estimate the probability of an illness re-occurrence, the website publisher may display customized Web pages to individual Web users according to their browsing habit, and the geneticist may discover novel gene-gene interaction patterns. In this talk, we aim to provide a general picture for important data mining steps, topics, algorithms and challenges.

Knowledge Discovery Processing Model Based on Data Extractor

Knowledge Discovery in Multiple Databases

Workshop on Model Mining

A Pattern of Experiment Data Mining Based on Π Theorem Combined with ANN

Research On The Identification Model Of Customer Knowledge Source Based On Data Mining

Data Mining: Algorithms and Problems

Knowledge Discovery in Very Large Databases.

A Multi-Objective Model for Discovering High-Quality Knowledge Based on Data Quality and Prior Knowledge

Knowledge discovery based on soft computing model in the area of data mining

A Review on Data Preprocessing Techniques Toward Efficient and Reliable Knowledge Discovery From Building Operational Data

DMiner-I: A Software Tool of Data Mining and Its Applications

DATA PREPROCESSING ALGORITHM Ⅱ BASED ON HYBRID OPTIMIZATION ALGORITHMS

Knowledge-Based Simulation Experiment Data Integrative Analysis Technology

Decision Trees Based Knowledge Discovery in Databases for High-Rise Structures Intelligent from Selection

An Evolutive Frequent Pattern Tree-based Incremental Knowledge Discovery Algorithm

Development Environment of Application Domain-oriented Knowledge Discovery System KDIST

Data Mining Algorithm Data Model Data Analysis Based on Artificial Intelligence Technology

Computer-Aided Data Mining: Automating a Novel Knowledge Discovery and Data Mining Process Model for Metabolomics

International study on Internet/Web data mining with the state of art and advances

An overview of data mining and knowledge discovery

Description and Method Research on Knowledge Discovery in Sequence Mode