Editorial – a Special Issue on Data Mining

Jing Li, Murat Külahçı
DOI: https://doi.org/10.1002/qre.1720
2014-01-01
Quality and Reliability Engineering International
Abstract: In many of today’s data-driven applications, the main theme seems to revolve around the effective and efficient use of ‘big data’ generated in modern processes/systems. This usually requires the involvement of various disciplines such as Mathematics, Computer Science and Statistics. The term data mining is often used to describe, in general, efforts to analyze large data sets, and it has attracted enormous interest in multiple research fields in recent years. The knowledge generated through data mining has benefited a variety of application domains. This special issue of QREI provides a collection of recent research efforts in this area. We are very grateful for the contributions of the authors who submitted their work for this special issue. After two rounds of rigorous reviews, eight papers were accepted for publication. The application domains include engineering, health care, mobile networks, and finance. Motivated by an ingot growth process in semiconductor manufacturing, the paper by Dai et al. proposes a method to monitor growth profile trajectories of unequal lengths. The paper by Zhang et al. proposes a method to estimate the remaining useful life for engineering systems with highly fluctuating degradation. The paper by Zeng and Peterson develops models to mine telephone nurse triage data. These models reveal the variation in the accessibility of the triage service and the effect of weekday/weekend, which provide significant information for performance assessment and improvement of the service. Duan et al. present a semisupervised learning method to integrate low-accuracy and high-accuracy mobile device location data. The paper by Li et al. proposes an L1-regularized support vector machine and applies it to the modeling of financial early warning systems. Phaladiganon et al. propose a support vector data description method, in which the boundary constructed for differentiating novel from normal patterns considers both the shape and the dense regions of the data. Han and Clemmensen propose a double weighted support vector regression in which one weight is added to the slack variables in the objective function and another weight to the slack variables in the constraints. This approach is shown to be able to describe the relative importance of observations and lower the influence of possible outliers. The paper by Smith et al. studies how to improve statistical process control methods for understanding and visualizing process variation in data-rich environments and creates a quality visualization toolkit for practitioners. Finally, we would like to extend our sincere thanks to the reviewers for their valuable and timely evaluation of the manuscripts and to the Chief Editors, Douglas Montgomery and Aarnout Brombacher, for providing this opportunity to share with QREI readers the outstanding research efforts in data mining.