Key quality characteristics selection for imbalanced production data using a two-phase bi-objective feature selection method

An-Da Li,Zhen He,Qing Wang,Yang Zhang
DOI: https://doi.org/10.1016/j.ejor.2018.10.051
IF: 6.4
2019-05-01
European Journal of Operational Research
Abstract:The selection of key quality characteristics (KQCs) that are significantly associated with product quality is essential for improving product quality. Production lines generally yield a larger number of regular products than do premium products, which creates an imbalance in production datasets and complicates KQC selection. In this study, a KQC selection method with an excellent ability to predict product quality is proposed based on a two-phase bi-objective feature selection method. The KQC selection model is established as a bi-objective problem of maximizing feature (i.e., quality characteristic) importance and minimizing percentage of selected features, and the geometric mean (G-mean) is selected as the feature importance metric for imbalanced data. To solve this model, a two-phase multi-objective optimization method is proposed; this method yields a set of candidate solutions (KQC sets) using an improved direct multi-search (DMS) strategy and uses the ideal point method (IPM) to select the final KQC sets from the candidate solutions. The experimental results indicate that the proposed method is effective for selecting KQCs for imbalanced production data.
operations research & management science
What problem does this paper attempt to address?