Preprocessing Time Series Data for Classification with Application to CRM.

Yiming Yang,Qiang Yang,Wei Lu,Sinno Jialin Pan,Rong Pan,Chenhui Lu,Lei Li,Zhenxing Qin
DOI: https://doi.org/10.1007/11589990_16
2005-01-01
Abstract:We develop an innovative data preprocessing algorithm for classifying customers using unbalanced time series data. This problem is directly motivated by an application whose aim is to uncover the customers’ churning behavior in the telecommunication industry. We model this problem as a sequential classification problem, and present an effective solution for solving the challenging problem, where the elements in the sequences are of a multi-dimensional nature, the sequences are uneven in length and classes of the data are highly unbalanced. Our solution is to integrate model based clustering and develop an innovative data preprocessing algorithm for the time series data. In this paper, we provide the theory and algorithms for the task, and empirically demonstrate that the method is effective in determining the customer class for CRM applications in the telecommunications industry.
What problem does this paper attempt to address?