Bad Data Processing Strategies for Short-Term Bus Load Forecasting Based on Stratified Analysis of Characteristic Matrix

Qian Sun,Jiangang Yao,Min Jin,Shengjie Yang,Shaolin Kuang,Zhenchao Xu
DOI: https://doi.org/10.3969/j.issn.1000-6753.2013.07.032
2013-01-01
Abstract:As an important link, the original data analysis would improve the accuracy of short-term bus load forecasting a lot. Thus, a bad data processing strategy based on stratified analysis of characteristic matrix is presented. Firstly, the AFS clustering algorithm for dividing sample set optimal clustering structure is studied. The search interval of clustering number for per-unit curve sample set is calculated using AP(Affinity propagation) clustering algorithm, and the initialized matrix is obtained on the basis of the density index arranged according to the decreasing order. Then the optimal clustering results are finally achieved by effectiveness testing based on the Silhouette index. Referring to the characteristic curves, the horizontal and vertical eigenvectors reflecting properties of the load points are calculated, and the characteristic matrix is formed. By applying the discriminant criterion, the stratified analysis for the characteristic matrix of daily load curve is carried out, and thereafter the corresponding bad data processing strategies focusing on bus loads which have different variation of characteristics are established.Case study shows that the proposed method could improve the quality of raw data as well as the bus load forecasting accuracy effectively.
What problem does this paper attempt to address?