Ensemble Clustering Algorithm Combined With Dimension Reduction Techniques for Power Load Profiles

ZHANG Bin,ZHUANG Chijie,HU Jun,CHEN Shuiming,ZHANG Mingming,WANG Ke,ZENG Rong
DOI: https://doi.org/10.13334/j.0258-8013.pcsee.2015.15.001
2015-01-01
Abstract:ABSTRACT:Load profiles clustering is a basic task for big data mining in electricity consumption database. This paper illustrated three typical validity indices and pointed out that Davies-Bouldin index is more suitable for assessing the clusters of load profiles. Hierarchical clustering, partitioning clustering, density-based clustering and model-based clustering were studied and the algorithms were evaluated from two aspects: efficiency and accuracy. The results prove that hierarchical clustering has high accuracy and low efficiency, while partitioning clustering has high efficiency and low accuracy. An ensemble algorithm was introduced and used for load profiles clustering, which was a combination of bootstrap sampling, partitioning clustering and hierarchical clustering. The ensemble clustering algorithm outperforms classical clustering algorithms on datasets of different scale and is especially suitable for large datasets clustering. Various techniques for reducing the dimension of the input datasets were studied and the results were compared from perspectives of computing time and information losses. The results indicate that the combination of principal component analysis and ensemble clustering algorithm performs better both in efficiency and accuracy for clustering large-scale load profiles.
What problem does this paper attempt to address?