Machine learning classification algorithms and anomaly detection in conventional meters and Tunisian electricity consumption large datasets

Simona-Vasilica Oprea, Adela Bâra
DOI: https://doi.org/10.1016/j.compeleceng.2021.107329
2021-09-01
Abstract: Although fraud in electricity consumption is easier to detect when consumption is recorded hourly by smart meters, in most developing countries, where the propensity for fraud is higher, smart meters are not yet affordable. Fraud detection is easier with time-series data logging because the periodicity and variability of consumption reveal deviations from a regular consumption pattern. In contrast, fraud detection with conventional meters remains a significant challenge because anomalies in consumption are well hidden within the normal consumption of other consumers. In this paper, large datasets of consumer and invoice data from Tunisia are combined and investigated with several Machine Learning (ML) classification algorithms to detect irregularities in electricity consumption. With extensive feature engineering, including features based on the multivariate Gaussian distribution, ensemble classifiers such as Light Gradient Boosting (LGB) outperform the other algorithms and achieve realistic performance on challenging, unbalanced and uncorrelated input datasets.
engineering, electrical & electronic; computer science, interdisciplinary applications, hardware & architecture