Analysis and Prediction of Massive Electricity Information Based on Hadoop Ha Architecture

Ke Ma,Jiayang Wang
DOI: https://doi.org/10.1109/icicta49267.2019.00086
2019-01-01
Abstract:With the widespread use of smart grids and the Internet of Things, the amount of electricity generated by energy metering equipment is gradually increased. The traditional model has been more difficult to burden the storage, processing and analysis of massive electricity consumption information, and has gradually become the performance bottleneck of the system. Based on Hadoop with the advantages of distributed storage and parallel computing, the current performance bottleneck can be solved. In order to improve the fault tolerance of Hadoop in the production environment, the QJM (Quorum Journal Manager) mode is adopted as the high-availability shared storage mechanism, and the ZooKeeper cluster is introduced to complete the switching between the active and standby nodes to achieve smooth failover and avoidance. In order to fully analyze and explore the potential value in massive electricity consumption information, Hive data warehouse is used to realize complex query and multi-dimensional analysis. Based on statistical analysis data, a gray model is used to predict the trend of some data in future electricity usage information. Through the real-time collection of massive electricity consumption information, data cleaning and classification preprocessing, HDFS cloud storage, MapReduce parallel computing, HiveQL statistical analysis, the proposed scheme is efficient and feasible. It also verifies that the gray model can better predict the future power consumption trend under the Hadoop HA (High Available) architecture.
What problem does this paper attempt to address?