A proximal policy optimization based intelligent home solar management

Kode Creer,Imitiaz Parvez
2024-05-09
Abstract:In the smart grid, the prosumers can sell unused electricity back to the power grid, assuming the prosumers own renewable energy sources and storage units. The maximizing of their profits under a dynamic electricity market is a problem that requires intelligent planning. To address this, we propose a framework based on Proximal Policy Optimization (PPO) using recurrent rewards. By using the information about the rewards modeled effectively with PPO to maximize our objective, we were able to get over 30\% improvement over the other naive algorithms in accumulating total profits. This shows promise in getting reinforcement learning algorithms to perform tasks required to plan their actions in complex domains like financial markets. We also introduce a novel method for embedding longs based on soliton waves that outperformed normal embedding in our use case with random floating point data augmentation.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to maximize the profits of household solar users through intelligent planning in the smart grid. Specifically, when these users have renewable energy sources (such as solar panels) and energy storage units, they can sell excess electricity in the dynamic electricity market to obtain revenue. However, due to market price fluctuations and the uncertainty of energy production, how to choose the best energy trading strategy has become a challenge. The paper proposes a method based on Proximal Policy Optimization (PPO), combined with a recursive reward mechanism, aiming to solve this problem, and shows more than 30% performance improvement compared to other simple algorithms in experiments. In addition, the paper also introduces a new method based on Soliton Waves for data embedding, which performs better than traditional embedding methods when dealing with random floating - point data augmentation. Through these techniques, the paper demonstrates the potential of reinforcement learning algorithms to perform tasks in complex fields such as financial markets.