Regularization of Soft Actor-Critic Algorithms with Automatic Temperature Adjustment

Ben You
2023-05-23
Abstract:This work presents a comprehensive analysis to regularize the Soft Actor-Critic (SAC) algorithm with automatic temperature adjustment. The the policy evaluation, the policy improvement and the temperature adjustment are reformulated, addressing certain modification and enhancing the clarity of the original theory in a more explicit manner.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?