Constant force grinding controller for robots based on SAC optimal parameter finding algorithm

Chosei Rei,Qichao Wang,Linlin Chen,Xinhua Yan,Peng Zhang,Liwei Fu,Chong Wang,Xinghui Liu
DOI: https://doi.org/10.1038/s41598-024-63384-2
IF: 4.6
2024-06-21
Scientific Reports
Abstract:Since conventional PID (Proportional–Integral–Derivative) controllers hardly control the robot to stabilize for constant force grinding under changing environmental conditions, it is necessary to add a compensation term to conventional PID controllers. An optimal parameter finding algorithm based on SAC (Soft-Actor-Critic) is proposed to solve the problem that the compensation term parameters are difficult to obtain, including training state action and normalization preprocessing, reward function design, and targeted deep neural network design. The algorithm is used to find the optimal controller compensation term parameters and applied to the PID controller to complete the compensation through the inverse kinematics of the robot to achieve constant force grinding control. To verify the algorithm's feasibility, a simulation model of a grinding robot with sensible force information is established, and the simulation results show that the controller trained with the algorithm can achieve constant force grinding of the robot. Finally, the robot constant force grinding experimental system platform is built for testing, which verifies the control effect of the optimal parameter finding algorithm on the robot constant force grinding and has specific environmental adaptability.
multidisciplinary sciences
What problem does this paper attempt to address?
This paper mainly addresses the issue of how to maintain constant force control during robot grinding. Traditional PID controllers have difficulty in achieving this goal stably under environmental changes, so compensation terms need to be added. The paper proposes an optimal parameter search algorithm based on the soft Actor-Critic (SAC) algorithm to solve the problem of obtaining compensation term parameters. This algorithm includes training state-action and normalization preprocessing, reward function design, and target deep neural network design to find the optimal controller compensation term parameters. It is applied to the PID controller to achieve compensation through the inverse kinematics of the robot and realize constant force grinding control. To verify the feasibility and adaptability of the algorithm, the paper establishes a grinding robot simulation model with force perception information and conducts experimental tests. The experimental results show that the controller trained by the SAC algorithm can achieve constant force grinding of the robot and has good control performance in different environments. In addition, a real-world testing platform for robot constant force grinding is constructed to further confirm the control effect and environmental adaptability of the optimal parameter search algorithm based on the SAC algorithm.