Machining parameter optimization for a batch milling system using multi-task deep reinforcement learning

Pei Wang, Yixin Cui, Haizhen Tao, Xun Xu, Sheng Yang
DOI: https://doi.org/10.1016/j.jmsy.2024.11.013
IF: 12.1
2024-12-01
Journal of Manufacturing Systems
Abstract: Integrated multi-objective optimization of machining parameters is important for improving machining quality and efficiency in batch milling systems. Because the state of a batch milling system changes over time, continued use of the same machining parameters may degrade quality and efficiency across the workpieces in a batch. Machining parameters are usually chosen by manual experience or trial and error, which makes it difficult to consider quality and efficiency together. To address this issue, a novel multi-task deep reinforcement learning method for machining parameter optimization in a batch machining system is proposed. First, a reliable parallel joint estimation model of multiple machining quality and efficiency indicators is established using a multi-task time series estimation method, which learns the correlations among these indicators to improve estimation accuracy. Then, the parameter optimization problem is formalized as a Markov decision process supported by a reinforcement learning virtual environment and an agent. The virtual environment is built on the joint estimation model to improve the accuracy of the optimized machining parameters for the collaborative optimization of quality and efficiency indicators. Within the virtual environment, a time series sequential state, a sequential action, a multi-objective reward function, and constraint conditions adapted to the joint estimation model are defined so that different machining parameters can be evaluated repeatedly. The agent is designed with multi-head attention and a dynamic weight adjustment mechanism to improve the stability of the optimization process. Finally, experiments on a real machining dataset of thin-walled parts show that, compared with the traditional deep reinforcement learning algorithm, the proposed framework improves the optimization effect by 9% to 12% and decreases the standard deviation by 9% to 18%.
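The abstract describes a virtual environment that scores candidate machining parameters with a multi-objective reward under constraint conditions. The following is a minimal sketch of that idea only, not the authors' method: the parameter names, bounds, weights, penalty value, and the `toy_estimator` function (a stand-in for the joint estimation model) are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bounds:
    low: float
    high: float

    def contains(self, value: float) -> bool:
        return self.low <= value <= self.high


# Hypothetical parameter limits; these values are not taken from the paper.
PARAM_BOUNDS = {
    "spindle_speed": Bounds(2000.0, 8000.0),  # rev/min
    "feed_rate": Bounds(100.0, 1200.0),       # mm/min
}


def toy_estimator(params):
    """Stand-in for a joint estimation model (assumed, made-up trends).

    Returns (quality_error, time_cost); lower is better for both.
    """
    quality_error = 0.5 + params["feed_rate"] / 1200.0 - params["spindle_speed"] / 16000.0
    time_cost = 1200.0 / params["feed_rate"]
    return quality_error, time_cost


def reward(params, w_quality=0.5, w_efficiency=0.5, penalty=10.0):
    """Weighted-sum reward with a fixed penalty for constraint violations."""
    # Constraint check: any out-of-range parameter yields the penalty.
    if not all(PARAM_BOUNDS[name].contains(value) for name, value in params.items()):
        return -penalty
    quality_error, time_cost = toy_estimator(params)
    # Both indicators are costs, so the reward is their negated weighted sum.
    return -(w_quality * quality_error + w_efficiency * time_cost)


feasible = reward({"spindle_speed": 8000.0, "feed_rate": 600.0})
infeasible = reward({"spindle_speed": 8000.0, "feed_rate": 5000.0})
```

In this sketch the weights are fixed; a dynamic weight adjustment mechanism such as the one the abstract mentions would vary `w_quality` and `w_efficiency` during training, and the abstract does not specify how.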
engineering, manufacturing, industrial, operations research & management science