Abstract:The integrated multi-objective optimization of machining parameters for improved machining quality and efficiency is important in batch milling systems. Due to the change of the batch milling system state, the continuous use of the same machining parameters may lead to degradation in quality and efficiency for workpieces in batches. Machining parameter optimization is usually determined by manual experience or trial-and-error methods, making it difficult to achieve a synergistic consideration of both quality and efficiency. To address this issue, a novel multi-task deep reinforcement learning method for machining parameter optimization in a batch machining system is proposed. Firstly, a reliable parallel joint estimation model of multiple machining quality and efficiency indicators is established using a multi-task time series estimation method, which can learn the correlation of these indicators to improve estimation accuracy. Then, the parameter optimization problem is formalized as a Markov decision process supported by a reinforcement learning virtual environment and an agent. The reinforcement learning virtual environment with the joint estimation model is constructed to improve the accuracy of optimized machining parameters for the collaborative optimization of quality and efficiency indicators. Within the virtual environment, time series sequential state, sequential action, multi-objective reward function, and constraint conditions adapted to the joint estimation model are defined to repeatedly evaluate different machining parameters. The agent with a multi-head attention and a dynamic weight adjustment mechanism is designed to improve the stability of the optimization process. Finally, experiments on a real machining dataset of thin-walled parts show that compared with the traditional deep reinforcement learning algorithm, the optimization effect of the proposed framework is improved by 9 %−12 %, and the standard deviation is decreased by 9 % −18 %.

Policy manifold generation for multi-task multi-objective optimization of energy flexible machining systems

Generative Upper-Level Policy Imitation Learning with Pareto-Improvement for Energy-Efficient Advanced Machining Systems.

Analysis of Multi-Objective Optimization of Machining Allowance Distribution and Parameters for Energy Saving Strategy

Pareto Fronts of Machining Parameters for Trade-off among Energy Consumption, Cutting Force and Processing Time

Machining parameter optimization for a batch milling system using multi-task deep reinforcement learning

A Generic Multi-Objective Optimization of Machining Processes Using an End-to-End Evolutionary Algorithm

Multi-resource constrained dynamic workshop scheduling based on proximal policy optimisation

Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning

End-to-end Multi-Target Flexible Job Shop Scheduling with Deep Reinforcement Learning

Multi-agent Evolution Reinforcement Learning Method for Machining Parameters Optimization Based on Bootstrap Aggregating Graph Attention Network Simulated Environment

A modified multi-agent proximal policy optimization algorithm for multi-objective dynamic partial-re-entrant hybrid flow shop scheduling problem

Multi-objective optimisation of machining process parameters using deep learning-based data-driven genetic algorithm and TOPSIS

Integrated Optimisation of Multi-Pass Cutting Parameters and Tool Path with Hierarchical Reinforcement Learning Towards Green Manufacturing

C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front

Towards Pareto-optimal energy management in integrated energy systems: A multi-agent and multi-objective deep reinforcement learning approach

An Adaptive Gaussian Process Based Manifold Transfer Learning to Expensive Dynamic Multi-Objective Optimization.

Multi-Objective Milling Parameter Optimization Base on a Novel Differential Evolution PSO Towards Minimum Energy Consumption

Multi-agent Reinforcement Learning Method for Cutting Parameters Optimization Based on Simulation and Experiment Dual Drive Environment

PA2D-MORL: Pareto Ascent Directional Decomposition Based Multi-Objective Reinforcement Learning

Multi-objective reinforcement learning for fed-batch fermentation process control

Energy-efficient tool path generation and expansion optimisation for five-axis flank milling with meta-reinforcement learning