Abstract: Deep Reinforcement Learning (DRL) is employed to develop autonomously optimized, custom-designed heat-treatment processes that are both microstructure-sensitive and energy-efficient. Unlike conventional supervised machine learning, DRL does not rely on static neural network training from data alone; instead, a learning agent autonomously develops optimal solutions, guided by rewards and penalties, with reduced or no supervision. In our approach, a temperature-dependent Allen-Cahn model for phase transformation serves as the environment for the DRL agent, that is, the model world in which it gains experience and makes autonomous decisions. The DRL agent controls the temperature of the system, which acts as a model furnace for the heat treatment of alloys. Microstructure goals are defined for the agent based on the desired microstructure of the phases. After training, the agent can generate temperature-time profiles that take a variety of initial microstructure states to the desired final microstructure state. The agent's performance and the physical meaning of the generated heat-treatment profiles are investigated in detail. This ability to handle a variety of initial conditions paves the way for using such an approach in recycling-oriented heat-treatment process design, where the initial composition can vary from batch to batch due to impurity intrusion, and in the design of energy-efficient heat treatments. To test this hypothesis, an agent without a penalty on the total consumed energy is compared with one that accounts for energy costs. The energy-cost penalty is imposed on the agent as an additional criterion for finding the optimal temperature-time profile.
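The setup described in the abstract (an Allen-Cahn environment, a temperature action, a microstructure goal, and an optional energy penalty) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the 1D periodic grid, the dimensionless temperature in [0, 1], the linear driving force `temperature - t_eq`, the double-well and tilt functions, the phase-fraction goal, the parameter values, and the names `AllenCahnFurnace`, `proportional_policy`, and `rollout`. The hand-written proportional policy only stands in for a trained DRL agent.

```python
import numpy as np


class AllenCahnFurnace:
    """Hypothetical 1D Allen-Cahn environment; all parameters are illustrative."""

    def __init__(self, n=64, dx=1.0, dt=0.05, mobility=1.0, kappa=1.0, well=1.0,
                 t_eq=0.5, target_fraction=0.5, energy_weight=0.0,
                 substeps=20, seed=0):
        self.n, self.dx, self.dt = n, dx, dt
        self.mobility, self.kappa, self.well = mobility, kappa, well
        self.t_eq = t_eq                      # assumed equilibrium temperature
        self.target_fraction = target_fraction
        self.energy_weight = energy_weight    # 0.0 means no energy penalty
        self.substeps = substeps
        self.rng = np.random.default_rng(seed)
        self.reset()

    def reset(self, initial_fraction=0.2):
        # Noisy initial microstructure; the order parameter phi lies in [0, 1].
        noise = 0.05 * self.rng.standard_normal(self.n)
        self.phi = np.clip(initial_fraction + noise, 0.0, 1.0)
        return self.phase_fraction()

    def phase_fraction(self):
        return float(self.phi.mean())

    def step(self, temperature):
        """Hold the furnace at `temperature` (dimensionless, clipped to [0, 1])."""
        temperature = float(np.clip(temperature, 0.0, 1.0))
        drive = temperature - self.t_eq       # assumed linear driving force
        for _ in range(self.substeps):
            # Periodic finite-difference Laplacian.
            lap = (np.roll(self.phi, 1) - 2.0 * self.phi
                   + np.roll(self.phi, -1)) / self.dx ** 2
            # Derivative of the double well W * phi^2 * (1 - phi)^2.
            dwell = 2.0 * self.well * self.phi * (1.0 - self.phi) * (1.0 - 2.0 * self.phi)
            # Derivative of the tilt -drive * phi^2 * (3 - 2 * phi).
            dtilt = -drive * 6.0 * self.phi * (1.0 - self.phi)
            # Explicit Euler update of d(phi)/dt = -M * (df/dphi - kappa * lap).
            self.phi = self.phi - self.dt * self.mobility * (dwell + dtilt - self.kappa * lap)
            self.phi = np.clip(self.phi, 0.0, 1.0)
        error = abs(self.phase_fraction() - self.target_fraction)
        # Reward: closeness to the goal, minus an optional energy cost.
        reward = -error - self.energy_weight * temperature
        return self.phase_fraction(), reward


def proportional_policy(fraction, target=0.5, t_eq=0.5, gain=2.0):
    """Stand-in for a trained agent: heat when below target, cool when above."""
    return float(np.clip(t_eq + gain * (target - fraction), 0.0, 1.0))


def rollout(env, policy, n_steps=50, initial_fraction=0.2):
    """Run one episode and return the temperature-time profile and outcome."""
    fraction = env.reset(initial_fraction)
    profile, total_reward = [], 0.0
    for _ in range(n_steps):
        temperature = policy(fraction)
        fraction, reward = env.step(temperature)
        profile.append(temperature)
        total_reward += reward
    return profile, fraction, total_reward
```

Calling `rollout(AllenCahnFurnace(energy_weight=0.1), proportional_policy)` returns a temperature-time profile for one episode; comparing it with `energy_weight=0.0` mirrors, in a very reduced form, the comparison between agents with and without an energy penalty. In a real study the policy would be learned by a DRL algorithm rather than written by hand.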

Reinforcement learning with thermal fluctuations at the nanoscale

Environmental effects on emergent strategy in micro-scale multi-agent reinforcement learning

Quantum reinforcement learning in the presence of thermal dissipation

Controlling Rayleigh–Bénard convection via reinforcement learning

High-dimensional reinforcement learning for optimization and control of ultracold quantum gases

Controlling nonergodicity in quantum many-body systems by reinforcement learning

Reinforcement Learning for Multi-Scale Molecular Modeling

Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems

Finding the ground state of spin Hamiltonians with reinforcement learning

Influence of Thermostatting on Nonequilibrium Molecular Dynamics Simulations of Heat Conduction in Solids

Reinforcement Learning Approach To Thermal Transparency With Particles In Periodic Lattices

Generalized Langevin dynamics of a nanoparticle using a finite element approach: thermostating with correlated noise

Reinforcement learning in cold atom experiments

Control of quasi-equilibrium state of annular flow through reinforcement learning

Neural annealing and visualization of autoregressive neural networks in the Newman-Moore model

Re-exploring Control Strategies in a Non-Markovian Open Quantum System by Reinforcement Learning

Computational Discovery of Energy-Efficient Heat Treatment for Microstructure Design using Deep Reinforcement Learning

Loss Dynamics of Temporal Difference Reinforcement Learning

Reinforcement Learning for Molecular Dynamics Optimization: A Stochastic Pontryagin Maximum Principle Approach

Stabilising viscous extensional flows using Reinforcement Learning