Linearization of ReLU Activation Function for Neural Network-Embedded Optimization: Optimal Day-Ahead Energy Scheduling

Cunzhi Zhao, Xingpeng Li

2023-10-03

Abstract: Neural networks have been widely applied in the power system area. They can be used for better predicting input information and modeling system performance with increased accuracy. In some applications, such as battery degradation neural network-based microgrid day-ahead energy scheduling, the input features of the trained learning model are variables to be solved in optimization models that enforce limits on the output of the same learning model. This creates a neural network-embedded optimization problem; the use of nonlinear activation functions in the neural network makes such problems extremely hard to solve, if not unsolvable. To address this emerging challenge, this paper investigates different methods for linearizing the nonlinear activation functions, with a particular focus on the widely used rectified linear unit (ReLU) function. Four linearization methods tailored for the ReLU activation function are developed, analyzed, and compared in this paper. Each method employs a set of linear constraints to replace the ReLU function, effectively linearizing the optimization problem, which can overcome the computational challenges associated with the nonlinearity of the neural network model. These proposed linearization methods provide valuable tools for effectively solving optimization problems that integrate neural network models with ReLU activation functions.

Machine Learning, Systems and Control

What problem does this paper attempt to address?

This paper addresses the difficulty of solving neural network-embedded optimization problems, in which nonlinear activation functions (particularly ReLU) make the problem hard or even impossible to solve. Specifically, when a trained neural network is used to predict input information or model system performance, and the input features of that network are themselves decision variables of the optimization model, the network's nonlinear activation functions are carried into the optimization problem's constraints. To tackle this challenge, the paper develops, analyzes, and compares four linearization methods for the ReLU activation function. Each method replaces the ReLU function with a set of linear constraints, which linearizes the optimization problem and overcomes the computational difficulties caused by the nonlinearity of the neural network model. These linearization methods provide valuable tools for solving optimization problems that integrate neural network models with ReLU activation functions.
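To make "replacing ReLU with a set of linear constraints" concrete, here is a minimal Python sketch of the widely used big-M formulation. This summary does not spell out the paper's four methods, so the sketch is an illustration of the general idea and is not claimed to be one of them. The function names and the bound `big_m = 10.0` are hypothetical, and the formulation assumes the pre-activation value `x` is known to satisfy `-big_m <= x <= big_m`.

```python
# Big-M linearization of y = max(0, x), with a binary variable z:
#   y >= x
#   y >= 0
#   y <= x + big_m * (1 - z)
#   y <= big_m * z
#   z in {0, 1}
# Assumption: -big_m <= x <= big_m. All names here are illustrative.

def bigm_relu_interval(x, z, big_m):
    """Return the (lower, upper) range of y allowed by the four linear
    constraints for a fixed x and binary z, or None if infeasible."""
    lower = max(x, 0.0)                           # y >= x and y >= 0
    upper = min(x + big_m * (1 - z), big_m * z)   # the two upper bounds
    return (lower, upper) if lower <= upper else None


def relu_from_constraints(x, big_m=10.0):
    """Collect every y value the constraints allow over z in {0, 1}.
    For a valid big_m the result is the single value max(0, x)."""
    assert abs(x) <= big_m, "big_m must bound |x| for the formulation to hold"
    values = set()
    for z in (0, 1):
        interval = bigm_relu_interval(x, z, big_m)
        if interval is not None:
            values.update(interval)  # add both interval endpoints
    return values


if __name__ == "__main__":
    for x in (-3.0, 0.0, 2.5):
        print(x, relu_from_constraints(x))
```

In a mixed-integer linear program, the same four inequalities would be added once per ReLU neuron, with `x` being the neuron's affine pre-activation expression. A tighter `big_m` generally gives a stronger relaxation, which is one reason alternative linearizations are worth comparing.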

Neural Network Solution for General Nonlinear Optimization Problems

A discrete-time neural network for optimization problems with hybrid constraints.

ReLU soothes the NTK condition number and accelerates optimization for wide neural networks

A Method on Searching Better Activation Functions

Normalized Activation Function: Toward Better Convergence

Learning to Solve Optimization Problems with Hard Linear Constraints

On the Universally Optimal Activation Function for a Class of Residual Neural Networks

Neural networks with ReLU powers need less depth

Effects of the Nonlinearity in Activation Functions on the Performance of Deep Learning Models

A Method to Improve the Computational Performance of Nonlinear All-optical Diffractive Deep Neural Network Model

Understanding Multi-phase Optimization Dynamics and Rich Nonlinear Behaviors of ReLU Networks

Optimizing over an ensemble of neural networks

Optimal function approximation with ReLU neural networks

Neural networks with trainable matrix activation functions

Activation function optimization method: Learnable series linear units (LSLUs)

Natural-Logarithm-Rectified Activation Function in Convolutional Neural Networks

Compelling ReLU Networks to Exhibit Exponentially Many Linear Regions at Initialization and During Training

A New Chaotic Neural Network Algorithm for “0-1” Optimization Problems Based Non-monotonous Activation Function

Active ReLU Linearized Neural Network based Frequency-Constrained Unit Commitment in Low-Inertia Power Systems

Neural Networks for Power Management Optimal Strategy in Hybrid Microgrid