Abstract:Event-based optimization (EBO) provides a unified framework for problems in which decisions can be made only when certain events occur. Because the event sequence usually is not Markovian, the optimal policy could depend on the entire event history, which is hard to implement in practice. So most existing studies focus on memoryless policies, which make decisions only based on the current observable events. But it remains open how to find the optimal memoryless policies in general, leaving alone to solve the EBO optimally. In this technical note, we address these two important questions for infinite-stage EBOs with finite state and action spaces and make the following three major contributions. First, we extend our previous studies on finite-stage EBOs and convert infinite-stage EBOs to partially observable Markov decision processes (POMDPs). The belief process of this POMDP is called belief-event decision process (BEDP). Under certain well-known conditions, the optimal policies of BEDPs can be achieved within stationary Markov deterministic policies. Second, assuming optimal stationary policies exist, the performance difference and derivative formulas are developed. Potentials of memoryless event-based policies are shown to be piecewise linear functions, and thus can be efficiently estimated through sample paths. Third, a potential-based approximate policy iteration algorithm is developed to obtain near-optimal memoryless policies. The convergence and performance loss bound of this algorithm are analyzed.

Event-based optimization for finite-horizon total-cost markov decision processes

Event-Based Optimization For Dispatching Policies In Material Handling Systems Of General Assembly Lines

Potential Based Optimization Algorithm Of Constrained Markov Decision Processes

Performance Optimization of Semi-Markov Decision Processes with Discounted-cost Criteria.

Error bounds of optimization algorithms for semi-Markov decision processes

Simulation Optimization Algorithm for SMDPs with Parameterized Randomized Stationary Policies

On Solving Optimal Policies for Finite-Stage Event-Based Optimization

Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action

A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies

A Potential-Based Method for Finite-Stage Markov Decision Process

Risk probability optimization of finite horizon piecewise deterministic Markov decision processes

Optimization Algorithms for Semi-Markov Control Processes with Average Criteria

Performance Optimization for Countable Semi-Markov Decision Processes with Discounted-cost

Varying Receding-Horizon Based Production Control for Hybrid Production Systems

A tutorial on event-based optimization—a new optimization framework

Generalized Parameter Estimation Method for Model-Based Real Time Optimization

On solving optimal policies for event-based dynamic programming

Distributed Continuous-Time Optimization with Scalable Adaptive Event-Based Mechanisms

Single Sample Path-Based Optimization of Markov Chains

On Solving Event-Based Optimization with Average Reward over Infinite Stages

Performance Optimization of Continuous-Time Markov Control Processes Based on Performance Potentials