Abstract:We propose a steepest descent method to compute optimal control parameters for balancing between multiple performance objectives in stateless stochastic scheduling, wherein the scheduling decision is effected by a simple constant-time coin toss operation only. We apply our method to the scheduling of a mobile sensor's coverage time among a set of points of interest (PoIs). The coverage algorithm is guided by a Markov chain wherein the sensor at PoI i decides to go to the next PoI j with transition probability pij. We use steepest descent to compute the transition probabilities for optimal tradeoff between two performance goals concerning the distributions of per-PoI coverage times and exposure times, respectively. We also discuss how other important goals such as energy efficiency and entropy of the coverage schedule can be addressed. For computational efficiency, we show how to optimally adapt the step size in steepest descent to achieve fast convergence. However, we found that the structure of our problem is complex in that there may exist surprisingly many local optima in the solution space, causing basic steepest descent to get stuck easily at a local optimum. To solve the problem, we show how proper incorporation of noise in the search process can get us out of the local optima with high probability. We provide simulation results to verify the accuracy of our analysis, and show that our method can converge to the globally optimal control parameters under different assigned weights to the performance goals and different initial parameters.

Simulation Optimization Algorithm for SMDPs with Parameterized Randomized Stationary Policies

A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies

Performance Optimization of Semi-Markov Decision Processes with Discounted-cost Criteria.

Error bounds of optimization algorithms for semi-Markov decision processes

A Method of Parameter Optimization for Particle Swarm Optimization Based on Stochastic Processes

Parallel Optimization for Markov Control Processes Based on Performance Potentials Simulation

Performance Optimization for Countable Semi-Markov Decision Processes with Discounted-cost

Stochastic Steepest-Descent Optimization Of Multiple-Objective Mobile Sensor Coverage

Optimization Algorithms for Semi-Markov Control Processes with Average Criteria

Optimal Stationary Policies for Semi-Markov Control Processes with Discounted-Cost Criteria

Parameterized Markov Decision Process and Its Application to Service Rate Control.

Policy iteration for parameterized Markov decision processes and its application

Two-Timescale Simulation-based Algorithm for Markov Decision Process Based on Performance Potentials

Data-mechanism-driven Product Performance Optimization with Multiple Parameters under Uncertainties in Manufacturing Automation Systems

The Estimation of the Semi-Markov Processes Performance Potentials Based on Parallel Simulation

The Optimal Robust Control Policy for Uncertain Semi-Markov Control Processes

Event-based optimization for finite-horizon total-cost markov decision processes

Optimization Of Semi-Markov Switching State-Space Control Processes For Network Communication Systems

A Potential-Based Method for Finite-Stage Markov Decision Process

Potential Based Optimization Algorithm Of Constrained Markov Decision Processes

Simulation-Based optimization of singularly perturbed markov reward processes with states aggregation