Abstract:This paper is devoted to studying the average optimality in continuous-time Markov decision processes with fairly general state and action spaces. The criterion to be maximized is expected average rewards. The transition rates of underlying continuous-time jump Markov processes are allowed to be unbounded, and the reward rates may have neither upper nor lower bounds. We first provide two optimality inequalities with opposed directions, and also give suitable conditions under which the existence of solutions to the two optimality inequalities is ensured. Then, from the two optimality inequalities we prove the existence of optimal (deterministic) stationary policies by using the Dynkin formula. Moreover, we present a "semi martingale characterization" of an optimal stationary policy. Finally, we use a generalized Potlach process with control to illustrate the difference between our conditions and those in the previous literature, and then further apply our results to average optimal control problems of generalized birth-death systems, upwardly skip-free processes and two queueing systems. The approach developed in this paper is slightly different from the "optimality inequality approach" widely used in the previous literature.

Limiting Average Criteria for Nonstationary Markov Decision Processes

Nonhomogeneous Markov Decision Processes with Borel State Space-The Average Criterion with Nonuniformly Bounded Rewards.

On Average Optimality for Non-Stationary Markov Decision Processes in Borel Spaces

A new strong optimality criterion for nonstationary Markov decision processes

New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces.

Average Optimality for Markov Decision Processes in Borel Spaces: a New Condition and Approach

The Average Variance Criterion for Nonstationary MDP with Borel State Space

Nonstationary Denumerable State Markov Decision Processes – with Average Variance Criterion

Constrained Continuous-Time Markov Decision Processes with Average Criteria

Average Optimality in Markov Decision Processes with Unbounded Rewards

A Note on the Existence of Optimal Stationary Policies for Average Markov Decision Processes with Countable States

A Note on Optimality Conditions for Continuous-Time Markov Decision Processes with Average Cost Criterion

Another Set of Verifiable Conditions for Average Markov Decision Processes with Borel Spaces

Average Optimality For Continuous-Time Markov Decision Processes In Polish Spaces

Unbounded Cost Markov Decision Processes with Limsup and Liminf Average Criteria: New Conditions

Average cost Markov decision processes with countable state spaces

Nonstationary Markov Decision Processes with Risk Probability Criteria

STRONG AVERAGE OPTIMALITY FOR CONTROLLED NONHOMOGENEOUS MARKOV CHAINS*

Drift and Monotonicity Conditions for Continuous-Time Controlled Markov Chains with an Average Criterion.

A Semimartingale Characterization of Average Optimal Stationary Policies for Markov Decision Processes

Risk-sensitive Continuous-Time Markov Decision Processes with Unbounded Rates and Borel Spaces