Abstract:This paper is devoted to studying the average optimality in continuous-time Markov decision processes with fairly general state and action spaces. The criterion to be maximized is expected average rewards. The transition rates of underlying continuous-time jump Markov processes are allowed to be unbounded, and the reward rates may have neither upper nor lower bounds. We first provide two optimality inequalities with opposed directions, and also give suitable conditions under which the existence of solutions to the two optimality inequalities is ensured. Then, from the two optimality inequalities we prove the existence of optimal (deterministic) stationary policies by using the Dynkin formula. Moreover, we present a "semi martingale characterization" of an optimal stationary policy. Finally, we use a generalized Potlach process with control to illustrate the difference between our conditions and those in the previous literature, and then further apply our results to average optimal control problems of generalized birth-death systems, upwardly skip-free processes and two queueing systems. The approach developed in this paper is slightly different from the "optimality inequality approach" widely used in the previous literature.

New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces.

Average Optimality for Markov Decision Processes in Borel Spaces: a New Condition and Approach

Nonhomogeneous Markov Decision Processes with Borel State Space-The Average Criterion with Nonuniformly Bounded Rewards.

A Semimartingale Characterization of Average Optimal Stationary Policies for Markov Decision Processes

Another Set of Verifiable Conditions for Average Markov Decision Processes with Borel Spaces

On Average Optimality for Non-Stationary Markov Decision Processes in Borel Spaces

A Note on Optimality Conditions for Continuous-Time Markov Decision Processes with Average Cost Criterion

Average Optimality in Markov Decision Processes with Unbounded Rewards

Average Optimality For Continuous-Time Markov Decision Processes In Polish Spaces

New Discount and Average Optimality Conditions for Continuous-Time Markov Decision Processes

Risk-Sensitive Average Markov Decision Processes in General Spaces

Limiting Average Criteria for Nonstationary Markov Decision Processes

Value Iteration For Average Cost Markov Decision Processes In Borel Spaces

Markov Decision Processes with Variance Minimization: A New Condition and Approach

The Average Variance Criterion for Nonstationary MDP with Borel State Space

An Average-Value-at-risk Criterion for Markov Decision Processes with Unbounded Costs

A new strong optimality criterion for nonstationary Markov decision processes

Nonstationary Denumerable State Markov Decision Processes – with Average Variance Criterion

Constrained Semi-Markov Decision Processes with Ratio and Time Expected Average Criteria in Polish Spaces

Average cost Markov decision processes with countable state spaces

Another Set of Conditions for Markov Decision Processes with Average Sample-Path Costs