Abstract: This paper studies continuous-time controlled Markov chains, that is, continuous-time Markov decision processes with a denumerable state space, under the discounted cost criterion. The cost and transition rates are allowed to be unbounded, and the action set is a Borel space. We first study control problems in the class of deterministic stationary policies and give very weak conditions under which the existence of ε-optimal (ε ≥ 0) policies is proved using the construction of a minimum Q-process. We then consider control problems in the class of randomized Markov policies for (1) regular and (2) nonregular Q-processes. To study case (1), we first present a new necessary and sufficient condition for a nonhomogeneous Q-process to be regular. This regularity condition, together with the extended generator of a nonhomogeneous Markov process, is used to prove the existence of ε-optimal stationary policies. Our results for case (1) are illustrated by a Schlögl model with a controlled diffusion. For case (2), we obtain a similar result using Kolmogorov's forward equation for the minimum Q-process, and we also present an example in which our assumptions are satisfied but those used in the previous literature fail to hold.

Continuous-Time Markov Decision Processes with State-Dependent Discount Factors

Continuous-Time Markov Decision Processes with Unbounded Transition and Discounted-Reward Rates

Continuous-Time Markov Decision Processes with Discounted Rewards: the Case of Polish Spaces

Discounted Optimality for Continuous-Time Markov Decision Processes in Polish Spaces

New Discount and Average Optimality Conditions for Continuous-Time Markov Decision Processes

Discounted Continuous-Time Constrained Markov Decision Processes in Polish Spaces

Continuous Time Markov Decision Processes with Expected Discounted Total Rewards

Denumerable-State Continuous-Time Markov Decision Processes with Unbounded Transition and Reward Rates under the Discounted Criterion

Constrained Continuous-Time Markov Control Processes with Discounted Criteria

Markov Decision Processes with State-Dependent Discount Factors and Unbounded Rewards/Costs

Average Optimality for Continuous-Time Markov Decision Processes in Polish Spaces

First Passage Optimality for Continuous-Time Markov Decision Processes with Varying Discount Factors and History-Dependent Policies

Risk-Sensitive Discounted Continuous-Time Markov Decision Processes with Unbounded Rates

Constrained Continuous-Time Markov Decision Processes with Average Criteria

Absorbing Continuous-Time Markov Decision Processes with Total Cost Criteria

Discounted Continuous-Time Markov Decision Processes with Constraints: Unbounded Transition and Loss Rates

Continuous Time Markov Decision Processes with Nonuniformly Bounded Transition Rate: Expected Total Rewards

Risk-Sensitive Continuous-Time Markov Decision Processes with Unbounded Rates and Borel Spaces

Continuous-Time Controlled Markov Chains with Discounted Rewards

Linear Programming and Constrained Average Optimality for General Continuous-Time Markov Decision Processes in History-Dependent Policies

Continuous-Time Controlled Markov Chains