Abstract:Time-inconsistency refers to a paradox in decision making where agents exhibit inconsistent behaviors over time. Examples are procrastination where agents tends to costly postpone easy tasks, and abandonments where agents start a plan and quit in the middle. These behaviors are undesirable in the sense that agents make clearly suboptimal decisions over optimal ones. To capture such behaviors and more importantly, to quantify inefficiency caused by such behaviors, [Kleinberg & Oren 2014] propose a graph model which is essentially same as the standard planning model except for the cost structure. Using this model, they initiate the study of several interesting problems: 1) cost ratio: the worst ratio between the actual cost of the agent and the optimal cost, over all graph instances; 2) motivating subgraph: how to motivate the agent to reach the goal by deleting nodes and edges; 3) Intermediate rewards: how to motivate agents to reach the goal by placing intermediate rewards. Kleinberg and Oren give partial answers to these questions, but the main problems are still open. In fact, they raise these problems explicitly as open problems in their paper. In this paper, we give answers to all three open problems in [Kleinberg & Oren 2014]. First, we show a tight upper bound of cost ratio for graphs without Akerlof's structure, thus confirm the conjecture by Kleinberg and Oren that Akerlof's structure is indeed the worst case for cost ratio. Second, we prove that finding a motivating subgraph is NP-hard, showing that it is generally inefficient to motivate agents by deleting nodes and edges in the graph. Last but not least, we show that computing a strategy to place minimum amount of total reward is also NP-hard. Therefore, it is computational inefficient to motivate agents by placing intermediate rewards. The techniques we use to prove these results are nontrivial and of independent interests.

Planning with General Objective Functions: Going Beyond Total Rewards

Planning with Submodular Objective Functions

Toward Discovering Options that Achieve Faster Planning

Intention as Hierarchical Constraints in Human Planning

Decision-Theoretic Planning with non-Markovian Rewards

Game-theoretic Objective Space Planning

Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles

Cost-Optimal Algorithms for Planning with Procedural Control Knowledge

Symbolic Generalization for On-line Planning

Computational issues in time-inconsistent planning

Approximate Resolution of Stochastic Choice-based Discrete Planning

Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming

Optimality and robustness in path-planning under initial uncertainty

Hierarchical planning-scheduling-control -- Optimality surrogates and derivative-free optimization

Efficient and Reconfigurable Optimal Planning in Large-Scale Systems Using Hierarchical Finite State Machines

The Update-Equivalence Framework for Decision-Time Planning

Graph Planning with Expected Finite Horizon

Multiple stage stochastic linear programming with multiple objectives: flexible decision making

Multi-Robot Planning on Dynamic Topological Graphs using Mixed-Integer Programming

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation.