Abstract: Solving vision problems often entails searching a solution space for the optimal state(s) with maximum Bayesian posterior probability or minimum energy. When the volume of the space is huge, exhaustive search becomes infeasible. Generic stochastic search (e.g. Markov chain Monte Carlo) can be even worse than exhaustive search, as it may visit a state repeatedly. To expedite the Markov chain search, one may use heuristics as proposal probabilities to guide the search toward promising portions of the space. Empirically, the recent data-driven Markov chain Monte Carlo (DDMCMC) scheme [14, 12, 2] achieves fast search in a number of vision tasks, which is intuitively justified by two observations: (i) the posterior probabilities in vision tasks often have very low entropy and are thus narrowly focused on a tiny portion of the state space; (ii) the proposal probability computed by bottom-up methods can approximate the posterior well. In this paper, we study an independent Metropolis sampler, which is often used in designing components of complex MCMC algorithms. We obtain an analytic formula for the expected time to first hit a certain state (the "first hitting time"), as well as very tight lower and upper bounds that depend on the total variation between the target posterior and the heuristic probabilities. These results show, though in humble cases, that one can indeed reach optimal solutions in very few steps with good proposal probabilities, regardless of the size of the original search space. This result differs from previous analyses of the Markov chain convergence rate, which is bounded by the second largest eigenvalue (in modulus) and often corresponds to the worst case over the entire search space. In comparison, our analysis bears more relevance to the optimization tasks in vision.

Keywords: Markov Chain Monte Carlo, First Hitting Time, Convergence Rate, Inde-
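
The claim that a good proposal probability shortens the search can be illustrated with a small simulation. The following is a minimal sketch, not the paper's analytic formula or its bounds: it runs an independence Metropolis chain on a toy finite state space and estimates the mean number of steps until a target state is first visited. The names `first_hitting_time` and `mean_hitting_time`, the toy distributions `p`, `q_good` and `q_uniform`, the choice to draw the initial state from the proposal, and the convention that the hitting time is 0 when the chain starts at the target are all assumptions made for illustration.

```python
import random
from itertools import accumulate


def first_hitting_time(p, q, target, rng, max_steps=100_000):
    """Steps until an independence Metropolis chain first visits `target`.

    p: target (posterior) probabilities, assumed strictly positive.
    q: proposal (heuristic) probabilities, assumed strictly positive.
    Returns max_steps if the target was not reached (censored run).
    """
    states = range(len(p))
    q_cum = list(accumulate(q))  # cumulative weights for fast sampling
    # Assumption: the initial state is drawn from the proposal q.
    x = rng.choices(states, cum_weights=q_cum)[0]
    steps = 0
    while x != target and steps < max_steps:
        # The proposal is independent of the current state x.
        y = rng.choices(states, cum_weights=q_cum)[0]
        # Metropolis-Hastings acceptance ratio for an independence proposal.
        accept = min(1.0, (p[y] * q[x]) / (p[x] * q[y]))
        if rng.random() < accept:
            x = y
        steps += 1
    return steps


def mean_hitting_time(p, q, target, trials=300, seed=0):
    """Monte Carlo estimate of the expected first hitting time of `target`."""
    rng = random.Random(seed)
    total = sum(first_hitting_time(p, q, target, rng) for _ in range(trials))
    return total / trials


# Toy setup (hypothetical numbers): a low-entropy posterior peaked at state 0.
n = 200
target = 0
p = [0.9] + [0.1 / (n - 1)] * (n - 1)
q_good = [0.8] + [0.2 / (n - 1)] * (n - 1)  # heuristic close to the posterior
q_uniform = [1.0 / n] * n                   # uninformed proposal

mean_good = mean_hitting_time(p, q_good, target)
mean_uniform = mean_hitting_time(p, q_uniform, target)
print("informed proposal:", mean_good)
print("uniform proposal: ", mean_uniform)
```

In this toy setup the informed proposal should reach the target within about a step on average, while the uniform proposal needs on the order of `n` steps, which is consistent with the qualitative message of the abstract. The exact numbers depend on the seed and on the assumed distributions.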

Single Sample Path-Based Optimization of Markov Chains

Path Integral Methods with Stochastic Control Barrier Functions

Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games

Improving sample efficiency of high dimensional Bayesian optimization with MCMC

How Do Heuristics Expedite Markov Chain Search

A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies

Optimal Sample Complexity for Average Reward Markov Decision Processes

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation

A safe exploration approach to constrained Markov decision processes

Event-based optimization for finite-horizon total-cost Markov decision processes

Settling the Sample Complexity of Model-Based Offline Reinforcement Learning

Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action

Model Predictive Optimized Path Integral Strategies

How Do Heuristics Expedite Markov Chain Search? Hitting-time Analysis of the Independence Metropolis Sampler

Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning

Achieving $\tilde{O}(1/\epsilon)$ Sample Complexity for Constrained Markov Decision Process

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

Simulation Optimization Algorithm for SMDPs with Parameterized Randomized Stationary Policies

Deterministic Policy Optimization by Combining Pathwise and Score Function Estimators for Discrete Action Spaces

Potential Based Optimization Algorithm Of Constrained Markov Decision Processes

Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs