Abstract:In target tracking with mobile multi-sensor systems, sensor deployment impacts the observation capabilities and the resulting state estimation quality. Based on a partially observable Markov decision process (POMDP) formulation comprised of the observable sensor dynamics, unobservable target states, and accompanying observation laws, we present a distributed information-driven solution approach to the multi-agent target tracking problem, namely, sequential multi-agent nominal belief-state optimization (SMA-NBO). SMA-NBO seeks to minimize the expected tracking error via receding horizon control including a heuristic expected cost-to-go (HECTG). SMA-NBO incorporates a computationally efficient approximation of the target belief-state over the horizon. The agent-by-agent decision-making is capable of leveraging on-board (edge) compute for selecting (sub-optimal) target-tracking maneuvers exhibiting non-myopic cooperative fleet behavior. The optimization problem explicitly incorporates semantic information defining target occlusions from a world model. To illustrate the efficacy of our approach, a random occlusion forest environment is simulated. SMA-NBO is compared to other baseline approaches. The simulation results show SMA-NBO 1) maintains tracking performance and reduces the computational cost by replacing the calculation of the expected target trajectory with a single sample trajectory based on maximum a posteriori estimation; 2) generates cooperative fleet decision by sequentially optimizing single-agent policy with efficient usage of other agents' policy of intent; 3) aptly incorporates the multiple weighted trace penalty (MWTP) HECTG, which improves tracking performance with a computationally efficient heuristic.

Nonlinear POMDPs for Active State Tracking with Sensing Costs

Non-Myopic Target Tracking Strategies for State-Dependent Noise

Distributed Multi-Sensor Control for Multi-Target Tracking With a Sparsity-Promoting Objective Function

Adaptive Sensor Scheduling Algorithm for Target Tracking in Wireless Sensor Networks

Track-MDP: Reinforcement Learning for Target Tracking with Controlled Sensing

Decomposed POMDP Optimization-Based Sensor Management for Multi-Target Tracking in Passive Multi-Sensor Systems

Dynamic target tracking with integration of communication and coverage using mobile sensors

Cooperative Tracking Control for Nonlinear MASs under Event-Triggered Communication.

Sparse Sensing Architectures with Optimal Precision for Tracking Multi-agent Systems in Sensing-denied Environments

Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes

Data-Efficient Off-Policy Learning for Distributed Optimal Tracking Control of HMAS with Unidentified Exosystem Dynamics.

SMA-NBO: A Sequential Multi-Agent Planning with Nominal Belief-State Optimization in Target Tracking

Sensor Management for Tracking in Sensor Networks

Non-Myopic Sensor Control for Target Search and Track Using a Sample-Based GOSPA Implementation

Event-Triggered Optimal Tracking Control Design with DHP Formulation for Discrete-Time Nonlinear Nonzero-Sum Games

Online Markov decision processes with Kullback-Leibler control cost

Potential Game-Based Non-Myopic Sensor Network Planning for Multi-Target Tracking

OCMDP: Observation-Constrained Markov Decision Process

Multi-Sensor Management for Multi-Target Tracking Using Mutual Information

Multisensor Multiobject Tracking with Improved Sampling Efficiency