Abstract: In consequential domains, it is often impossible to compel individuals to take treatment, so optimal policy rules are merely suggestions in the presence of human non-adherence to treatment recommendations. Under heterogeneity, covariates may predict both take-up of treatment and final outcomes, but in different ways. While optimal treatment rules optimize causal outcomes across the population, access parity constraints or other fairness considerations on who receives treatment can be important. For example, in social services, a persistent puzzle is the gap in take-up of beneficial services among those who may benefit from them the most. We study causal identification and robust estimation of optimal treatment rules, including under potential violations of positivity. We consider fairness constraints such as demographic parity in treatment take-up, and other constraints, via constrained optimization. Our framework can be extended to handle algorithmic recommendations under an often-reasonable covariate-conditional exclusion restriction, using our robustness checks for lack of positivity in the recommendation. We develop a two-stage algorithm for solving over parametrized policy classes under general constraints to obtain variance-sensitive regret bounds. We illustrate the methods in three case studies based on data from reminders of SNAP benefits recertification, randomized encouragement to enroll in insurance, and pretrial supervised release with electronic monitoring. While the specific remedy to inequities in algorithmic allocation is context-specific, it requires studying both take-up of decisions and their downstream outcomes.

Treatment recommendation with distributional targets

Optimal sequential treatment allocation

Policy Learning with Distributional Welfare

Regularizing Discrimination in Optimal Policy Learning with Distributional Targets

Estimation of Optimal Dynamic Treatment Assignment Rules under Policy Constraints

Functional Sequential Treatment Allocation with Covariates

Individualized Treatment Allocation in Sequential Network Games

Optimal Dynamic Treatment Regimes and Partial Welfare Ordering

Set-valued dynamic treatment regimes for competing outcomes

Policy Targeting under Network Interference

Dynamically Optimal Treatment Allocation

Policy Learning with New Treatments

A Robust Method for Estimating Optimal Treatment Regimes

Treatment Allocation with Strategic Agents

Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions

Decision Theory for Treatment Choice Problems with Partial Identification

Stochastic Treatment Choice with Empirical Welfare Updating

Optimal and Fair Encouragement Policy Evaluation and Learning

Optimal Treatment Allocation under Constraints

Experimenting on Markov Decision Processes with Local Treatments

Treatment Allocation under Uncertain Costs