Target specification bias, counterfactual prediction, and algorithmic fairness in healthcare

Eran Tal
DOI: https://doi.org/10.1145/3600211.3604678
2023-08-04
Abstract:Bias in applications of machine learning (ML) to healthcare is usually attributed to unrepresentative or incomplete data, or to underlying health disparities. This article identifies a more pervasive source of bias that affects the clinical utility of ML-enabled prediction tools: target specification bias. Target specification bias arises when the operationalization of the target variable does not match its definition by decision makers. The mismatch is often subtle, and stems from the fact that decision makers are typically interested in predicting the outcomes of counterfactual, rather than actual, healthcare scenarios. Target specification bias persists independently of data limitations and health disparities. When left uncorrected, it gives rise to an overestimation of predictive accuracy, to inefficient utilization of medical resources, and to suboptimal decisions that can harm patients. Recent work in metrology - the science of measurement - suggests ways of counteracting target specification bias and avoiding its harmful consequences.
Machine Learning,Computers and Society,Methodology
What problem does this paper attempt to address?
This paper attempts to address the issue of target specification bias in the application of machine learning (ML) in the healthcare field. Specifically, this bias arises from the mismatch between the prediction targets that decision-makers (such as doctors) care about and the data labels actually used to train and test ML models. This mismatch is often due to decision-makers wanting to predict outcomes in counterfactual scenarios, while data labels can only reflect outcomes in actual scenarios. ### Background of the Paper - **Target Specification Bias**: Target specification bias occurs when the operational definition of the target variable is inconsistent with the decision-maker's definition. - **Counterfactual Prediction**: Decision-makers are usually concerned with outcomes under hypothetical conditions (i.e., counterfactual scenarios) rather than actual outcomes. - **Algorithm Fairness**: This bias not only affects the predictive accuracy of the model but can also lead to inefficient use of medical resources and adverse decisions for patients, thereby impacting the fairness and safety of the algorithm. ### Main Issues 1. **Limitations of Predictive Accuracy Assessment**: Current ML models typically assess predictive accuracy based on the concept of label matching, but this approach overlooks the issue of target specification bias. 2. **Clinical Usability of Decision Support Tools**: To make ML models clinically useful, they need to predict health outcomes or diagnostic results in counterfactual scenarios, not just actual scenarios. 3. **Fairness and Safety**: Target specification bias may cause the model to perform poorly in certain groups, affecting the fairness and safety of medical decisions. ### Solutions - **Introducing Metrology Concepts**: The authors draw on concepts from metrology to propose a broader concept of benchmark accuracy that considers not only label quality but also target specification bias. - **Improving Model Evaluation Methods**: By identifying and correcting target specification bias, the predictive accuracy of the model can be improved, potential harm to patients can be reduced, and the fairness and safety of medical decisions can be enhanced. ### Conclusion - **Importance**: Target specification bias is a pervasive issue that needs to be addressed in the application of machine learning in the healthcare field. - **Future Directions**: Future research should focus on better defining and operationalizing target variables to reduce target specification bias and improve the clinical usability and fairness of models.