Abstract: Objectives: Leveraging artificial intelligence (AI) in conjunction with electronic health records (EHRs) holds transformative potential to improve healthcare. However, bias in AI risks worsening healthcare disparities and cannot be overlooked. This study reviews methods to detect and mitigate diverse forms of bias in AI models developed using EHR data. Methods: We conducted a systematic review following the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines, analyzing articles from PubMed, Web of Science, and IEEE published between January 1, 2010, and December 17, 2023. The review identified key biases, outlined strategies for detecting and mitigating bias throughout the AI model development process, and analyzed metrics for bias assessment. Results: Of the 450 articles retrieved, 20 met our criteria, revealing six major bias types: algorithmic, confounding, implicit, measurement, selection, and temporal. The AI models were primarily developed for predictive tasks in healthcare settings. Four studies concentrated on the detection of implicit and algorithmic biases, employing fairness metrics such as statistical parity, equal opportunity, and predictive equity. Sixteen studies proposed various strategies for mitigating biases, especially implicit and selection biases. These strategies, evaluated through both performance metrics (e.g., accuracy, AUROC) and fairness metrics, predominantly involved data collection and preprocessing techniques such as resampling, reweighting, and transformation. Discussion: This review highlights the varied and evolving nature of strategies to address bias in EHR-based AI models, emphasizing the urgent need for standardized, generalizable, and interpretable methodologies to foster the creation of ethical AI systems that promote fairness and equity in healthcare.
Artificial Intelligence, Computers and Society, Machine Learning, Quantitative Methods
What problem does this paper attempt to address?
The paper addresses bias in artificial intelligence (AI) models built from electronic health record (EHR) data. Specifically, the study systematically reviews and synthesizes the current literature on bias identification, assessment, and mitigation strategies for such models, focusing on the main types of bias and how they are handled throughout the model development cycle. Through this research, the authors hope to improve the understanding of AI bias management and to propose research directions that reduce the potential of AI to worsen healthcare inequalities.
### Main Issues:
1. **Bias Identification**: How can different types of bias in AI models constructed from EHR data be identified effectively?
2. **Bias Assessment**: How can the impact of these biases on healthcare inequalities be assessed?
3. **Bias Mitigation**: Which strategies are effective in mitigating these biases?
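
To make the mitigation question concrete: the abstract names reweighting as one of the preprocessing techniques used in the reviewed studies. The following is a minimal sketch of one well-known form of reweighting, the reweighing scheme of Kamiran and Calders, in which each sample receives the weight P(group) · P(label) / P(group, label). It is an illustration under stated assumptions, not the method of any particular reviewed study: the function name `reweighing_weights`, the binary group coding, and the toy data are all hypothetical.

```python
from collections import Counter


def reweighing_weights(group, label):
    """Per-sample weights w = P(group) * P(label) / P(group, label).

    After weighting, group membership and label are statistically
    independent in the (weighted) training data.
    """
    n = len(label)
    group_count = Counter(group)              # samples per group
    label_count = Counter(label)              # samples per label
    joint_count = Counter(zip(group, label))  # samples per (group, label) cell
    return [
        (group_count[g] * label_count[y]) / (n * joint_count[(g, y)])
        for g, y in zip(group, label)
    ]


def weighted_positive_rate(group, label, weights, g):
    """Weighted share of positive labels within group g."""
    total = sum(w for gi, w in zip(group, weights) if gi == g)
    positive = sum(
        w for gi, yi, w in zip(group, label, weights) if gi == g and yi == 1
    )
    return positive / total


# Hypothetical toy data: group 1 rarely has the positive label, group 0 often does.
group = [1, 1, 1, 1, 0, 0, 0, 0]
label = [1, 0, 0, 0, 1, 1, 1, 0]

weights = reweighing_weights(group, label)
rate_group_1 = weighted_positive_rate(group, label, weights, 1)
rate_group_0 = weighted_positive_rate(group, label, weights, 0)
```

The resulting weights would typically be passed to a learner that accepts per-sample weights. Under-represented (group, label) combinations are up-weighted, so the weighted positive rate is the same in both groups.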
### Background and Significance:
- **AI in Healthcare**: The combination of AI and EHR data has revolutionary potential in medical research and clinical decision support (CDS).
- **Challenges of Bias**: Bias in EHR data and AI models can lead to analytical errors and biased outcomes, exacerbating healthcare inequalities.
- **Types of Bias**: The study identifies six main types of bias: algorithmic bias, confounding bias, implicit bias, measurement bias, selection bias, and temporal bias.
- **Limitations of Existing Research**: Although some review studies have focused on bias in medical AI, there is relatively little research specifically addressing bias in AI models constructed from EHR data.
### Research Objectives:
- **Systematic Review**: Systematically review the existing literature on bias in AI models constructed from EHR data.
- **Comprehensive Analysis**: Conduct a comprehensive analysis of bias identification, assessment, and mitigation strategies.
- **Propose Recommendations**: Propose standardized, interpretable, and generalizable methodological frameworks to ensure fairness in medical AI.
### Methods:
- **Data Sources and Search**: Following PRISMA guidelines, retrieve relevant articles published from January 1, 2010, to December 17, 2023, from PubMed, Web of Science, and IEEE databases.
- **Inclusion and Exclusion Criteria**: Include English-language articles with available metadata and full text that focus on AI models constructed from EHR data and describe their bias handling methods in detail.
- **Article Screening Process**: Conduct initial screening through titles and abstracts, followed by full-text review, ultimately selecting 20 articles for analysis.
- **Data Extraction**: Extract bibliographic data, AI model information, and bias/fairness information from relevant literature.
- **Bias Type Classification**: Classify bias types through a structured two-step process, combining existing literature and established bias risk assessment tools.
### Results:
- **Types of Bias**: Identified six main types of bias: implicit bias, selection bias, measurement bias, confounding bias, algorithmic bias, and temporal bias.
- **Research Trends**: The number of published articles has gradually increased since 2014, with a significant rise in 2022 and 2023.
- **Task Classification**: The main tasks of AI models constructed from EHR data include disease diagnosis or risk prediction, treatment effect or disease progression prediction, and mortality or survival prediction.
- **Bias Assessment Metrics**: Various performance and fairness metrics were used to assess bias in the studies.
- **Bias Detection and Mitigation**: Most studies focused on detecting and mitigating implicit bias and selection bias, with fewer studies addressing other types of bias.
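
Two of the fairness metrics named in the abstract, statistical parity and equal opportunity, can each be expressed as a difference between groups: the first compares positive prediction rates, the second compares true positive rates. The sketch below computes both on made-up toy data. The function names, the binary group coding (1 for a protected group, 0 for a reference group), and the data are assumptions for illustration only and are not drawn from the reviewed studies.

```python
def selection_rate(y_pred, group, g):
    """Share of positive predictions among members of group g."""
    preds = [p for p, gi in zip(y_pred, group) if gi == g]
    return sum(preds) / len(preds)


def true_positive_rate(y_true, y_pred, group, g):
    """Share of truly positive members of group g that are predicted positive."""
    preds = [p for t, p, gi in zip(y_true, y_pred, group) if gi == g and t == 1]
    return sum(preds) / len(preds)


def statistical_parity_difference(y_pred, group):
    """P(pred = 1 | group = 1) - P(pred = 1 | group = 0); 0 means parity."""
    return selection_rate(y_pred, group, 1) - selection_rate(y_pred, group, 0)


def equal_opportunity_difference(y_true, y_pred, group):
    """TPR(group = 1) - TPR(group = 0); 0 means equal opportunity."""
    return (true_positive_rate(y_true, y_pred, group, 1)
            - true_positive_rate(y_true, y_pred, group, 0))


# Hypothetical toy data: 1 = protected group, 0 = reference group.
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 0, 1, 1, 1, 0]
group = [1, 1, 1, 1, 0, 0, 0, 0]

spd = statistical_parity_difference(y_pred, group)         # 0.25 - 0.75 = -0.5
eod = equal_opportunity_difference(y_true, y_pred, group)  # 0.5 - 1.0 = -0.5
```

Negative values here indicate that the protected group receives fewer positive predictions, and fewer correct positive predictions, than the reference group. How large a gap is acceptable depends on the clinical context.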
### Discussion:
- **Impact of Bias**: Bias can severely affect the performance and fairness of AI models, exacerbating healthcare inequalities.
- **Methodological Framework**: There is a need for standardized, generalizable, and interpretable methodological frameworks to ensure fairness in medical AI.
- **Future Research Directions**: Future research should aim to address multiple types of bias simultaneously and develop more sophisticated and detailed bias detection techniques.
Through this systematic review, the study emphasizes the importance of managing and mitigating bias in AI models constructed from EHR data, providing important references and guidance for promoting healthcare equity.