Abstract:In the last few years, the trend in health care of embracing artificial intelligence (AI) has dramatically changed the medical landscape. Medical centres have adopted AI applications to increase the accuracy of disease diagnosis and mitigate health risks. AI applications have changed rules and policies related to healthcare practice and work ethics. However, building trustworthy and explainable AI (XAI) in healthcare systems is still in its early stages. Specifically, the European Union has stated that AI must be human-centred and trustworthy, whereas in the healthcare sector, low methodological quality and high bias risk have become major concerns. This study endeavours to offer a systematic review of the trustworthiness and explainability of AI applications in healthcare, incorporating the assessment of quality, bias risk, and data fusion to supplement previous studies and provide more accurate and definitive findings. Likewise, 64 recent contributions on the trustworthiness of AI in healthcare from multiple databases (i.e., ScienceDirect, Scopus, Web of Science, and IEEE Xplore) were identified using a rigorous literature search method and selection criteria. The considered papers were categorised into a coherent and systematic classification including seven categories: explainable robotics, prediction, decision support, blockchain, transparency, digital health, and review. In this paper, we have presented a systematic and comprehensive analysis of earlier studies and opened the door to potential future studies by discussing in depth the challenges, motivations, and recommendations. In this study a systematic science mapping analysis in order to reorganise and summarise the results of earlier studies to address the issues of trustworthiness and objectivity was also performed. Moreover, this work has provided decisive evidence for the trustworthiness of AI in health care by presenting eight current state-of-the-art critical analyses regarding those more relevant research gaps. In addition, to the best of our knowledge, this study is the first to investigate the feasibility of utilising trustworthy and XAI applications in healthcare, by incorporating data fusion techniques and connecting various important pieces of information from available healthcare datasets and AI algorithms. The analysis of the revised contributions revealed crucial implications for academics and practitioners, and then potential methodological aspects to enhance the trustworthiness of AI applications in the medical sector were reviewed. Successively, the theoretical concept and current use of 17 XAI methods in health care were addressed. Finally, several objectives and guidelines were provided to policymakers to establish electronic health-care systems focused on achieving relevant features such as legitimacy, morality, and robustness. Several types of information fusion in healthcare were focused on in this study, including data, feature, image, decision, multimodal, hybrid, and temporal.

The METRIC-framework for assessing data quality for trustworthy AI in medicine: a systematic review

The METRIC-framework for assessing data quality for trustworthy AI in medicine: a systematic review

Can I trust my fake data – A comprehensive quality assessment framework for synthetic tabular data in healthcare

Can I trust my fake data -- A comprehensive quality assessment framework for synthetic tabular data in healthcare

Development and Validation of ML-DQA -- a Machine Learning Data Quality Assurance Framework for Healthcare

Designing an ML Auditing Criteria Catalog as Starting Point for the Development of a Framework

A clinician's guide to understanding and critically appraising machine learning studies: a checklist for Ruling Out Bias Using Standard Tools in Machine Learning (ROBUST-ML)

On evaluation metrics for medical applications of artificial intelligence

Guidelines and Standard Frameworks for Artificial Intelligence in Medicine: A Systematic Review

A Systematic Review of Trustworthy and Explainable Artificial Intelligence in Healthcare: Assessment of Quality, Bias Risk, and Data Fusion

Designing Interpretable ML System to Enhance Trust in Healthcare: A Systematic Review to Proposed Responsible Clinician-AI-Collaboration Framework

Perceptions of Data Set Experts on Important Characteristics of Health Data Sets Ready for Machine Learning: A Qualitative Study

Diagnostic quality model (DQM): an integrated framework for the assessment of diagnostic quality when using AI/ML

Requirements and reliability of AI in the medical context

Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness

Statistical Learning to Operationalize a Domain Agnostic Data Quality Scoring

Trustworthy clinical AI solutions: a unified review of uncertainty quantification in deep learning models for medical image analysis

Explainable and interpretable artificial intelligence in medicine: a systematic bibliometric review

APPRAISE-AI Tool for Quantitative Evaluation of AI Studies for Clinical Decision Support

Medical deep learning-A systematic meta-review

Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review