Evaluating the Performance and Bias of Natural Language Processing Tools in Labeling Chest Radiograph Reports

Samantha M Santomartino, John R Zech, Kent Hall, Jean Jeudy, Vishwa Parekh, Paul H Yi
DOI: https://doi.org/10.1148/radiol.232746
IF: 19.7
2024-10-23
Radiology
Abstract: Background: Natural language processing (NLP) is commonly used to annotate radiology datasets for training deep learning (DL) models. However, the accuracy and potential biases of these NLP methods have not been thoroughly investigated, particularly across different demographic groups. Purpose: To evaluate the accuracy and demographic bias of four NLP radiology report labeling tools on two chest radiograph datasets. Materials and Methods: This retrospective study, performed between April 2022 and...
radiology, nuclear medicine & medical imaging