Attention-based multiple-instance learning for Pediatric bone age assessment with efficient and interpretable

Chong Wang, Yang Wu, Chen Wang, Xuezhi Zhou, Yanxiang Niu, Yu Zhu, Xudong Gao, Chang Wang, Yi Yu
DOI: https://doi.org/10.1016/j.bspc.2022.104028
IF: 5.1
2023-01-01
Biomedical Signal Processing and Control
Abstract: Pediatric bone age assessment (BAA) is a common clinical technique for evaluating endocrine, genetic, and growth disorders in children. However, deep learning BAA methods based on global images neglect fine-grained details, while methods based on regions of interest (ROIs) require additional annotation and complex processing. To overcome these shortcomings, we propose an interpretable deep learning architecture based on multiple-instance learning that addresses BAA efficiently without additional annotations. We crop the entire image into small patches and obtain patch features with a feature extraction network. An attention backbone then ranks the feature vectors of the entire image and aggregates their information according to their relative importance. Finally, each image's aggregated features and the patient's gender are combined to predict bone age. The proposed method identifies ROIs through attention-based multiple-instance aggregation without additional labels and produces interpretable heatmaps. Moreover, by cropping the complete image into patches and reducing the dimensionality, the proposed model captures fine-grained image information and trains faster. We validated the proposed method on the Radiological Society of North America (RSNA) 2017 dataset, where it achieved a mean absolute error (MAE) of 4.17 months. Furthermore, the visualization results indicate that the proposed model is highly interpretable and can localize ROIs without spatial labeling. In conclusion, we have developed a novel method for high-performance, interpretable bone age prediction without additional manual annotations, which can be used to assess pediatric bone age effectively.
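The pipeline described in the abstract (patch features, attention-weighted aggregation, then fusion with gender) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the tanh-based attention scoring, the linear regression head, the random parameters, and all names (`attention_mil_pool`, `predict_bone_age`, `V`, `w`, `beta`, `bias`) are assumptions made for illustration, and the patch features are random stand-ins for the output of a real feature extraction network.

```python
import numpy as np

def attention_mil_pool(patch_feats, V, w):
    """Aggregate patch features into one image-level vector.

    patch_feats: (n_patches, d) features, one row per image patch.
    V: (d, h) and w: (h,) are hypothetical attention parameters.
    Returns the pooled (d,) vector and the (n_patches,) attention weights.
    """
    scores = np.tanh(patch_feats @ V) @ w        # one score per patch
    scores = scores - scores.max()               # stabilize the softmax
    weights = np.exp(scores) / np.exp(scores).sum()
    pooled = weights @ patch_feats               # attention-weighted sum
    return pooled, weights

def predict_bone_age(patch_feats, is_male, params):
    """Fuse the pooled image feature with gender and regress bone age.

    The linear head is an assumption; the abstract only states that
    image features and gender are aggregated for the prediction.
    """
    pooled, weights = attention_mil_pool(patch_feats, params["V"], params["w"])
    fused = np.concatenate([pooled, [float(is_male)]])
    age_months = float(fused @ params["beta"] + params["bias"])
    return age_months, weights

# Toy run with random stand-in features and untrained parameters.
rng = np.random.default_rng(0)
n_patches, d, h = 16, 8, 4
params = {
    "V": rng.normal(size=(d, h)),
    "w": rng.normal(size=h),
    "beta": rng.normal(size=d + 1),
    "bias": 0.0,
}
feats = rng.normal(size=(n_patches, d))
age_months, weights = predict_bone_age(feats, True, params)
```

In a setup like this, reshaping `weights` back onto the patch grid would give the kind of heatmap the abstract mentions, since each weight reflects how much one patch contributed to the pooled feature.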
engineering, biomedical