RATCHET: Medical Transformer for Chest X-ray Diagnosis and Reporting

Benjamin Hou,Georgios Kaissis,Ronald Summers,Bernhard Kainz
DOI: https://doi.org/10.48550/arXiv.2107.02104
2021-09-15
Abstract:Chest radiographs are one of the most common diagnostic modalities in clinical routine. It can be done cheaply, requires minimal equipment, and the image can be diagnosed by every radiologists. However, the number of chest radiographs obtained on a daily basis can easily overwhelm the available clinical capacities. We propose RATCHET: RAdiological Text Captioning for Human Examined Thoraces. RATCHET is a CNN-RNN-based medical transformer that is trained end-to-end. It is capable of extracting image features from chest radiographs, and generates medically accurate text reports that fit seamlessly into clinical work flows. The model is evaluated for its natural language generation ability using common metrics from NLP literature, as well as its medically accuracy through a surrogate report classification task. The model is available for download at: <a class="link-external link-http" href="http://www.github.com/farrell236/RATCHET" rel="external noopener nofollow">this http URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?