Comparison of Performance of Large Language Models on Lung-RADS Related Questions

Eren Çamur,Turay Cesur,Yasin Celal Güneş
DOI: https://doi.org/10.1200/GO.24.00200
Abstract:This study evaluates LLM integration in interpreting Lung-RADS for lung cancer screening, highlighting their innovative role in enhancing radiological practice. Our findings reveal that Claude 3 Opus and Perplexity achieved a 96% accuracy rate, outperforming other models.
What problem does this paper attempt to address?