COVID-CT-Dataset: A CT Scan Dataset about COVID-19.

Xingyi Yang,Xuehai He,Jinyu Zhao,Yichen Zhang,Shanghang Zhang,Pengtao Xie
2020-01-01
Abstract:During the outbreak time of COVID-19, computed tomography (CT) is a usefulmanner for diagnosing COVID-19 patients. Due to privacy issues, publiclyavailable COVID-19 CT datasets are highly difficult to obtain, which hindersthe research and development of AI-powered diagnosis methods of COVID-19 basedon CTs. To address this issue, we build an open-sourced dataset – COVID-CT,which contains 349 COVID-19 CT images from 216 patients and 463 non-COVID-19CTs. The utility of this dataset is confirmed by a senior radiologist who hasbeen diagnosing and treating COVID-19 patients since the outbreak of thispandemic. We also perform experimental studies which further demonstrate thatthis dataset is useful for developing AI-based diagnosis models of COVID-19.Using this dataset, we develop diagnosis methods based on multi-task learningand self-supervised learning, that achieve an F1 of 0.90, an AUC of 0.98, andan accuracy of 0.89. According to the senior radiologist, models with suchperformance are good enough for clinical usage. The data and code are availableat https://github.com/UCSD-AI4H/COVID-CT
What problem does this paper attempt to address?