Delineating COVID-19 subgroups using routine clinical data identifies distinct in-hospital outcomes

Bojidar Rangelov, Alexandra Young, Watjana Lilaonitkul, Shahab Aslani, Paul Taylor, Eyjólfur Guðmundsson, Qianye Yang, Yipeng Hu, John R Hurst, David J Hawkes, Joseph Jacob, The NCCID Collaborative, Pardeep Bains, Dominic Cushnan, Mark Halling-Brown, Emily Jefferson, Francois Lemarchand, Anastasios Sarellas, Daniel Schofield, James Sutherland, Mathew Watt, Daniel Alexander, Hena Aziz, Emma Lewis, Gerald Lip, Peter Manser, Philip Quinlan, Neil Sebire, Andrew Swift, Smita Shetty, Peter Williams, Oscar Bennett, Samie Dorgham, Alberto Favaro, Samantha Gan, Tara Ganepola, Gergely Imreh, Neha Puri, Jonathan Carl Luis Rodrigues, Helen Oliver, Benjamin Hudson, Graham Robinson, Richard Wood, Annette Moreton, Katy Lomas, Nigel Marchbank, Chinnoi Law, Harmeet Chana, Nemi Gandy, Ban Sharif, Leila Ismail, Jaymini Patel, Debbie Wai, Liz Mathers, Rachel Clark, Anisha Harrar, Alison Bettany, Kieran Foley, Carla Pothecary, Stephen Buckle, Lisa Roche, Aarti Shah, Fiona Kirkham, Hannah Bown, Simon Seal, Hayley Connoley, Jenna Tugwell-Allsup, Bethan Wyn Owen, Mary Jones, Andrew Moth, Jordan Colman, Giles Maskell, Daniel Kim, Alexander Sanchez-Cabello, Hannah Lewis, Matthew Thorley, Ross Kruger, Madalina Chifu, Nicholas Ashley, Susanne Spas, Angela Bates, Peter Halson, Chris Heafey, Caroline McCann, David McCreavy, Dileep Duvva, Tze Siah, Janet Deane, Emily Pearlman, James MacKay, Melissa Sia, Esme Easter, Doreen Brookes, Paul Burford, Ramona-Rita Barbara, Thomas Payne, Mark Ingram, Bahadar Bhatia, Sarah Yusuf, Fiona Rotherham, Gayle Warren, Angela Heeney, Angela Bowen, Adele Wilson, Zahida Hussain, Joanne Kellett, Rachael Harrison, Janet Watkins, Lisa Patterson, Tom Welsh, Dawn Redwood, Natasha Greig, Lindsay Van Pelt, Susan Palmer, Kate Milne, Joanna Tilley, Melissa Alexander, Amy J Frary, Judith L Babar, Timothy Sadler, Edward Neil-Gallacher, Sarah Cardona, Avneet Gill, Nnenna Omeje, Claire Ridgeon, Fergus Gleeson, Annette Johnstone, Russell Frood, Mohammed Atif Rabani, Andrew Scarsbrook, Mark D Lyttle, Stephen Lyen, Gareth James, Sarah Sheedy, Kiarna Homer, Alison Glover, Ben Gibbison, Jane Blazeby, Mai Baquedano, Teresa Jacob, Sisa Grubnic, Tony Crick, Debbie Crawford, Fiona Prestwood, Margaret Cooper, Mark Radon
DOI: https://doi.org/10.1038/s41598-023-32469-9
2023-06-20
Abstract: The COVID-19 pandemic has been a great challenge to healthcare systems worldwide. It highlighted the need for robust predictive models that can be readily deployed to uncover heterogeneities in disease course, aid decision-making and prioritise treatment. We adapted an unsupervised, data-driven model, SuStaIn, for use in short-term infectious diseases such as COVID-19, based on 11 commonly recorded clinical measures. We used data from 1344 patients in the National COVID-19 Chest Imaging Database (NCCID) who were hospitalised for RT-PCR-confirmed COVID-19, splitting them equally into a training cohort and an independent validation cohort. We discovered three COVID-19 subtypes (General Haemodynamic, Renal and Immunological) and introduced disease severity stages, both of which were predictive of distinct risks of in-hospital mortality or escalation of treatment when analysed using Cox Proportional Hazards models. A low-risk Normal-appearing subtype was also discovered. The model and our full pipeline are available online and can be adapted for future outbreaks of COVID-19 or other infectious diseases.
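
The equal split of 1344 patients into training and validation cohorts can be illustrated with a minimal sketch. The abstract does not say how patients were assigned to each cohort, so the seeded random shuffle, the function name `split_cohort` and the placeholder patient IDs below are assumptions for illustration only, not the authors' pipeline.

```python
import random


def split_cohort(patient_ids, seed=0):
    """Shuffle IDs reproducibly and return two equal halves.

    Hypothetical helper: a seeded random split is an assumption,
    not the procedure reported in the paper.
    """
    ids = list(patient_ids)
    rng = random.Random(seed)  # local generator, leaves global state untouched
    rng.shuffle(ids)
    half = len(ids) // 2
    return ids[:half], ids[half:]


# Placeholder identifiers standing in for the 1344 NCCID patients.
patient_ids = [f"patient_{i:04d}" for i in range(1344)]
train_ids, validation_ids = split_cohort(patient_ids)
print(len(train_ids), len(validation_ids))  # 672 672
```

Fixing the seed keeps the two cohorts reproducible, which matters when subtypes found in the training half are later checked against the validation half.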