Tackling algorithmic bias and promoting transparency in health datasets: the STANDING Together consensus recommendations
Joseph E Alderman,Joanne Palmer,Elinor Laws,Melissa D McCradden,Johan Ordish,Marzyeh Ghassemi,Stephen R Pfohl,Negar Rostamzadeh,Heather Cole-Lewis,Ben Glocker,Melanie Calvert,Tom J Pollard,Jaspret Gill,Jacqui Gath,Adewale Adebajo,Jude Beng,Cassandra H Leung,Stephanie Kuku,Lesley-Anne Farmer,Rubeta N Matin,Bilal A Mateen,Francis McKay,Katherine Heller,Alan Karthikesalingam,Darren Treanor,Maxine Mackintosh,Lauren Oakden-Rayner,Russell Pearson,Arjun K Manrai,Puja Myles,Judit Kumuthini,Zoher Kapacee,Neil J Sebire,Lama H Nazer,Jarrel Seah,Ashley Akbari,Lew Berman,Judy W Gichoya,Lorenzo Righetto,Diana Samuel,William Wasswa,Maria Charalambides,Anmol Arora,Sameer Pujari,Charlotte Summers,Elizabeth Sapey,Sharon Wilkinson,Vishal Thakker,Alastair Denniston,Xiaoxuan Liu
DOI: https://doi.org/10.1016/S2589-7500(24)00224-3
2024-12-12
Abstract:Without careful dissection of the ways in which biases can be encoded into artificial intelligence (AI) health technologies, there is a risk of perpetuating existing health inequalities at scale. One major source of bias is the data that underpins such technologies. The STANDING Together recommendations aim to encourage transparency regarding limitations of health datasets and proactive evaluation of their effect across population groups. Draft recommendation items were informed by a systematic review and stakeholder survey. The recommendations were developed using a Delphi approach, supplemented by a public consultation and international interview study. Overall, more than 350 representatives from 58 countries provided input into this initiative. 194 Delphi participants from 25 countries voted and provided comments on 32 candidate items across three electronic survey rounds and one in-person consensus meeting. The 29 STANDING Together consensus recommendations are presented here in two parts. Recommendations for Documentation of Health Datasets provide guidance for dataset curators to enable transparency around data composition and limitations. Recommendations for Use of Health Datasets aim to enable identification and mitigation of algorithmic biases that might exacerbate health inequalities. These recommendations are intended to prompt proactive inquiry rather than acting as a checklist. We hope to raise awareness that no dataset is free of limitations, so transparent communication of data limitations should be perceived as valuable, and absence of this information as a limitation. We hope that adoption of the STANDING Together recommendations by stakeholders across the AI health technology lifecycle will enable everyone in society to benefit from technologies which are safe and effective.