Towards Responsible AI: A Design Space Exploration of Human-Centered Artificial Intelligence User Interfaces to Investigate Fairness

Yuri Nakao,Lorenzo Strappelli,Simone Stumpf,Aisha Naseer,Daniele Regoli,Giulia Del Gamba
DOI: https://doi.org/10.48550/arXiv.2206.00474
2022-06-01
Abstract:With Artificial intelligence (AI) to aid or automate decision-making advancing rapidly, a particular concern is its fairness. In order to create reliable, safe and trustworthy systems through human-centred artificial intelligence (HCAI) design, recent efforts have produced user interfaces (UIs) for AI experts to investigate the fairness of AI models. In this work, we provide a design space exploration that supports not only data scientists but also domain experts to investigate AI fairness. Using loan applications as an example, we held a series of workshops with loan officers and data scientists to elicit their requirements. We instantiated these requirements into FairHIL, a UI to support human-in-the-loop fairness investigations, and describe how this UI could be generalized to other use cases. We evaluated FairHIL through a think-aloud user study. Our work contributes better designs to investigate an AI model's fairness-and move closer towards responsible AI.
Artificial Intelligence,Human-Computer Interaction
What problem does this paper attempt to address?
The core problem that this paper attempts to solve is the fairness issue of artificial intelligence (AI) systems in the process of assisting or automating decision - making, especially in the specific scenario of loan applications. With the rapid development of AI technology, its applications in fields such as justice, medical care, and finance are becoming more and more widespread, but the reliability and fairness of these systems have attracted increasing attention. In order to create reliable, safe, and trustworthy AI systems, the human - centered artificial intelligence (HCAI) design concept has gradually become a research hotspot. However, many existing user interfaces (UI) and technologies are mainly for data scientists and machine - learning experts, and fail to fully consider the needs of domain experts and other stakeholders. ### Specific Problems 1. **Understanding How Different User Groups Evaluate the Fairness of AI**: - The research aims to understand how loan officers and data scientists evaluate the fairness of AI systems when handling loan applications, including the processes, standards, information needs, and transparency requirements they use. 2. **Designing UI Components to Support Different User Groups in Evaluating AI Fairness**: - Through a series of workshops, the research team collected the needs of loan officers and data scientists, and based on these needs, designed a set of UI components (FairHIL) to support these user groups in evaluating the fairness of AI systems during human - computer interaction. 3. **Evaluating the Effectiveness and Usability of UI Components**: - Through user research, verify the effectiveness and usability of these UI components in practical applications, and explore how to promote these UI components to other fields. ### Goals - **Identifying Needs**: Determine the specific needs of domain experts and data scientists in evaluating the fairness of AI. - **Providing UI Components**: Develop a set of UI components to support the work processes and practices of these stakeholders when evaluating the fairness of AI systems. - **Clarifying the Design Space**: Define design options for developing similar UI components in other fields. - **Evaluating Examples**: Evaluate the application effects of these UI components in the loan application field through user research. ### Contributions - **Requirement Analysis**: Clarified the needs of domain experts and data scientists in evaluating the fairness of AI. - **UI Design**: Provided a set of UI components to support different user groups in evaluating the fairness of AI. - **Design Space Exploration**: Provided a reference framework for similar designs in other fields. - **User Evaluation**: Verified the effectiveness and usability of UI components through user research. Through these efforts, this research has contributed to the realization of responsible AI development, especially in making progress in ensuring the fairness of AI systems.