Abstract:The pervasive integration of Artificial Intelligence (AI) has introduced complex challenges in the responsibility and accountability in the event of incidents involving AI-enabled systems. The interconnectivity of these systems, ethical concerns of AI-induced incidents, coupled with uncertainties in AI technology and the absence of corresponding regulations, have made traditional responsibility attribution challenging. To this end, this work proposes a Computational Reflective Equilibrium (CRE) approach to establish a coherent and ethically acceptable responsibility attribution framework for all stakeholders. The computational approach provides a structured analysis that overcomes the limitations of conceptual approaches in dealing with dynamic and multifaceted scenarios, showcasing the framework's explainability, coherence, and adaptivity properties in the responsibility attribution process. We examine the pivotal role of the initial activation level associated with claims in equilibrium computation. Using an AI-assisted medical decision-support system as a case study, we illustrate how different initializations lead to diverse responsibility distributions. The framework offers valuable insights into accountability in AI-induced incidents, facilitating the development of a sustainable and resilient system through continuous monitoring, revision, and reflection.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to reasonably attribute responsibility in events triggered by artificial intelligence (AI). With the popularization of AI technology, the integration of AI systems in various key functions has brought complex challenges in terms of responsibility and accountability. These challenges include the interconnectivity of AI systems, ethical issues, technical uncertainties, and the lack of corresponding regulations, making it difficult for traditional responsibility - attribution methods to cope. Therefore, this paper proposes a method based on Computational Reflective Equilibrium (CRE), aiming to establish a coherent and ethically acceptable responsibility - attribution framework for all stakeholders.
Specifically, the paper addresses this problem through the following aspects:
1. **Proposing the CRE framework**: Using the concept of reflective equilibrium and combining computational methods to overcome the limitations of conceptual methods in dealing with dynamic and multi - faceted scenarios. The CRE framework provides a structured analysis method that can demonstrate interpretability, consistency, and adaptability in the responsibility - attribution process.
2. **Case study**: Using an AI - assisted medical decision - support system as a case study, it shows how different initial conditions lead to different responsibility distributions. This helps in understanding how to allocate responsibility in practical applications.
3. **Challenges in responsibility attribution**: The paper discusses several major challenges in responsibility attribution, including the complexity and interconnectivity of AI systems, the uncertainty of the technology itself, the lack of clear legal and ethical guidelines, and the new challenges brought by ethical constraints.
4. **Computational methods**: It details the computational process of the CRE framework, including steps such as identifying responsible entities, finding supporting claims, constructing a constraint network, and performing consistency - driven calculations. Through these steps, the CRE framework can find a responsibility - allocation scheme that is most in line with ethics and logic.
5. **The influence of initial activation levels**: The paper also explores the influence of initial activation levels on the final responsibility - allocation results and proposes methods for quantitatively evaluating initial activation levels, including public preferences and quantitative surveys.
In conclusion, by proposing the CRE framework, this paper aims to solve the problem of responsibility attribution in AI - triggered events and provide theoretical and practical support for the establishment of safe, reliable, and trustworthy AI systems.