Engineering Trustworthy AI: A Developer Guide for Empirical Risk Minimization

Diana Pfau,Alexander Jung

2024-10-25

Abstract:AI systems increasingly shape critical decisions across personal and societal domains. While empirical risk minimization (ERM) drives much of the AI success, it typically prioritizes accuracy over trustworthiness, often resulting in biases, opacity, and other adverse effects. This paper discusses how key requirements for trustworthy AI can be translated into design choices for the components of ERM. We hope to provide actionable guidance for building AI systems that meet emerging standards for trustworthiness of AI.

Artificial Intelligence

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: **How to transform the key requirements of trustworthy AI into components of the Empirical Risk Minimization (ERM) method through design choices, thereby constructing more trustworthy artificial intelligence systems**. Specifically, although Empirical Risk Minimization (ERM) has driven the success of much of current AI, it usually gives priority to accuracy and ignores trustworthiness, leading to bias, opacity, and other negative impacts. Therefore, this paper explores how to make specific design choices for the three components of data, model, and loss function to meet the seven key trustworthy AI requirements proposed by the European Union (robustness, privacy - protection, fairness, explainability, etc.), thereby providing actionable guidance for constructing AI systems that meet the emerging trustworthy AI standards. ### Formulas Involved 1. **Empirical Risk**: \[ \hat{L}(h|D) := \frac{1}{m} \sum_{r = 1}^{m} L((x^{(r)}, y^{(r)}), h) \] where \( D=\{(x^{(1)}, y^{(1)}), \ldots, (x^{(m)}, y^{(m)})\} \) is the training set, and \( L((x, y), h) \) is the loss function. 2. **Empirical Risk Minimization**: \[ \hat{h} := \arg\min_{h \in H} \hat{L}(h|D)=\arg\min_{h \in H} \sum_{(x,y) \in D} L((x, y), h) \] 3. **Huber Loss**: \[ L_{\text{Huber}}(y, h(x)) = \begin{cases} \frac{1}{2}(y - h(x))^2 & \text{if } |y - h(x)| \leq \delta \\ \delta (|y - h(x)| - \frac{1}{2}\delta) & \text{otherwise} \end{cases} \] 4. **Adversarial Loss**: \[ \tilde{h} = \arg\min_{h \in H} \sup_{\tilde{D} \in U} \sum_{(x,y) \in \tilde{D}} L((x, y), h) \] where \( U \) is the uncertainty set, representing the possible perturbation range. Through these design choices, the paper aims to ensure that AI systems are not only optimized statistically and computationally, but also achieve higher trustworthiness at the ethical and social levels.

Engineering Trustworthy AI: A Developer Guide for Empirical Risk Minimization

Filling gaps in trustworthy development of AI

Towards Risk-Free Trustworthy Artificial Intelligence: Significance and Requirements

Trust and trustworthy artificial intelligence: A research agenda for AI in the environmental sciences

Trustworthy AI: From Principles to Practices

Artificial Intelligence (AI) Trust Framework and Maturity Model: Applying an Entropy Lens to Improve Security, Privacy, and Ethical AI

The Challenges and Opportunities of Artificial Intelligence for Trustworthy Robots and Autonomous Systems

Challenges and efforts in managing AI trustworthiness risks: a state of knowledge

Trustworthy and Responsible AI for Human-Centric Autonomous Decision-Making Systems

Establishing and Evaluating Trustworthy AI: Overview and Research Challenges

Who to Trust, How and Why: Untangling AI Ethics Principles, Trustworthiness and Trust

Trustworthy Artificial Intelligence: A Review

The Challenges and Opportunities of Human-Centered AI for Trustworthy Robots and Autonomous Systems

The Importance of Distrust in AI

Guideline for Trustworthy Artificial Intelligence -- AI Assessment Catalog

The Reasoning Under Uncertainty Trap: A Structural AI Risk

The relationship between trust in AI and trustworthy machine learning technologies

Survey of Trustworthy AI: A Meta Decision of AI

Developing trustworthy artificial intelligence: insights from research on interpersonal, human-automation, and human-AI trust

Misplaced Trust and Distrust: How Not to Engage with Medical Artificial Intelligence

Designing Trustworthy AI: A Human-Machine Teaming Framework to Guide Development