Engineering Trustworthy AI: A Developer Guide for Empirical Risk Minimization

Diana Pfau,Alexander Jung
2024-10-25
Abstract:AI systems increasingly shape critical decisions across personal and societal domains. While empirical risk minimization (ERM) drives much of the AI success, it typically prioritizes accuracy over trustworthiness, often resulting in biases, opacity, and other adverse effects. This paper discusses how key requirements for trustworthy AI can be translated into design choices for the components of ERM. We hope to provide actionable guidance for building AI systems that meet emerging standards for trustworthiness of AI.
Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: **How to transform the key requirements of trustworthy AI into components of the Empirical Risk Minimization (ERM) method through design choices, thereby constructing more trustworthy artificial intelligence systems**. Specifically, although Empirical Risk Minimization (ERM) has driven the success of much of current AI, it usually gives priority to accuracy and ignores trustworthiness, leading to bias, opacity, and other negative impacts. Therefore, this paper explores how to make specific design choices for the three components of data, model, and loss function to meet the seven key trustworthy AI requirements proposed by the European Union (robustness, privacy - protection, fairness, explainability, etc.), thereby providing actionable guidance for constructing AI systems that meet the emerging trustworthy AI standards. ### Formulas Involved 1. **Empirical Risk**: \[ \hat{L}(h|D) := \frac{1}{m} \sum_{r = 1}^{m} L((x^{(r)}, y^{(r)}), h) \] where \( D=\{(x^{(1)}, y^{(1)}), \ldots, (x^{(m)}, y^{(m)})\} \) is the training set, and \( L((x, y), h) \) is the loss function. 2. **Empirical Risk Minimization**: \[ \hat{h} := \arg\min_{h \in H} \hat{L}(h|D)=\arg\min_{h \in H} \sum_{(x,y) \in D} L((x, y), h) \] 3. **Huber Loss**: \[ L_{\text{Huber}}(y, h(x)) = \begin{cases} \frac{1}{2}(y - h(x))^2 & \text{if } |y - h(x)| \leq \delta \\ \delta (|y - h(x)| - \frac{1}{2}\delta) & \text{otherwise} \end{cases} \] 4. **Adversarial Loss**: \[ \tilde{h} = \arg\min_{h \in H} \sup_{\tilde{D} \in U} \sum_{(x,y) \in \tilde{D}} L((x, y), h) \] where \( U \) is the uncertainty set, representing the possible perturbation range. Through these design choices, the paper aims to ensure that AI systems are not only optimized statistically and computationally, but also achieve higher trustworthiness at the ethical and social levels.