Abstract:The opaque nature of transformer-based models, particularly in applications susceptible to unethical practices such as dark-patterns in user interfaces, requires models that integrate uncertainty quantification to enhance trust in predictions. This study focuses on dark-pattern detection, deceptive design choices that manipulate user decisions, undermining autonomy and consent. We propose a differential fine-tuning approach implemented at the final classification head via uncertainty quantification with transformer-based pre-trained models. Employing a dense neural network (DNN) head architecture as a baseline, we examine two methods capable of quantifying uncertainty: Spectral-normalized Neural Gaussian Processes (SNGPs) and Bayesian Neural Networks (BNNs). These methods are evaluated on a set of open-source foundational models across multiple dimensions: model performance, variance in certainty of predictions and environmental impact during training and inference phases. Results demonstrate that integrating uncertainty quantification maintains performance while providing insights into challenging instances within the models. Moreover, the study reveals that the environmental impact does not uniformly increase with the incorporation of uncertainty quantification techniques. The study's findings demonstrate that uncertainty quantification enhances transparency and provides measurable confidence in predictions, improving the explainability and clarity of black-box models. This facilitates informed decision-making and mitigates the influence of dark-patterns on user interfaces. These results highlight the importance of incorporating uncertainty quantification techniques in developing machine learning models, particularly in domains where interpretability and trustworthiness are critical.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to solve the problems of opacity and unreliability of Transformer Models when detecting Dark Patterns in user interfaces. Specifically: 1. **Opacity problem**: Due to their black - box nature, Transformer Models are difficult to gain trust in prediction results in high - risk applications. Especially in applications involving ethical issues, such as Dark Patterns in user interfaces, these problems are particularly prominent. Dark Patterns refer to those design choices that manipulate user decisions and undermine user autonomy and consent rights. 2. **Reliability problem**: In order to improve the reliability and interpretability of these models in critical application scenarios, Uncertainty Quantification needs to be introduced. This helps to provide a confidence measure for model predictions, thereby improving the decision - making process and reducing the impact of Dark Patterns on users. To this end, the author proposes a differential fine - tuning method and enhances trust through Uncertainty Quantification on the Final Classification Head. The research mainly focuses on the following two aspects: - **Uncertainty Quantification techniques**: Two methods are used to quantify uncertainty - Spectral - normalized Neural Gaussian Processes (SNGPs) and Bayesian Neural Networks (BNNs). These methods are applied to multiple open - source base models for evaluation. - **Performance and environmental impact**: The research not only examines the changes in model performance and prediction certainty, but also analyzes the environmental impact of these Uncertainty Quantification techniques during the training and inference stages. Through these methods, the paper hopes to develop an ethical and environmentally - friendly artificial intelligence system to meet the growing demand for transparency and trust in the digital environment.

Uncertainty Quantification for Transformer Models for Dark-Pattern Detection

Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices

Multidimensional Uncertainty Quantification for Deep Neural Networks

Quantifying Uncertainty in Deep Learning Classification with Noise in Discrete Inputs for Risk-Based Decision Making

A Survey on Uncertainty Quantification Methods for Deep Learning

A Survey on Uncertainty Quantification Methods for Deep Neural Networks: An Uncertainty Source Perspective

Uncertainty-Based Rejection in Machine Learning: Implications for Model Development and Interpretability

Uncertainty Quantification for Deep Learning

Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precision

Transformer-Based Astronomical Time Series Model with Uncertainty Estimation for Detecting Misclassified Instances

Enhancing Trustworthiness in ML-Based Network Intrusion Detection with Uncertainty Quantification

Towards trustworthy rotating machinery fault diagnosis via attention uncertainty in transformer

Uncertainty-aware deep learning for digital twin-driven monitoring: Application to fault detection in power lines

Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications

Epistemic Uncertainty Quantification For Pre-trained Neural Network

Evaluation of Uncertainty Quantification in Deep Learning

Uncertainty Quantification in Deep Neural Networks through Statistical Inference on Latent Space

On Uncertainty In Natural Language Processing

Just rotate it! Uncertainty estimation in closed-source models via multiple queries

Enabling uncertainty estimation in neural networks through weight perturbation for improved Alzheimer's disease classification

Uncertainty Quantification in Computer-Aided Diagnosis: Make Your Model say "I don't know" for Ambiguous Cases