Abstract:Machine Reading Comprehension (MRC) has achieved impressive answer inference performance in recent years but rarely considers the trustworthiness and reliability of the deployed systems. However, it is crucial to estimate the predictive uncertainty in real-world applications to measure how likely the prediction is wrong. Hence it is possible to abstain from the uncertain prediction with low confidence and build a trustworthy system. Prior studies use post-processing ways to measure the predictive uncertainty, such as employing heuristic softmax probability or training a calibrator on top of a trained MRC model. However, they only calibrate the confidence without considering the domain adaptation relationship. To handle the limitations, this paper presents TrustMRC, a non-postprocessing trustworthy MRC system that leverages (1) conditional calibration strategy to get reliable uncertainty, and (2) conditional adversarial learning strategy to learn transfer representations under domain shift setting. On the one hand, to estimate the predictive uncertainty, a conditional calibration module is proposed to predict whether the output of the answer prediction module is correct, and it is combined with an additional ECE constraint to restrict the confidence more reliable. On the other hand, for domain shift, TrustMRC designs a conditional adversarial learning strategy to learn transfer representations through a domain discriminator with uncertainty constraints, which takes both input and uncertainty alignment into account. Besides, TrustMRC is a non-postprocessing model that completes the answer prediction and uncertainty prediction in an end-to-end framework, so that these two sub-tasks can benefit from each other via multi-task learning. Instead of traditional EM and F1 metrics, EM-coverage and F1-coverage curves are used, for the trustworthiness-aware MRC evaluation. The experimental results on SQuAD 1.1, Natural Questions, and NewsQA datasets indicate that TrustMRC can make reliable predictions under domain shift settings.

Defending Machine Reading Comprehension against Question-Targeted Attacks.

D-DAE: Defense-Penetrating Model Extraction Attacks.

Using Adversarial Attacks to Reveal the Statistical Bias in Machine Reading Comprehension Models

EI-MTD: Moving Target Defense for Edge Intelligence Against Adversarial Attacks

The Impacts of Unanswerable Questions on the Robustness of Machine Reading Comprehension Models

A Robust Adversarial Training Approach to Machine Reading Comprehension

Feeding What You Need by Understanding What You Learned

Adversarial Domain Adaptation for Machine Reading Comprehension

Deceiving Question-Answering Models: A Hybrid Word-Level Adversarial Approach

A Multi-Task Learning Machine Reading Comprehension Model for Noisy Document (student Abstract)

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

A novel multi-domain machine reading comprehension model with domain interference mitigation

Learning Invariant Representation Improves Robustness for MRC Models

Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension

A Multi-answer Multi-task Framework for Real-world Machine Reading Comprehension.

Defending Adversarial Attacks on Cloud-aided Automatic Speech Recognition Systems.

Robustness-Eva-MRC: Assessing and Analyzing the Robustness of Neural Models in Extractive Machine Reading Comprehension

Trustworthy machine reading comprehension with conditional adversarial calibration

Improving the robustness of machine reading comprehension model with hierarchical knowledge and auxiliary unanswerability prediction

Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification.

Human Behavior Inspired Machine Reading Comprehension