A Hybrid Neural Network Model for Commonsense Reasoning

Pengcheng He,Xiaodong Liu,Weizhu Chen,Jianfeng Gao
DOI: https://doi.org/10.48550/arXiv.1907.11983
2019-07-28
Abstract:This paper proposes a hybrid neural network (HNN) model for commonsense reasoning. An HNN consists of two component models, a masked language model and a semantic similarity model, which share a BERT-based contextual encoder but use different model-specific input and output layers. HNN obtains new state-of-the-art results on three classic commonsense reasoning tasks, pushing the WNLI benchmark to 89%, the Winograd Schema Challenge (WSC) benchmark to 75.1%, and the PDP60 benchmark to 90.0%. An ablation study shows that language models and semantic similarity models are complementary approaches to commonsense reasoning, and HNN effectively combines the strengths of both. The code and pre-trained models will be publicly available at <a class="link-external link-https" href="https://github.com/namisan/mt-dnn" rel="external noopener nofollow">this https URL</a>.
Computation and Language
What problem does this paper attempt to address?