Abstract: Open-domain question answering (QA) aims to find the answer to a question from a large collection of documents. Though many models for single-document machine comprehension have achieved strong performance, there is still much room for improving open-domain QA systems, since document retrieval and answer reranking remain unsatisfactory. Golden documents that contain the correct answers may not be scored correctly by the retrieval component, and correct answers that have been extracted may be ranked below other candidate answers by the reranking component. One reason is the independence principle, under which each candidate document (or answer) is scored independently, without considering its relationship to other documents (or answers). In this work, we propose a knowledge-aided open-domain QA (KAQA) method that aims to improve relevant document retrieval and candidate answer reranking by considering the relationship between a question and the documents (termed the question-document graph) and the relationship between candidate documents (termed the document-document graph). The graphs are built using knowledge triples from external knowledge resources. During document retrieval, a candidate document is scored by considering its relationship to the question and to other documents. During answer reranking, a candidate answer is reranked using not only its own context but also clues from other documents. The experimental results show that our proposed method improves document retrieval and answer reranking, and thereby enhances the overall performance of open-domain question answering.

CALM: Commen-Sense Knowledge Augmentation for Document Image Understanding

Simple and Effective Visual Question Answering in a Single Modality

Coarse-to-Careful: Seeking Semantic-related Knowledge for Open-domain Commonsense Question Answering

DIEM: Decomposition-Integration Enhancing Multimodal Insights

Knowledge-aware image understanding with multi-level visual representation enhancement for visual question answering

Keep Skills in Mind: Understanding and Implementing Skills in Commonsense Question Answering

Knowledge-Aided Open-Domain Question Answering

ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding

DSAMR: Dual-Stream Attention Multi-hop Reasoning for knowledge-based visual question answering

Knowledge Condensation and Reasoning for Knowledge-based VQA

Benchmarking Knowledge-Enhanced Commonsense Question Answering via Knowledge-to-Text Transformation

CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm

Cross-modal Knowledge Reasoning for Knowledge-based Visual Question Answering

Visually Grounded Commonsense Knowledge Acquisition

Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense

Towards Complex Document Understanding by Discrete Reasoning

Multi-Level Knowledge Injecting for Visual Commonsense Reasoning

Multimodal Commonsense Knowledge Distillation for Visual Question Answering

Parallel Fusion of Graph and Text with Semantic Enhancement for Commonsense Question Answering

Knowledge-Enhanced Visual Question Answering with Multi-modal Joint Guidance

Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning