Abstract:Commonsense reasoning is one of the abilities necessary for artificial intelligence to be as intelligent as humans. However, how to make AI understand commonsense has been a problem that has plagued artificial intelligence for more than 60 years. Existing efforts focus more on the means of knowledge acquisition and strive to enrich the capacity of commonsense knowledge (CSK) bases and dimensions of CSK through advanced methods. Unfortunately, this exuberance has obscured a general consideration of CSK, such as how to follow human habits to obtain the most representative knowledge we need to understand the world. In this paper, this representative knowledge is referred to as core CSK. The influence of core CSK is extensive, and it constitutes almost the fundamental element of human life and the most fundamental cognition of the world. Harnessing human curiosity to find solutions to the above problems is an effective and straightforward route. Specifically, we focus on a special corpus to mine core CSK, namely, why-questions. For example, we can harvest “the sky is blue” from “why is the sky blue?”. To this end, we propose a novel method to extract CSK from why-questions, which mainly consist of two modules. The first is a question classification module used to determine whether a question contains CSK. In this module, we propose a classifier based on a one-sided bootstrapping method and design several informative features for the classifier. The second is a crowdsourcing module used to improve the quality of the extracted commonsense. We conduct extensive experiments, and the experimental results show that our method effectively mines CSK from question corpora. Furthermore, statistical analysis demonstrates the feasibility of this curiosity-driven approach, implying that we provide a basic idea for collecting core CSK. Remarkably, today’s outstanding large language models do not have such simple knowledge summarization capabilities, demonstrating the barrier between the excellence of language models and the universality of CSK.

Does It Make Sense? And Why? A Pilot Study For Sense Making And Explanation

Do Multi-Sense Embeddings Improve Natural Language Understanding?

What Really is Commonsense Knowledge?

SemEval-2020 Task 4: Commonsense Validation and Explanation.

CommonsenseVIS: Visualizing and Understanding Commonsense Reasoning Capabilities of Natural Language Models

Evaluating Commonsense in Pre-trained Language Models

Every Answer Matters: Evaluating Commonsense with Probabilistic Measures

Commonsense Knowledge Salience Evaluation with a Benchmark Dataset in E-commerce

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

A framework for quantifying individual and collective common sense

Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models?

Exploring and Analyzing Machine Commonsense Benchmarks

Probing Commonsense Explanation in Dialogue Response Generation

Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning

Natural Language Processing with Commonsense Knowledge: A Survey

Don’t Ignore the Drive of Curiosity: Rethinking Subtleties Between Universality of Commonsense Knowledge and Excellence of Large Language Models

Benchmarks for Automated Commonsense Reasoning: A Survey

Helpful, Misleading or Confusing: How Humans Perceive Fundamental Building Blocks of Artificial Intelligence Explanations

Extending Sense-Making Models with Ideas from Cognition and Learning Theories.

Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards

Benchmarking Knowledge-Enhanced Commonsense Question Answering via Knowledge-to-Text Transformation