Towards Generalizable and Faithful Logic Reasoning over Natural Language via Resolution Refutation

Zhouhao Sun,Xiao Ding,Li Du,Bibo Cai,Jinglong Gao,Ting Liu,Qin Bing

2024-04-03

Abstract:Large language models (LLMs) have achieved significant performance in various natural language reasoning tasks. However, they still struggle with performing first-order logic reasoning over formal logical theories expressed in natural language. This is because the previous LLMs-based reasoning systems have the theoretical incompleteness issue. As a result, it can only address a limited set of simple reasoning problems, which significantly decreases their generalization ability. To address this issue, we propose a novel framework, named Generalizable and Faithful Reasoner (GFaiR), which introduces the paradigm of resolution refutation. Resolution refutation has the capability to solve all first-order logic reasoning problems by extending reasoning rules and employing the principle of proof by contradiction, so our system's completeness can be improved by introducing resolution refutation. Experimental results demonstrate that our system outperforms previous works by achieving state-of-the-art performances in complex scenarios while maintaining performances in simple scenarios. Besides, we observe that GFaiR is faithful to its reasoning process.

Artificial Intelligence,Computation and Language

What problem does this paper attempt to address?

The problem that this paper attempts to solve is that in natural language processing, existing large - language models (LLMs) have theoretical incompleteness when performing first - order logical reasoning, which causes them to be able to solve only a limited number of simple reasoning problems and cannot be generalized to more complex scenarios. Specifically, the paper points out that the current reasoning systems based on LLMs have the hallucination problem, that is, these models may generate incorrect intermediate reasoning steps and thus reach the final conclusion, which not only affects the accuracy of reasoning but also reduces the completeness of the system. To overcome these problems, the paper proposes a new framework - Generalizable and Faithful Reasoner (GFaiR), which introduces resolution refutation. Resolution refutation can solve all first - order logical reasoning problems by expanding the reasoning rules and adopting the principle of proving contradictions, thereby improving the completeness and credibility of the system. The experimental results show that GFaiR outperforms previous work in complex scenarios, while also maintaining good performance in simple scenarios, and its reasoning process is trustworthy.

Towards Generalizable and Faithful Logic Reasoning over Natural Language via Resolution Refutation

Concise and Organized Perception Facilitates Large Language Models for Deductive Reasoning.

Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning

Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding

Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning

Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework

Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond

Large Language Models as an Indirect Reasoner: Contrapositive and Contradiction for Automated Reasoning

Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic

FOLIO: Natural Language Reasoning with First-Order Logic

LLMs for Relational Reasoning: How Far are We?

CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge

Reliable Reasoning Beyond Natural Language

Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text

LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations

LeanReasoner: Boosting Complex Logical Reasoning with Lean

DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy

Language Models Can Be Logical Solvers

Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models

Concise and Organized Perception Facilitates Reasoning in Large Language Models

A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences