Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation

Eric Melz

2023-11-08

Abstract:Large Language Models (LLMs) are smart but forgetful. Recent studies, (e.g., (Bubeck et al., 2023)) on modern LLMs have shown that they are capable of performing amazing tasks typically necessitating human-level intelligence. However, unlike humans, frozen LLMs do not improve over time; they neither acquire new knowledge nor learn from their successes or failures. Some approaches to improving the intelligence of LLMs include fine-tuning models based on problem-solving performance (Zelikman et al., 2022), and building bigger and more sophisticated models (Bubeck et al., 2023). However, these methods have the drawback of requiring substantial data and computational resources to retrain existing models. In this paper, we explore the use of Retrieval Augmented Generation, also known as RAG (Lewis et al., 2021) to improve problem-solving performance. We propose ARM-RAG (Auxiliary Rationale Memory for Retrieval Augmented Generation), a system that learns from its successes without incurring high training costs. We demonstrate that the storage and subsequent retrieval of reasoning chains have a positive influence on performance in grade-school math problems.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

The paper aims to address the limitations of large language models (LLMs) in problem-solving, particularly the fact that these models, while intelligent, lack memory functionality and cannot acquire new knowledge over time or learn from past successes and failures. To tackle this issue, the paper proposes the ARM-RAG (Assisted Reasoning Memory Retrieval-Augmented Generation) system. The main contributions of the paper are as follows: 1. **Proposing the ARM-RAG system**: This system utilizes Retrieval-Augmented Generation (RAG) technology to improve LLMs' performance in solving mathematical problems by storing and retrieving previously successful reasoning chains (referred to as "rationales"). 2. **Experimental validation**: A series of experiments demonstrate the effectiveness of the ARM-RAG system, and it was found that the system's performance further improves when the target problem is fuzzily processed. 3. **Comparative experiments**: The effects of different prompting methods, including strong prompts and negative prompts, were compared to verify the impact of prompts on the system's performance. The core hypothesis of the paper is that effective retrieval-augmented generation technology can significantly enhance the problem-solving capabilities of LLMs. Experimental results show that the ARM-RAG system outperforms baseline models on the training set, and its performance on the test set also improves, especially after applying fuzzy query techniques.

Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation

AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models

M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions

Retrieval-Augmented Generation for Large Language Models: A Survey

Metacognitive Retrieval-Augmented Large Language Models

RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement

GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation

How Much Can RAG Help the Reasoning of LLM?

LLM Augmentations to support Analytical Reasoning over Multiple Documents

Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models

RRAML: Reinforced Retrieval Augmented Machine Learning

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

RARE: Retrieval-Augmented Reasoning Enhancement for Large Language Models

Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation