Abstract: Memory is a crucial human faculty: visual and linguistic information is retained in the hippocampus and neurons in the brain and can later be retrieved to address real-world challenges that arise over a lifetime of learning. Resolving complex AI tasks by applying acquired knowledge is a stride toward artificial general intelligence. However, although Large Language Models (LLMs) such as GPT-3.5 and GPT-4 \cite{brown2020language, leiter2023chatgpt, zaitsu2023distinguishing, OpenAI2023GPT4TR} have displayed remarkable capabilities in language comprehension, generation, interaction, and reasoning, they are constrained by context length, which precludes the processing of extensive, continually evolving knowledge bases. This paper proposes that LLMs could be augmented through the selective integration of knowledge from external repositories and introduces a novel methodology for External Reasoning, exemplified by ChatPDF. Central to this approach is a tiered policy for \textbf{External Reasoning based on Multiple LLM Interchange Assistance} (\cref{fig:overall}), in which the level of support is modulated across entry, intermediate, and advanced tiers according to the complexity of the query, with adjustments made in response to human feedback. A comprehensive evaluation of this methodology using multiple LLMs indicates state-of-the-art performance (\cref{comparison}), surpassing existing solutions including ChatPDF.com. Moreover, the paper emphasizes that this approach is more efficient than the direct processing of full text by LLMs.
The source code is publicly available at \url{https://github.com/AkideLiu/ANLP}.
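
The tiered policy described in the abstract can be pictured with a minimal sketch. The abstract does not say how query complexity is measured or how human feedback changes the policy, so everything below is an assumption made for illustration: the `TierPolicy` class, its thresholds, the word-count heuristic in `estimate_complexity`, and the feedback rule are hypothetical and are not taken from the paper or its repository.

```python
from dataclasses import dataclass


@dataclass
class TierPolicy:
    """Hypothetical policy mapping a complexity score in [0, 1] to a support tier."""

    intermediate_threshold: float = 0.4  # assumed cut-off, not from the paper
    advanced_threshold: float = 0.7      # assumed cut-off, not from the paper
    step: float = 0.05                   # assumed size of one feedback adjustment

    def select_tier(self, complexity: float) -> str:
        """Return 'entry', 'intermediate', or 'advanced' for a given score."""
        if complexity >= self.advanced_threshold:
            return "advanced"
        if complexity >= self.intermediate_threshold:
            return "intermediate"
        return "entry"

    def apply_feedback(self, satisfied: bool) -> None:
        """After negative feedback, lower both thresholds so later queries escalate sooner."""
        if satisfied:
            return
        self.intermediate_threshold = max(0.0, self.intermediate_threshold - self.step)
        self.advanced_threshold = max(
            self.intermediate_threshold, self.advanced_threshold - self.step
        )


def estimate_complexity(query: str) -> float:
    """Toy stand-in for a complexity estimate: word count scaled into [0, 1]."""
    return min(len(query.split()) / 50.0, 1.0)


if __name__ == "__main__":
    policy = TierPolicy()
    query = "Summarise the main contribution of this document."
    print(policy.select_tier(estimate_complexity(query)))
```

In this sketch a dissatisfied user only shifts the thresholds, so the same query may be routed to a higher tier on a later attempt; a real system would also need the retrieval and LLM calls that each tier stands for.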

Rethinking with Retrieval: Faithful Large Language Model Inference

Concise and Organized Perception Facilitates Large Language Models for Deductive Reasoning

Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding

Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons

Improving Retrieval Augmented Language Model with Self-Reasoning

On the Hardness of Faithful Chain-of-Thought Reasoning in Large Language Models

Towards Faithful Chain-of-Thought: Large Language Models are Bridging Reasoners

Rational Metareasoning for Large Language Models

External Reasoning: Towards Multi-Large-Language-Models Interchangeable Assistance with Human Feedback

RARE: Retrieval-Augmented Reasoning Enhancement for Large Language Models

Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning

RRAML: Reinforced Retrieval Augmented Machine Learning

Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation

Self-Knowledge Guided Retrieval Augmentation for Large Language Models

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

Disentangling Memory and Reasoning Ability in Large Language Models

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Re-Reading Improves Reasoning in Large Language Models

Retrieval-augmented Multi-modal Chain-of-Thoughts Reasoning for Large Language Models

RATT: A Thought Structure for Coherent and Correct LLM Reasoning

Enhancing Large Language Models' Situated Faithfulness to External Contexts