Abstract:Large Language Models (LLMs) prompted to generate chain-of-thought (CoT) exhibit impressive reasoning capabilities. Recent attempts at prompt decomposition toward solving complex, multi-step reasoning problems depend on the ability of the LLM to simultaneously decompose and solve the problem. A significant disadvantage is that foundational LLMs are typically not available for fine-tuning, making adaptation computationally prohibitive. We believe (and demonstrate) that problem decomposition and solution generation are distinct capabilites, better addressed in separate modules, than by one monolithic LLM. We introduce DaSLaM, which uses a decomposition generator to decompose complex problems into subproblems that require fewer reasoning steps. These subproblems are answered by a solver. We use a relatively small (13B parameters) LM as the decomposition generator, which we train using policy gradient optimization to interact with a solver LM (regarded as black-box) and guide it through subproblems, thereby rendering our method solver-agnostic. Evaluation on multiple different reasoning datasets reveal that with our method, a 175 billion parameter LM (text-davinci-003) can produce competitive or even better performance, compared to its orders-of-magnitude larger successor, GPT-4. Additionally, we show that DaSLaM is not limited by the solver's capabilities as a function of scale; e.g., solver LMs with diverse sizes give significant performance improvement with our solver-agnostic decomposition technique. Exhaustive ablation studies evince the superiority of our modular finetuning technique over exorbitantly large decomposer LLMs, based on prompting alone.

ToM-LM: Delegating Theory of Mind Reasoning to External Symbolic Executors in Large Language Models

Concise and Organized Perception Facilitates Large Language Models for Deductive Reasoning.

HI-TOM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models

TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind

Constrained Reasoning Chains for Enhancing Theory-of-Mind in Large Language Models

Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning

Theory of Mind for Multi-Agent Collaboration via Large Language Models

Zero, Finite, and Infinite Belief History of Theory of Mind Reasoning in Large Language Models

Theory of Mind in Large Language Models: Examining Performance of 11 State-of-the-Art models vs. Children Aged 7-10 on Advanced Tests

MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic

Probing the Robustness of Theory of Mind in Large Language Models

Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker

Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models

Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Thought

Think Twice: Perspective-Taking Improves Large Language Models' Theory-of-Mind Capabilities

Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning

Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph

Language Models Represent Beliefs of Self and Others

Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning

Faithful Logical Reasoning via Symbolic Chain-of-Thought