Abstract:Large language models (LLMs) have shown remarkable potential in various domains, but they often lack the ability to access and reason over domain-specific knowledge and tools. In this paper, we introduced CACTUS (Chemistry Agent Connecting Tool-Usage to Science), an LLM-based agent that integrates cheminformatics tools to enable advanced reasoning and problem-solving in chemistry and molecular discovery. We evaluate the performance of CACTUS using a diverse set of open-source LLMs, including Gemma-7b, Falcon-7b, MPT-7b, Llama2-7b, and Mistral-7b, on a benchmark of thousands of chemistry questions. Our results demonstrate that CACTUS significantly outperforms baseline LLMs, with the Gemma-7b and Mistral-7b models achieving the highest accuracy regardless of the prompting strategy used. Moreover, we explore the impact of domain-specific prompting and hardware configurations on model performance, highlighting the importance of prompt engineering and the potential for deploying smaller models on consumer-grade hardware without significant loss in accuracy. By combining the cognitive capabilities of open-source LLMs with domain-specific tools, CACTUS can assist researchers in tasks such as molecular property prediction, similarity searching, and drug-likeness assessment. Furthermore, CACTUS represents a significant milestone in the field of cheminformatics, offering an adaptable tool for researchers engaged in chemistry and molecular discovery. By integrating the strengths of open-source LLMs with domain-specific tools, CACTUS has the potential to accelerate scientific advancement and unlock new frontiers in the exploration of novel, effective, and safe therapeutic candidates, catalysts, and materials. Moreover, CACTUS's ability to integrate with automated experimentation platforms and make data-driven decisions in real time opens up new possibilities for autonomous discovery.

Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst Design

ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical Feedback

Structured Chemistry Reasoning with Large Language Models

No Train Still Gain. Unleash Mathematical Reasoning of Large Language Models with Monte Carlo Tree Search Guided by Energy Function

When Do Program-of-Thought Works for Reasoning?

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Let's Be Self-generated via Step by Step: A Curriculum Learning Approach to Automated Reasoning with Large Language Models

Rational Metareasoning for Large Language Models

Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language Models

$T^2$ of Thoughts: Temperature Tree Elicits Reasoning in Large Language Models

Reasoning with Large Language Models, a Survey

CACTUS: Chemistry Agent Connecting Tool-Usage to Science

CataLM: Empowering Catalyst Design Through Large Language Models

Bayesian Optimization of Catalysts With In-context Learning

Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design

Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo

Interpretable Contrastive Monte Carlo Tree Search Reasoning

Modelling Chemical Reasoning to Predict Reactions

Break the Chain: Large Language Models Can be Shortcut Reasoners

ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting

Combinatorial Reasoning: Selecting Reasons in Generative AI Pipelines via Combinatorial Optimization