Augmented and Programmatically Optimized LLM Prompts Reduce Chemical Hallucinations

Scott Reed
DOI: https://doi.org/10.26434/chemrxiv-2025-rwgt8
2025-01-02
Abstract:Utilizing Large Language Models (LLMs) for handling scientific information comes with risk of the outputs not matching expectations, commonly called hallucinations. To fully utilize LLMs in research requires improving their accuracy, avoiding hallucinations, and extending their scope to research topics outside their direct training. There is also a benefit to getting the most accurate information from an LLM at the time of inference without having to create and train custom new models for each application. Here, augmented generation and machine learning driven prompt optimization are combined to extract performance improvements over base LLM function on a common chemical research task. Specifically, an LLM was used to predict the topological polar surface area (TPSA) of molecules. By using augmented generation and machine learning optimized prompts, the error in the prediction was reduced to 7.44 root mean squared error (RMSE) from 59.41 RMSE with direct calls to the same LLM.
Chemistry
What problem does this paper attempt to address?