Chaining thoughts and LLMs to learn DNA structural biophysics

Tyler D. Ross,Ashwin Gopinath
2024-03-03
Abstract:The future development of an AI scientist, a tool that is capable of integrating a variety of experimental data and generating testable hypotheses, holds immense potential. So far, bespoke machine learning models have been created to specialize in singular scientific tasks, but otherwise lack the flexibility of a general purpose model. Here, we show that a general purpose large language model, chatGPT 3.5-turbo, can be fine-tuned to learn the structural biophysics of DNA. We find that both fine-tuning models to return chain-of-thought responses and chaining together models fine-tuned for subtasks have an enhanced ability to analyze and design DNA sequences and their structures.
Quantitative Methods,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
This paper discusses how to use large language models, such as ChatGPT, to learn and understand the structural biophysical properties of DNA. The research improves the ability to analyze and design DNA sequences and their structures through fine-tuning the model for chain thinking and task chaining. The goal is to develop an AI scientist tool that can integrate experimental data and generate testable hypotheses. Currently, there are machine learning models specifically designed for individual scientific tasks, but they lack generality. The paper demonstrates through experiments that general-purpose language models can be trained to simulate physical phenomena, such as DNA structure formation.