Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review

Jinge Wang, Zien Cheng, Qiuming Yao, Li Liu, Dong Xu, Gangqing Hu
2024-06-12
Abstract: The year 2023 marked a significant surge in the exploration of applying large language model (LLM) chatbots, notably ChatGPT, across various disciplines. We surveyed the applications of ChatGPT in bioinformatics and biomedical informatics throughout the year, covering omics, genetics, biomedical text mining, drug discovery, biomedical image understanding, bioinformatics programming, and bioinformatics education. Our survey delineates the current strengths and limitations of this chatbot in bioinformatics and offers insights into potential avenues for future developments.
Other Quantitative Biology, Artificial Intelligence
What problem does this paper attempt to address?
This paper evaluates the application potential and limitations of large language models (LLMs), especially ChatGPT, in bioinformatics and biomedical informatics. Specifically, it reviews ChatGPT's applications during 2023 in the following areas:

1. **Omics**: It explores the novel application of GPT-4 to annotating cell types in single-cell RNA sequencing data and evaluates its performance on genomics tasks.
2. **Genetics**: It analyzes ChatGPT's application in genetic counseling, especially its use in administrative tasks, and its performance on human genetics multiple-choice questions.
3. **Biomedical Text Mining**: It evaluates ChatGPT's performance on tasks such as named-entity recognition, relation extraction, sentence similarity, document classification, and question answering, and explores how prompting strategies can improve that performance.
4. **Drug Discovery**: It studies ChatGPT's performance on tasks such as drug-disease association identification and drug-drug interaction prediction, and discusses the importance of a "human-in-the-loop" approach that incorporates human expert knowledge.
5. **Biomedical Image Understanding**: It examines the performance of GPT-4V (the vision-enabled version) on tasks such as medical visual question answering and biomedical image classification.
6. **Bioinformatics Programming**: It explores how ChatGPT can generate executable code from natural-language instructions, helping scientists without advanced programming skills carry out bioinformatics analyses.

Overall, the paper summarizes the current state of ChatGPT applications in bioinformatics and biomedical informatics, points out the chatbot's strengths and limitations, and suggests directions for future research and development.
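As a rough illustration of the cell-type annotation use case in the first item, the sketch below assembles a text prompt from per-cluster marker genes. It is not the method used in the reviewed studies: the function name `build_annotation_prompt`, the prompt wording, and the example clusters are all hypothetical, and no chatbot API is called. The resulting string would simply be pasted into, or sent to, a chatbot of the reader's choice.

```python
from typing import Dict, List


def build_annotation_prompt(markers: Dict[str, List[str]], tissue: str) -> str:
    """Build a plain-text prompt asking a chatbot to name one cell type per cluster.

    `markers` maps a cluster label to its top marker genes (hypothetical input
    format, assumed here for illustration only).
    """
    lines = [
        f"Identify the most likely cell type for each cluster from {tissue} "
        "single-cell RNA-seq data, using the marker genes listed.",
        "Answer with one line per cluster in the form 'cluster: cell type'.",
        "",
    ]
    # One line per cluster, listing its marker genes in the order given.
    for cluster, genes in markers.items():
        lines.append(f"{cluster}: {', '.join(genes)}")
    return "\n".join(lines)


# Hypothetical example input; the gene lists are illustrative, not study data.
example_markers = {
    "cluster_0": ["CD3D", "CD3E", "IL7R"],
    "cluster_1": ["MS4A1", "CD79A", "CD79B"],
}
prompt = build_annotation_prompt(example_markers, tissue="human PBMC")
print(prompt)
```

Keeping prompt construction separate from any model call makes the prompt easy to inspect and revise by hand, which fits the human-in-the-loop emphasis noted in the summary.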