Developing ChatGPT for Biology and Medicine: A Complete Review of Biomedical Question Answering

Qing Li,Lei Li,Yu Li
2024-01-21
Abstract:ChatGPT explores a strategic blueprint of question answering (QA) in delivering medical diagnosis, treatment recommendations, and other healthcare support. This is achieved through the increasing incorporation of medical domain data via natural language processing (NLP) and multimodal paradigms. By transitioning the distribution of text, images, videos, and other modalities from the general domain to the medical domain, these techniques have expedited the progress of medical domain question answering (MDQA). They bridge the gap between human natural language and sophisticated medical domain knowledge or expert manual annotations, handling large-scale, diverse, unbalanced, or even unlabeled data analysis scenarios in medical contexts. Central to our focus is the utilizing of language models and multimodal paradigms for medical question answering, aiming to guide the research community in selecting appropriate mechanisms for their specific medical research requirements. Specialized tasks such as unimodal-related question answering, reading comprehension, reasoning, diagnosis, relation extraction, probability modeling, and others, as well as multimodal-related tasks like vision question answering, image caption, cross-modal retrieval, report summarization, and generation, are discussed in detail. Each section delves into the intricate specifics of the respective method under consideration. This paper highlights the structures and advancements of medical domain explorations against general domain methods, emphasizing their applications across different tasks and datasets. It also outlines current challenges and opportunities for future medical domain research, paving the way for continued innovation and application in this rapidly evolving field.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the development of ChatGPT in the fields of biology and medicine to achieve medical diagnosis, treatment recommendations and other health support services. Specifically, the paper explores the methods and challenges of transforming General Domain Question Answering (GDQA) into Medical Domain Question Answering (MDQA) through natural language processing (NLP) and multimodal paradigms. The paper focuses on how to use language models and multimodal paradigms to improve the question - answering ability in the medical field, aiming to guide the research community to select technical mechanisms suitable for their specific medical research needs. In addition, the paper also outlines the current challenges faced in medical field research and future research opportunities, pointing out the direction for this rapidly developing field.