ChatGPT as a bioinformatic partner.

Gianluca Mondillo,Alessandra Perrotta,Simone Colosimo,Vittoria Frattolillo
DOI: https://doi.org/10.1101/2024.08.20.24312291
2024-08-20
Abstract:The advanced Large Language Model ChatGPT4o, developed by OpenAI, can be used in the field of bioinformatics to analyze and understand cross-reactive allergic reactions. This study explores the use of ChatGPT4o to support research on allergens, particularly in the cross-reactivity syndrome between cat and pork. Using a hypothetical clinical case of a child with a confirmed allergy to Fel d 2 (cat albumin) and Sus s 1 (pork albumin), the model guided data collection, protein sequence analysis, and three-dimensional structure visualization. Through the use of bioinformatics tools like SDAP 2.0 and BepiPRED, the epitope regions of the allergenic proteins were predicted, con-firming their accessibility to immunoglobulin E (IgE) and probability of cross-reactivity. The results show that regions with high epitope probability exhibit high surface accessibility and predominantly coil and helical structures. The construction of a phylogenetic tree further sup-ported the evolutionary relationships among the studied allergens. ChatGPT4o has demonstrated its usefulness in guiding non-specialist researchers through complex bioinformatics processes, making advanced science accessible and improving analytical and innovation capabilities.
Health Informatics
What problem does this paper attempt to address?
The paper attempts to address the issue of using ChatGPT4o, an advanced large language model, to analyze and understand cross-reactivity in allergic reactions within the field of bioinformatics, particularly in the context of Pork-Cat Syndrome, which involves cross-reactivity between cats and pork. Specifically, the study demonstrates how to use ChatGPT4o for data collection, protein sequence analysis, and 3D structure visualization through a hypothetical clinical case—a child allergic to the cat allergen Fel d 2 who experiences an allergic reaction after consuming pork and is subsequently found to be allergic to the pork allergen Sus s 1. The study employed various bioinformatics tools, such as SDAP 2.0 and BepiPRED, to predict allergen epitope regions and confirm the accessibility of these regions to Immunoglobulin E (IgE) and the potential for cross-reactivity. The results indicated that regions with high epitope probability exhibited higher surface accessibility and were primarily composed of coiled and helical structures. Additionally, the constructed phylogenetic tree further supported the evolutionary relationships between the studied allergens. Overall, this paper aims to demonstrate the practicality of ChatGPT4o in guiding non-expert researchers through complex bioinformatics workflows, making advanced scientific research more accessible, and enhancing analytical and innovative capabilities.