Abstract:Background Artificial intelligence (AI) is evolving for healthcare services. Higher cognitive thinking in AI refers to the ability of the system to perform advanced cognitive processes, such as problem-solving, decision-making, reasoning, and perception. This type of thinking goes beyond simple data processing and involves the ability to understand and manipulate abstract concepts, interpret, and use information in a contextually relevant way, and generate new insights based on past experiences and accumulated knowledge. Natural language processing models like ChatGPT is a conversational program that can interact with humans to provide answers to queries. Objective We aimed to ascertain the capability of ChatGPT in solving higher-order reasoning in the subject of pathology. Methods This cross-sectional study was conducted on the internet using an AI-based chat program that provides free service for research purposes. The current version of ChatGPT (January 30 version) was used to converse with a total of 100 higher-order reasoning queries. These questions were randomly selected from the question bank of the institution and categorized according to different systems. The responses to each question were collected and stored for further analysis. The responses were evaluated by three expert pathologists on a zero to five scale and categorized into the structure of the observed learning outcome (SOLO) taxonomy categories. The score was compared by a one-sample median test with hypothetical values to find its accuracy. Result A total of 100 higher-order reasoning questions were solved by the program in an average of 45.31±7.14 seconds for an answer. The overall median score was 4.08 (Q1-Q3: 4-4.33) which was below the hypothetical maximum value of five (one-test median test p <0.0001) and similar to four (one-test median test p = 0.14). The majority (86%) of the responses were in the "relational" category in the SOLO taxonomy. There was no difference in the scores of the responses for questions asked from various organ systems in the subject of Pathology (Kruskal Wallis p = 0.55). The scores rated by three pathologists had an excellent level of inter-rater reliability (ICC = 0.975 [95% CI: 0.965-0.983]; F = 40.26; p < 0.0001). Conclusion The capability of ChatGPT to solve higher-order reasoning questions in pathology had a relational level of accuracy. Hence, the text output had connections among its parts to provide a meaningful response. The answers from the program can score approximately 80%. Hence, academicians or students can get help from the program for solving reasoning-type questions also. As the program is evolving, further studies are needed to find its accuracy level in any further versions.

Human-like problem-solving abilities in large language models using ChatGPT

Large language model, AI and scientific research: why ChatGPT is only the beginning

Is ChatGPT a General-Purpose Natural Language Processing Task Solver?

Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT

Challenging large language models' " intelligence" with human tools: A neuropsychological investigation in Italian language on prefrontal functioning

Enhancing Human-Computer Interaction through AI: A Study on ChatGPT in Educational Environments

Unraveling ChatGPT: A Critical Analysis of AI-Generated Goal-Oriented Dialogues and Annotations

ChatGPT or Human? Detect and Explain. Explaining Decisions of Machine Learning Model for Detecting Short ChatGPT-generated Text

Can Chat GPT solve a Linguistics Exam?

Transforming Conversations with AI—A Comprehensive Study of ChatGPT

Applicability of ChatGPT in Assisting to Solve Higher Order Problems in Pathology

AI-based chatbot interactions and critical thinking skills: an exploratory study

Extending the Frontier of ChatGPT: Code Generation and Debugging

Large Language Models: Their Success and Impact

Unlocking the Potential of ChatGPT: A Comprehensive Exploration of its Applications, Advantages, Limitations, and Future Directions in Natural Language Processing

Chatbots as Problem Solvers: Playing Twenty Questions with Role Reversals

ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about

Advancing Medical Practice with Artificial Intelligence: ChatGPT in Healthcare

Evaluating Capabilities of Large Language Models: Performance of GPT4 on Surgical Knowledge Assessments

ChatGPT: ascertaining the self-evident. The use of AI in generating human knowledge

ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models