An Examination of the Use of Large Language Models to Aid Analysis of Textual Data

Robert H. Tai, Lillian R. Bentley, Xin Xia, Jason M. Sitt, Sarah C. Fankhauser, Ana M. Chicas-Mosier, Barnas G. Monteith
DOI: https://doi.org/10.1177/16094069241231168
2024-01-01
International Journal of Qualitative Methods
Abstract: The increasing use of machine learning and Large Language Models (LLMs) opens up opportunities to use these artificially intelligent algorithms in novel ways. This article proposes a methodology using LLMs to support traditional deductive coding in qualitative research. We began our analysis with three different sample texts taken from existing interviews. Next, we created a codebook and entered the sample text and codebook into an LLM. We asked the LLM to determine whether the codes were present in the sample text provided and requested evidence to support the coding. The sample texts were entered 160 times to record changes between iterations of the LLM response. Each iteration was analogous to a new coder deductively analyzing the text with the codebook information. In our results, we present the outputs for these recursive analyses, along with a comparison of the LLM coding to evaluations made by human coders using traditional coding methods. We argue that LLM analysis can aid qualitative researchers by deductively coding transcripts, providing a systematic and reliable platform for code identification, and offering a means of avoiding analysis misalignment. Implications of using LLMs in research praxis are discussed, along with current limitations.
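The repeated-coding procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the example codebook, the 0.5 threshold, and the keyword-matching `keyword_coder` are all hypothetical; the last is a deterministic stand-in for an LLM call, which the abstract does not specify. Only the overall shape comes from the abstract: a codebook and a text are submitted repeatedly (160 times), and the resulting code assignments are compared with a human coder's.

```python
from collections import Counter
from typing import Callable, Dict, Set

# A "coder" takes a text and a codebook and returns the codes it judges present.
# In practice this would wrap an LLM request; here it is a hypothetical stand-in.
Coder = Callable[[str, Dict[str, str]], Set[str]]


def code_frequencies(text: str, codebook: Dict[str, str],
                     coder: Coder, iterations: int = 160) -> Dict[str, float]:
    """Run the coder repeatedly and return, per code, the share of runs that flagged it."""
    counts: Counter = Counter()
    for _ in range(iterations):
        present = coder(text, codebook)
        # Ignore anything the coder returns that is not in the codebook.
        counts.update(set(present) & set(codebook))
    return {code: counts[code] / iterations for code in codebook}


def agreement_with_human(freqs: Dict[str, float], human_codes: Set[str],
                         threshold: float = 0.5) -> Dict[str, bool]:
    """Per code, report whether the majority decision matches the human coder."""
    return {code: (freq >= threshold) == (code in human_codes)
            for code, freq in freqs.items()}


def keyword_coder(text: str, codebook: Dict[str, str]) -> Set[str]:
    """Hypothetical stand-in for an LLM: flags a code if its name appears in the text."""
    lowered = text.lower()
    return {code for code in codebook if code in lowered}


# Hypothetical codebook and sample text, for illustration only.
codebook = {
    "mentorship": "Participant describes guidance from a mentor.",
    "curiosity": "Participant describes interest-driven exploration.",
}
sample_text = "My mentorship experience in the lab was formative."

freqs = code_frequencies(sample_text, codebook, keyword_coder, iterations=160)
matches = agreement_with_human(freqs, human_codes={"mentorship"})
```

With a real LLM behind the coder, the per-code frequencies would generally fall between 0 and 1, and that spread across iterations is the variation the repeated submissions are meant to record.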
social sciences, interdisciplinary