Demystifying Issues, Causes and Solutions in LLM Open-Source Projects

Yangxiao Cai,Peng Liang,Yifei Wang,Zengyang Li,Mojtaba Shahin
2024-09-25
Abstract:With the advancements of Large Language Models (LLMs), an increasing number of open-source software projects are using LLMs as their core functional component. Although research and practice on LLMs are capturing considerable interest, no dedicated studies explored the challenges faced by practitioners of LLM open-source projects, the causes of these challenges, and potential solutions. To fill this research gap, we conducted an empirical study to understand the issues that practitioners encounter when developing and using LLM open-source software, the possible causes of these issues, and potential solutions.We collected all closed issues from 15 LLM open-source projects and labelled issues that met our requirements. We then randomly selected 994 issues from the labelled issues as the sample for data extraction and analysis to understand the prevalent issues, their underlying causes, and potential solutions. Our study results show that (1) Model Issue is the most common issue faced by practitioners, (2) Model Problem, Configuration and Connection Problem, and Feature and Method Problem are identified as the most frequent causes of the issues, and (3) Optimize Model is the predominant solution to the issues. Based on the study results, we provide implications for practitioners and researchers of LLM open-source projects.
Software Engineering,Artificial Intelligence
What problem does this paper attempt to address?
This paper aims to address the issues encountered in the development and use of large language models (LLMs) in open-source projects, and to explore the root causes and potential solutions to these issues. Specifically: 1. **Research Background and Objectives**: - With the development of large language models, an increasing number of open-source software projects are incorporating LLMs as their core functional components. Despite significant progress in the research and practice of LLMs, there is currently a lack of research specifically focused on the challenges, causes, and solutions faced in LLMs open-source projects. - To fill this research gap, the authors conducted an empirical study to understand the problems developers encounter when developing and using LLMs open-source software, identify the root causes of these problems, and propose possible solutions. 2. **Main Findings**: - The study results indicate that **model issues** are the most common type of problems, including runtime issues, architecture design issues, loading issues, training issues, etc. - The main causes of these problems include model issues, configuration and connection issues, and feature and method issues. - The most commonly used solution is to optimize the model. 3. **Research Contributions**: - By collecting and analyzing nearly 1000 closed issues from 15 LLMs open-source projects on GitHub, the authors provided a two-level classification system for these issues and categorized the causes and solutions of these issues. - The study also provided mappings between identified issues and their causes, as well as between issues and their solutions. 4. **Research Methods**: - Data Collection: 15 LLMs open-source projects meeting specific criteria were selected from GitHub, and all closed issues were collected. - Data Annotation and Sampling: The collected issues were annotated, and 994 samples were randomly selected for detailed analysis. - Data Extraction: A series of data items were defined to extract relevant information about issues, causes, and solutions. - Data Analysis: Qualitative data analysis was conducted using open coding and constant comparison methods. 5. **Research Results**: - It was found that model issues are the most frequently encountered problem category by developers, followed by component issues and parameter issues. - By categorizing the types of issues, the study revealed the most common problems and their causes in LLMs open-source projects and proposed corresponding solutions. Through this study, the authors hope to provide valuable insights for practitioners and researchers of LLMs open-source projects, helping them better understand and solve the problems encountered in actual development.