Myth: The loss of core developers is a critical issue for OSS communities

Olivier Nourry,Masanari Kondo,Shinobu Saito,Yukako Iimura,Naoyasu Ubayashi,Yasutaka Kamei
2024-11-30
Abstract:Throughout their lifetime, open-source software systems will naturally attract new contributors and lose existing contributors. Not all OSS contributors are equal, however, as some contributors within a project possess significant knowledge and expertise of the codebase (i.e., core developers). When investigating the ability of projects to attract new contributors and how often a project loses contributors, it is therefore important to take into account the expertise of the contributors. Since core developers are vital to the longevity of projects, we therefore aim to find out: can OSS projects attract new core developers and how often do OSS projects lose core developers? To investigate core developer contribution patterns, we calculate the truck factor (or bus factor) of over 36,000 OSS projects to investigate how often TF developers join or abandon OSS projects. We find that 89% of our studied projects have experienced losing their core development team at least once. Our results also show that in 70% of cases, this project abandonment happens within the first three years of the project life. We also find that most OSS projects rely on a single core developer to maintain development activities. Finally, we find that only 27% of projects that were abandoned were able to attract at least one new TF developer. Our analysis shows that it is not uncommon for OSS projects to lose their initial core development team. This is likely due to most OSS project relying on a single core developer to maintain development activities. The first year of development is critical for OSS projects since this is where they are most at risk of losing their core developer(s). Additionally, projects that lose their core developer(s) early seem less likely to survive this event than projects that lost their core developers later on during their life.
Software Engineering
What problem does this paper attempt to address?
The problems that this paper attempts to solve are: whether the loss of core developers in open - source software (OSS) projects is a critical issue, and whether these projects can attract new core developers to maintain the activity and development of the projects. Specifically, the main research objectives are: 1. **RQ1: How common is TFDD in GitHub projects?** - By calculating the truck factor (Truck Factor, TF) of more than 36,000 OSS projects, the researchers want to know how many of these projects have experienced the situation where all core developers leave (i.e., Truck Factor Developer Detachment, TFDD). 2. **RQ2: How often can open - source projects survive after experiencing TFDD?** - The researchers hope to understand how many projects can re - attract new core developers to continue development activities after all core developers have left. 3. **RQ3: What are the differences between surviving projects and non - surviving projects?** - The researchers hope to find the differences between the two by comparing the development activity indicators (such as the number of commits, the number of files, the number of contributors, and project age, etc.) of surviving projects and non - surviving projects. ### Research Background The open - source software ecosystem has become an important pillar of software development. Many key open - source projects (such as Linux, Kubernetes, Docker, TensorFlow, Apache HTTP Server, etc.) rely on a small number of core developers to maintain the codebase and add new features. Therefore, the loss of core developers may have a significant impact on the continuous development of the project. To better understand this phenomenon, the researchers use the "truck factor" as an indicator to measure the vulnerability of the project. ### Main Findings - **The Universality of TFDD**: 89.65% of the research projects have experienced at least one situation where all core developers leave. - **The Time Distribution of TFDD**: 70% of the projects experienced the first TFDD within the first three years, and most TFDDs occurred in the first development year of the project. - **The Vulnerability of the Project**: Most projects rely on a single core developer to maintain development activities. - **The Survival Rate of the Project**: Only 27.07% of the projects can survive after experiencing TFDD and attract new core developers. - **The Characteristics of Surviving Projects**: Surviving projects are usually more mature (i.e., older) than non - surviving projects, and have more commit records and contributors when TFDD occurs. ### Conclusion The research shows that open - source projects are most vulnerable to the risk of core developer loss in the early stage, and many projects rely on a single core developer to maintain development activities. Although some projects can survive after experiencing core developer loss, this is not common. More mature projects are more likely to attract new developers when facing TFDD. ### Formula Explanation The truck factor (Truck Factor, TF) mentioned in the article is an indicator of project vulnerability, which is defined as: \[ \text{Truck Factor} = \min\left( k \mid \sum_{i = 1}^{k} C_i\geq T \right) \] where \( C_i \) represents the contribution degree of the \( i \) - th developer, and \( T \) is a threshold representing the maximum contribution loss that the project can bear. When all truck factor developers stop contributing, the project is considered to be at risk.