Abstract:Context In Empirical Software Engineering, it is crucial to work with representative samples that reflect the current state of the software industry. An important consideration, especially in rapidly changing fields like software development, is that if we use a sample collected years ago, it should continue to represent the same population in the present day to produce generalizable results. However, it is seldom the case in which a software sample built several years ago accurately depicts the current state of the development industry. Nevertheless, many recent studies rely on rather old datasets (seven or more years of age) to conduct their investigations. Objective To analyze the evolution of a population of open-source projects, determine the likelihood of detecting significant differences over time, and study the activity history of the projects. Method We performed a longitudinal study with 72 snapshots of quality projects from Github, covering the period between July 1st 2017 and June 1st 2023. We recorded monthly values of seven repository metrics (contributors, commits, closed pull-requests, merged pull-requests, closed issues, number of stars and forks), encompassing data from a total of 1991 repositories. Results We observed significant changes in all the metrics evaluated, with most cases showing negligible to small effect sizes. Notably, merged pull-requests registered medium effect sizes. The evolution was not equal in all the metrics, however, after five years it was unlikely that a sample of projects remained representative for any of the analyzed metrics, showing probabilities below 25%. Conclusion Although the temporal validity of a sample depends on the specific data being studied, employing datasets created several years ago does not appear to be a sound strategy if the aim is to produce results that can be extrapolated to the current state of the population.

Revisiting Aristotle vs. Ringelmann: The influence of biases on measuring productivity in Open Source software development

Revisiting Linus's Law: Benefits and Challenges of Open Source Software Peer Review.

Influence of Communication Among Shared Developers on the Productivity of Open Source Software Projects

Mind the Gap: On the Relationship Between Automatically Measured and Self-Reported Productivity

A Replication Study on Measuring the Growth of Open Source

Measuring the Effect of Social Communications on Individual Working Rhythms: A Case Study of Open Source Software

Sources of Underproduction in Open Source Software

A longitudinal study on the temporal validity of software samples

Software development practices in academia: a case study comparison

Collaboration Drives Individual Productivity

Characterizing the Roles of Contributors in Open-source Scientific Software Projects

Model Contribution Rate Theory: An Empirical Examination

Assessing Code Authorship: The Case of the Linux Kernel

A comparison of two approaches for measuring interdisciplinary research output: the disciplinary diversity of authors vs the disciplinary diversity of the reference list

The Productivity Effects of Generative AI: Evidence from a Field Experiment with GitHub Copilot

Long-Term Productivity Based on Science, not Preference

Communication and Code Dependency Effects on Software Code Quality: An Empirical Analysis of Herbsleb Hypothesis

Diversity, Productivity, and Growth of Open Source Developer Communities

How Firms Adapt and Interact in Open Source Ecosystems: Analyzing Stakeholder Influence and Collaboration Patterns

On the variation and specialisation of workload—A case study of the Gnome ecosystem community

The Impact of Generative AI on Collaborative Open-Source Software Development: Evidence from GitHub Copilot