Android apps and user feedback: a dataset for software evolution and quality improvement

Giovanni Grano,Andrea Di Sorbo,Francesco Mercaldo,Corrado A. Visaggio,Gerardo Canfora,Sebastiano Panichella
DOI: https://doi.org/10.1145/3121264.3121266
2017-09-05
Abstract:Nowadays, Android represents the most popular mobile platform with a market share of around 80%. Previous research showed that data contained in user reviews and code change history of mobile apps represent a rich source of information for reducing software maintenance and development effort, increasing customers' satisfaction. Stemming from this observation, we present in this paper a large dataset of Android applications belonging to 23 different apps categories, which provides an overview of the types of feedback users report on the apps and documents the evolution of the related code metrics. The dataset contains about 395 applications of the F-Droid repository, including around 600 versions, 280,000 user reviews and more than 450,000 user feedback (extracted with specific text mining approaches). Furthermore, for each app version in our dataset, we employed the Paprika tool and developed several Python scripts to detect 8 different code smells and compute 22 code quality indicators. The paper discusses the potential usefulness of the dataset for future research in the field.
What problem does this paper attempt to address?