COSPA: Identifying Key Classes in Object-Oriented Software Using Preference Aggregation
Xin Du,Tian Wang,Weifeng Pan,Muchou Wang,Bo Jiang,Yiming Xiang,Chunlai Chai,Jiale Wang,Chengxiang Yuan
DOI: https://doi.org/10.1109/access.2021.3105475
IF: 3.9
2021-01-01
IEEE Access
Abstract:Software comprehension helps us to maintain software better. The developers often use documentation and mutual communication to understand software. However, the incompletion of documentation usually causes some problems such that we cannot acquire the detailed information of software. So researchers are now using key classes as starting points for software understanding and propose several effective methods for key class identification. Such methods are mostly implemented using supervised and unsupervised learning methods; however, they may be subject to class imbalance, insufficient amount of data, and lack of accuracy. In this paper, we propose a novel approach, called COSPA (identifying key C lasses in O bject-Oriented S oftware using P reference A ggregation), to identify key classes in software. Our approach is inspired by the social choice theory and uses Kemeny-Young method to find a preference that minimizes the total difference between different pairs of sequences returned by three key class identification methods (i.e., a-index, InDeg, and OSE). Further, we rank the classes in descending order according to that preference set, and a small number of top-ranked classes are considered as key candidates. To better evaluate the performance of our approach, we apply it to six Java software systems and compare it with nine state-of-the-art approaches. In the experiments, we use Friedman to evaluate all methods in the whole set of research subjects; results show that, though no single method works always best in all subject systems, our method indeed performs better than others.