A Method of Node Importance Measurement in Software Network Based on the Variations of h-Index

Yi DING, Bing LI, Can CHENG, Yuqi ZHAO
DOI: https://doi.org/10.15961/j.jsuese.201601182
2017-01-01
Abstract: When new members join the development and maintenance of a software project, they usually need to spend much time understanding the architecture and functions of the system. To help them understand and quickly grasp a software system, some key classes are generally given priority and recommended as early as possible. A large number of studies have shown that software systems exhibit a clear complex-network topology. Therefore, we can build software network models and then identify important classes in software systems by means of network node importance measurement, so as to help new members quickly master the core structure and functions of the system. Many methods have previously been proposed for measuring the importance of nodes in a network, but most consider only the degree of neighbor nodes or the weight of edges. Although the h-index has been successfully applied to evaluate the productivity of scholars, little is known about whether it is suitable for identifying key classes in weighted software networks. In this paper, based on node degree and edge weight, three variations of the h-index (i.e., H-NWD, A-NWD, and G-NWD) are proposed to measure the importance of classes in three open-source software projects (i.e., Jung, Ant, and Maven), each modeled as a weighted software network. The feasibility of the proposed measures was validated by comparing them with five existing centrality measures for complex networks (i.e., degree centrality, betweenness centrality, closeness centrality, eigenvector centrality, and PageRank centrality). The results showed that H-NWD and G-NWD were effective in identifying key classes, and that their intersection with the existing metrics exceeded 80%. With respect to class modifications, the important class nodes identified by H-NWD had much higher rank values, and the other class nodes identified by H-NWD were modified more frequently. Compared with the existing indicators, H-NWD was more accurate in identifying key classes.
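The abstract does not give the definitions of H-NWD, A-NWD, or G-NWD, so the following is only a minimal sketch of the general idea of applying an h-index-style score to a weighted network. It assumes, purely for illustration, that a node's score is the largest h such that at least h of its neighbors have a weighted degree (sum of incident edge weights) of at least h. The function names, the toy graph, and the class labels are hypothetical and are not taken from the paper.

```python
def weighted_degree(graph, node):
    """Sum of the weights of all edges incident to `node`."""
    return sum(graph[node].values())


def h_index(values):
    """Largest h such that at least h of the values are >= h."""
    h = 0
    for rank, value in enumerate(sorted(values, reverse=True), start=1):
        if value >= rank:
            h = rank
        else:
            break
    return h


def neighbor_h_index(graph, node):
    """h-index over the weighted degrees of a node's neighbors.

    Illustrative assumption only; this is not the paper's H-NWD definition.
    """
    return h_index(weighted_degree(graph, nb) for nb in graph[node])


# Hypothetical undirected weighted network: keys are class names,
# values map each neighboring class to the weight of the shared edge.
toy_graph = {
    "A": {"B": 3, "C": 2, "D": 1},
    "B": {"A": 3, "C": 1},
    "C": {"A": 2, "B": 1},
    "D": {"A": 1},
}

if __name__ == "__main__":
    ranking = sorted(toy_graph, key=lambda n: neighbor_h_index(toy_graph, n), reverse=True)
    for name in ranking:
        print(name, neighbor_h_index(toy_graph, name))
```

Unlike plain degree, a score of this kind is capped by how many well-connected neighbors a node has, so one very heavy edge alone cannot raise it. A ranking like this could then be compared against degree or PageRank rankings in the way the abstract describes.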