Web accessibility sampling method based on node topology characteristics

Fei GAO,Rong-hua CHEN,Jia-jun BU,Zhi YU,Ying-han WANG,Tian TIAN
DOI: https://doi.org/10.3785/j.issn.1008-973X.2017.10.002
2017-01-01
Abstract:As the existing sampling methods for web accessibility evaluation could not provide the samples which could give good representation of the entire website,the sampling methods could not reflect the distribution characteristics of the website sample data,which lead to some problems that make big sampling errors.A novel interval sampling algorithm based on the node's topological characteristics was proposed starting with the topological structure between web nodes in order to solve the problem.Each page was treated as a node and the similarity topological graph between web pages was constructed by the KNN-Graph algorithm.Then the importance of each node was obtained by its local and global topological characteristics and was sorted to get an orderly sequence of all the pages.The pages with interval sampling algorithm were chosen based on the sorting results.The method can achieve distributed sampling in different topological regions.The experimental data on real disabled person federation website shows that the method can achieve better results by obtaining smaller mean errors and more extensive distribution of the samples than other algorithms.
What problem does this paper attempt to address?