Pilot study on analyzing semantic web data

Jun Ye, Yuzhong Qu
2008-01-01
Abstract: To explore the development status of the semantic web, 9.859636 × 10^6 semantic web documents (SWDs) are crawled, and their distributions over websites and sizes, as well as their use of namespaces, are analyzed using complex network analysis methods. The results show that the distribution of SWDs over websites follows a power law with an exponent of 0.5304. The size distribution of SWDs also follows a power law, with an exponent of 1.4071. In addition, SWDs are unevenly distributed across countries. Compared with two years earlier, the number of SWDs has increased greatly. The power-law exponent of the website distribution has decreased from 0.6515 to 0.5304, whereas the power-law exponent of the size distribution has increased from 1.1833 to 1.4071. The use of namespaces has also changed notably.
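As an illustration of what a power-law exponent such as 0.5304 means in practice, the following is a minimal sketch that recovers an exponent from data by least-squares regression on log-log axes. The abstract does not state how the exponents were fitted, so the fitting method, the function name `fit_power_law`, and the synthetic rank/count data are all assumptions made for this example, not the authors' procedure.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = c * x**(-alpha) by least squares on log-log axes.

    Hypothetical helper for illustration only; returns (alpha, c).
    Points with non-positive x or y are skipped because their
    logarithm is undefined.
    """
    pts = [(math.log(x), math.log(y)) for x, y in zip(xs, ys) if x > 0 and y > 0]
    if len(pts) < 2:
        raise ValueError("need at least two positive data points")

    n = len(pts)
    mean_x = sum(lx for lx, _ in pts) / n
    mean_y = sum(ly for _, ly in pts) / n

    sxx = sum((lx - mean_x) ** 2 for lx, _ in pts)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((lx - mean_x) * (ly - mean_y) for lx, ly in pts)

    slope = sxy / sxx                      # slope of the log-log line
    alpha = -slope                         # power-law exponent
    c = math.exp(mean_y - slope * mean_x)  # scale factor
    return alpha, c


# Synthetic data (assumed, not from the paper): the website at rank r
# hosts 1000 * r**(-0.5304) documents.
ranks = list(range(1, 201))
counts = [1000.0 * r ** -0.5304 for r in ranks]

alpha, c = fit_power_law(ranks, counts)
print(f"alpha = {alpha:.4f}, c = {c:.1f}")
```

On noise-free synthetic data the regression recovers the exponent used to generate it. On real crawl data, log-log regression is known to be sensitive to the sparse tail, so maximum-likelihood estimators are often preferred.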