Building and Exploring Marine Oriented Knowledge Graph for ZhouShan Library.
Tong Ruan, Haofen Wang, Fanghuai Hu, Jun Ding, Kai Lu
2014-01-01
Abstract: Paradigm Shift of Library Industry in China. As more and more readers prefer to access digital resources online, most libraries in China are building or strengthening their digital libraries. Several major content providers, such as WeiPu, WanFang, and ChaoXing, not only own large collections of digital journals, books, and magazines, but also run their own integrated platforms for search and navigation. Most libraries act only as consumers or distributors in the digital content supply chain, which leaves them suffering from serious homogenization, lack of content control, and weak competitiveness. These issues force libraries to search for new opportunities. Meanwhile, in early 2013, China's Ministry of Culture issued guidelines for building resource repositories dedicated to different sectors. It encouraged regions to develop thematic repositories according to their economic and cultural characteristics. ZhouShan Library has taken this chance and become a pioneer in making the transition. The ZhouShan Islands are listed as the first “state-level new district” centered on the marine economy. With the support of the local government, ZhouShan Library has started a project named “Universal Knowledge Repository for Marine Digital Library”. The intention is to help inhabitants and travelers learn about ZhouShan and the marine economy, and to support bureaus of the ZhouShan government, such as the Fishery Agency or the Economic and Information Commission, in running queries and statistics on the local marine economy. In this way, ZhouShan Library is changing from a content distributor to a content provider for the marine domain. The same change is happening at other regional libraries, leading to a paradigm shift in China’s library industry.
The Role of (Vertical) Knowledge Graph. Regarding the ZhouShan Library project, a marine repository should cover fishes, fishing grounds, fish processing methods, related researchers, and local enterprises. No single source covers all of these aspects, and users cannot be expected to integrate knowledge from various sources manually. In some cases, concepts or facts need to be extracted from semi-structured data (e.g., lists or tables from Web pages) and unstructured data (e.g., documents). In other cases, data from internal databases or from Linked Open Data (LOD) must be extracted, transformed, and loaded into the repository in a unified representation. Moreover, research institutes,