A Practical Algorithm for Converting Unstructured Hypertext to Structured Database

ZHENG Qing-Hua, YOU Yuan-xia, YUAN Wen-bin
DOI: https://doi.org/10.13328/j.cnki.jos.2001.02.002
2001-01-01
Journal of Software
Abstract: Hypertext is a kind of unstructured document, so hypertext documents cannot be searched by content and topic. However, hypertext is one of the most important ways of storing and organizing information on the Internet. To enable effective management and search of hypertext documents, this paper presents HtoDB, a new and practical method for converting unstructured hypertext into a structured database. The paper analyzes the requirements and functions of hypertext-to-database conversion and proposes a conversion model and algorithm based on graph theory. The model and algorithm have been verified in the “LU XUN digital library system” project.