Research of Web semantic concept tree construction based on thesaurus and FCA

Ya-lin SUN,Lin-lin ZHAO,Xiao-ping YANG
DOI: https://doi.org/10.3969/j.issn.1001-3695.2014.11.025
2014-01-01
Abstract:In order to guide users to use well and improving websites’quality and construcing the Web semantic model,this pa-per presented a new approach and framework of learning from Web pages,and used formal concept analysis (FCA)to build the semantic concept tree.Firstly,it used information extraction and natural language processing tools to extract and segment texts, and then identified feature words by statistical methods.Secondly,it transformed feature words into thesaurus terms by using search-engine-based similarity calculation.Thirdly,it formed a formal context,and reduced the context by using rules,clustering and other techniques.Finally,it constructed concept lattice by using some algorithm to get hierarchy,which then transformed into the concept tree.Experimental results show that the concept tree can be used as the basis of Web ontology model,and have a pro-found signification for semantic assessment.The proposed algorithm has a certain value and referenced significance.
What problem does this paper attempt to address?