ERE: Entity Relationship Extraction System Based on Semi-structured Web Pages

Dong YU,Nuo LI,Derong SHEN,Nan TANG,Hongbin XU,Yue KOU,Ge YU
DOI: https://doi.org/10.3969/j.issn1672-9722.2014.09.010
2014-01-01
Abstract:In traditional methods,researchers use extraction pattern to extract entity relationships in text fragments that have complete semantic information.And they use heuristic algorithms or probabilistic models to choose the extracted candidate relationships.As for the semi-structured web pages,these methods become less applicable because the information of the entities is shown in some html modules where the semantic information is not complete.In this paper,an entity relationship extraction system that can solve the problem perfectly is propsoed.The system is composed of four functional modules:data extraction rule learning module,data extraction module,entity relationship compute module and entity relationship base query module.Firstly,users give a key word and choose a matching type.And the system will query the entity information base and find some entities that meet the conditions.Then the system will query the entity relationship base with the entities founded previously.Finally,the relationships that contain the entities will be returned to users.
What problem does this paper attempt to address?