Semantic HMC: Ontology-Described Hierarchy Maintenance in Big Data Context

Rafael Peixoto,Christophe Cruz,Nuno Silva
DOI: https://doi.org/10.1007/978-3-319-26138-6_53
2015-01-01
Abstract:One of the biggest challenges in Big Data is the exploitation of Value from large volumes of data that are constantly changing. To exploit value, one must focus on extracting knowledge from these Big Data sources. To extract knowledge and value from unstructured text we propose using a Hierarchical Multi-Label Classification process called Semantic HMC that uses ontologies to describe the predictive model including the label hierarchy and the classification rules. To not overload the user, this process automatically learns the ontology-described label hierarchy from a very large set of text documents. This paper aims to present a maintenance process of the ontology-described label hierarchy relations with regards to a stream of unstructured text documents in the context of Big Data that incrementally updates the label hierarchy.
What problem does this paper attempt to address?