Unique Topic Query Processing On Cloud

Lvhong Liu,Zhihui Yang,Zhenying He,Yinan Jing,Xiaoyang Sean Wang
DOI: https://doi.org/10.1109/CSCloud/EdgeCom.2018.00026
2018-01-01
Abstract:With the rapid growth of text data, text analysis plays an increasingly important role for various applications. Tranditional database management systems provide basic data operations for various users. But most of them only provide data access and basic calculation for relational datasets, we can not perform text analysis operations on them directly. To support users analyzing unstructured text, this paper designs a set of SQL-like information extraction and unique topic query language for users to extract information and find out unique topics on unstructured data. This paper proposes a unique topic query processing system on cloud based on relational information extraction with Spark as cloud infrastructure.
What problem does this paper attempt to address?