Research and Implementation of the Muti-Task-Parallel Scheduling Loading Technology of Massive Text Data

Guangqiang Chen,Shuqiang Yang,Xiaohui Zhang,Runheng Li,Yan Jia
2009-01-01
Journal of Computer Research and Development
Abstract:The scale of text data increases more and more rapidly and it poses enormous challenge to traditional database technology,such as data storage,and data loading,etc.MDMP(massive data management platform)is developed to store and manage the huge volume information.The loading process of the huge text data is divided into some smaller independent sub-tasks by its attributes,so it can be scheduled parallely.The multi-task-parallel scheduling loading technique is used in the platform,thus it provides high data loading rates.The multi-task-parallel scheduling loading techniques in MDMP is mainly studied in this paper.
What problem does this paper attempt to address?