Wave: Trigger Based Synchronous Data Process System.

Kun Lu,Mingming Sun,Changlong Li,Hang Zhuang,Jinhong Zhou,Xuehai Zhou
DOI: https://doi.org/10.1109/ccgrid.2014.124
2014-01-01
Abstract:With the rapid development of cloud computing, more and more applications need to process large amount of data on clusters. Different types of data processing frameworks in cloud have appeared, such as MapReduce, Spark and Percolator. These frameworks are used to a certain type of data processing. In this paper, we provide processing framework called Wave, which is designed for bulk data processing, incremental computing and iterative processing with a uniform application interface. Wave is an event driven data process model for semi-structured data of distributed systems. Programmers use events and trigger reactions to process the data. Wave provides simplified API for users to implements parallel programs on cluster. Programs running in Wave are automatically parallelized and executed on cluster synchronously. Wave uses an implicit mechanism to synchronize the parallel programs execution without any user specification.
What problem does this paper attempt to address?