Typical Big Data Computing Frameworks

ZHAOSheng,JIANGJinlei
DOI: https://doi.org/10.3969/j.issn.1009-6868.2016.02.004
2016-01-01
Abstract:Big data computing technologies have two typical processing modes:batch computing and stream computing. Batch computing is mainly used for high-throughput processing of static data and does not produce results in real time. Stream computing is used for processing dynamic data online in real time but has difficulty providing a ful view of data. In this paper, we analyze some typical big data computing frameworks from the perspective of scalability, fault-tolerance, task scheduling, resource utilization, real time guarantee, and input/output (IO) overhead. We then points out some future trends and hot research topics.
What problem does this paper attempt to address?