A Comparative Survey of Big Data Computing and HPC: From a Parallel Programming Model to a Cluster Architecture

Fei Yin,Feng Shi
DOI: https://doi.org/10.1007/s10766-021-00717-y
2021-05-26
International Journal of Parallel Programming
Abstract:With the rapid growth of artificial intelligence (AI), the Internet of Things (IoT) and big data, emerging applications that cross stacks with different techniques bring new challenges to parallel computing systems. These cross-stack functionalities require one system to possess multiple characteristics, such as the ability to process data under high throughput and low latency, the ability to carry out iterative and incremental computation, transparent fault tolerance, and the ability to perform heterogeneous tasks that evolve dynamically. However, high-performance computing (HPC) and big data computing, as two categories of parallel computing architecture, are incapable of meeting all these requirements. Therefore, by performing a comparative analysis of HPC and big data computing from the perspective of the parallel programming model layer, middleware layer, and infrastructure layer, we explore the design principles of the two architectures and discuss a converged architecture to address the abovementioned challenges.
computer science, theory & methods
What problem does this paper attempt to address?