Abstract:As user-generated data diversifies, an increasing number of big data tasks now encompass unstructured data forms, including audio, imagery, and video content. The processing of these data often requires the support of GPU devices in the cluster. However, most of the existing cluster resource management frameworks lack effective decoupling of CPU resources and GPU resources, and the management granularity of GPU resources is too coarse, resulting in poor GPU sharing and low resource utilization. Given, for the inadequacy of the existing cluster resource management framework that cannot support batch stream integration and CPU-GPU resource scheduling, we design and implement a Hybrid Heterogeneous Resource Management (H-HRM) for CPU-GPU heterogeneous clusters. The resource queue binding mechanism on computing nodes provides flexible binding of CPU resources to GPU resources, which solves the problem that CPU resources and GPU resources in the cluster are difficult to decouple. According to the different uses of CPU or GPU as the main resource, a Hybrid Domain Resource Fairness (HDRF) model is proposed to realize the reasonable allocation of CPU resources and GPU resources. Through the queue stacking technology and the mechanism that multiple executors can run simultaneously on the queue, fine-grained sharing of GPU resources is realized and the utilization rate of GPU resources is improved. Finally, this paper realizes the docking of H-HRM and Spark programming framework and conducts real load tests, and after comparing with the performance of Mesos, it is proved that H-HRM can handle the mixed scenarios of batch jobs and stream processing jobs in the cluster, and the HDRF algorithm also greatly improves the utilization of the GPU.

Design and realization of hybrid resource management system for heterogeneous cluster

Online Scheduling of Mixed CPU-GPU Jobs

Task Scheduling for GPU Heterogeneous Cluster.

An Energy Efficient Task Scheduling Scheme for Heterogeneous GPU-enhanced Clusters

A user mode CPU–GPU scheduling framework for hybrid workloads

Task Scheduling Greedy Heuristics for GPU Heterogeneous Cluster Involving the Weights of the Processor

A Virtual Multi-Channel GPU Fair Scheduling Method for Virtual Machines.

Hybrid Map Task Scheduling for GPU-Based Heterogeneous Clusters

AN PARALLEL AND DISTRIBUTED PROGRAMMING SOLUTION BASED ON HETEROGENEOUS GPU CLUSTER

Energy Efficient Real-Time Task Scheduling on CPU-GPU Hybrid Clusters

Programming Framework for Node Heterogeneous GPU Cluster

Multiple Resources Scheduling for Diverse Workloads in Heterogeneous Datacenter

A parallel computing method for irregular work

Orchestrated Co-scheduling, Resource Partitioning, and Power Capping on CPU-GPU Heterogeneous Systems via Machine Learning

A MapReduce Computing Framework Based on GPU Cluster.

StreamMAP: Automatic Task Assignment System on GPU Cluster

Scheduling Distributed Deep Learning Jobs in Heterogeneous Cluster with Placement Awareness

HPH: Hybrid Parallelism on Heterogeneous Clusters for Accelerating Large-scale DNNs Training.

A Scalable Hybrid Algorithm for Solving Partial Differential Equations on a Cluster of CPU/GPU

Energy-Aware Non-Preemptive Task Scheduling with Deadline Constraint in DVFS-Enabled Heterogeneous Clusters

Automatic Task Assignment System of General Computing Oriented GPU Cluster