CARE: Cloudified Android with Optimized Rendering Platform
Yuxin Xiang,Dongjie Tang,Rui Huang,Yong Yao,Chao Xie,Qiming Shi,Randy Xu,Mohammad Reza Haghighat,Cathy Bao,Yicheng Gu,Zhengwei Qi,Haibing Guan
DOI: https://doi.org/10.1109/tmm.2023.3274303
IF: 7.3
2024-01-01
IEEE Transactions on Multimedia
Abstract:Due to the excellent rendering capabilities, GPUs are mainstream accelerators in the Cloud-rendering industry. However, current Cloud-rendering systems suffer from a CPU-GPU workload imbalance that not only degrades application performance but also causes a significant waste of GPU resources. Recent proposals (such as API-forwarding and c-GPU) for improving CPU-GPU balance are promising but fail to solve system-resource redundancy issues (i.e., each instance tends to occupy all resources, exceeding its requirements). Such behavior will increase CPU load and lower effective GPU utilization. To demonstrate the severity of the issue, we evaluated real-world applications and results show that in most cases, nearly 50% of resources are useless. To solve this problem, we present CARE, the first framework intended to reduce the system-level redundancy by cloudifying the system from monolithic to Cloud-native. To allow users to configure required services, CARE puts forward a functional unit called Configurable Android (CA) . To allow multiple instances to share certain types of resources, CARE innovates Sharing Resource (SR) . To reduce the unused services, CARE introduces Pruning Resources (PR) . To further alleviate the CPU pressure and achieve CPU-GPU balance, we propose rShare, a system aiming at enhancing CPU effective utilization and increasing Android instance density of the Cloud-rendering platform. Based on Kubernetes, rShare divides all the CPUs into non-overlapping shared CPU pools, allocates instances to pools within milliseconds, and dynamically migrates them by tracking their QoS status. So far, CARE primarily focuses on Android systems and can handle 60 heavyweight instances (e.g., KOG (King of Glory)) on Intel SG1. rShare can apply instance allocation within milliseconds and increase the platform density by 39.4%.