Taming Server Memory TCO with Multiple Software-Defined Compressed Tiers

Sandeep Kumar, Aravinda Prasad, Sreenivas Subramoney
2024-04-22
Abstract: Memory accounts for 33-50% of the total cost of ownership (TCO) in modern data centers. We propose a novel solution to tame memory TCO through the novel creation and judicious management of multiple software-defined compressed memory tiers.
Operating Systems, Emerging Technologies
What problem does this paper attempt to address?
The paper addresses the high total cost of ownership (TCO) of memory in modern data centers, a problem that grows as the data demands of AI and machine learning applications increase. Working set sizes have reached the terabyte scale, and memory accounts for 33%-50% of data center TCO. To reduce memory TCO, current software solutions typically use a 2-Tier system that adds a single compressed memory tier alongside uncompressed DRAM, such as zswap in Linux. This approach has three limitations:

1. **Limited TCO Savings**: Placing only the infrequently accessed cold data (about 20%-30% of the data) in the compressed tier yields limited TCO savings.
2. **Performance Loss**: Aggressively placing more data in the compressed tier to increase TCO savings can cause unacceptable performance loss.
3. **Underutilization of Warm Data**: About 50%-60% of the data is warm, and a 2-Tier system does not exploit it for compression, so potential TCO savings are left unrealized.

To address these issues, the paper proposes TierScape, a scheme that reduces memory TCO by creating and managing multiple software-defined compressed memory tiers. Compared to the traditional 2-Tier system, TierScape has the following features:

- **Multi-Tier Compressed Memory**: Builds multiple compressed tiers from different combinations of compression algorithms, memory allocators, and storage media, so that the tiers differ in access latency, data compressibility, and per-unit memory cost.
- **Dynamic Data Migration**: Adjusts the distribution of data across the compressed tiers at run time, based on application data access patterns, to balance TCO savings against application performance.
- **Advanced Analytical Model**: Uses an analytical model to recommend a distribution of pages across the compressed tiers based on page access characteristics, aiming for the best balance between performance and TCO.
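The argument that warm data is the missing source of savings can be made concrete with a little arithmetic. The sketch below is illustrative only: the tier names, compression ratios, and the 20/50/30 hot/warm/cold split are assumptions chosen to fall within the ranges quoted above, not figures from the paper, and the `Tier` class and `memory_cost` function are hypothetical helpers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    """A memory tier. All numbers used below are assumed for illustration."""
    name: str
    cost_per_gb: float        # relative cost of the backing medium (DRAM = 1.0)
    compression_ratio: float  # original size / stored size (1.0 = uncompressed)


def memory_cost(placement, tiers):
    """Relative cost of a placement, given GB of original data per tier name."""
    return sum(
        gb * tiers[name].cost_per_gb / tiers[name].compression_ratio
        for name, gb in placement.items()
    )


# Hypothetical tiers: uncompressed DRAM, a fast low-ratio compressed tier,
# and a slower high-ratio compressed tier.
tiers = {
    "dram": Tier("dram", cost_per_gb=1.0, compression_ratio=1.0),
    "fast": Tier("fast", cost_per_gb=1.0, compression_ratio=2.0),
    "dense": Tier("dense", cost_per_gb=1.0, compression_ratio=4.0),
}

# Assumed 100 GB working set: 20 GB hot, 50 GB warm, 30 GB cold.
all_dram = {"dram": 100.0}
two_tier = {"dram": 70.0, "dense": 30.0}                 # only cold data compressed
multi_tier = {"dram": 20.0, "fast": 50.0, "dense": 30.0}  # warm data compressed too

baseline = memory_cost(all_dram, tiers)
two_tier_cost = memory_cost(two_tier, tiers)
multi_tier_cost = memory_cost(multi_tier, tiers)

print(f"2-Tier saving:     {1 - two_tier_cost / baseline:.1%}")
print(f"Multi-tier saving: {1 - multi_tier_cost / baseline:.1%}")
```

Under these assumed numbers the multi-tier placement roughly doubles the saving of the 2-Tier placement. The model deliberately ignores access latency, which is the other side of the trade-off that dynamic migration and the analytical model are meant to manage.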
The main contributions of the paper include:

- Identifying the potential for TCO savings from warm data and demonstrating that these savings can be achieved with acceptable performance impact.
- Highlighting the limitations of existing 2-Tier systems.
- Showcasing the benefits of multi-tier compressed memory.
- Proposing two methods for managing page placement across tiers: a waterfall model and an analytical model.

Overall, TierScape aims to maximize memory TCO savings while maintaining good performance.
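This summary does not define the waterfall model, so the sketch below shows only one plausible reading of the name: in each profiling window, pages that were accessed are promoted to DRAM, and pages that were not are demoted one tier further down. The tier list, the `hot_threshold` parameter, and the `waterfall_step` function are all hypothetical and are not taken from the paper.

```python
# Hypothetical tiers, ordered from fastest (index 0) to most compressed.
TIERS = ["dram", "fast_compressed", "dense_compressed"]


def waterfall_step(page_tier, access_counts, hot_threshold=1):
    """Return the tier index of every page after one profiling window.

    page_tier:     dict mapping page id -> current tier index
    access_counts: dict mapping page id -> accesses seen in this window
    Assumed policy: accessed pages return to DRAM, idle pages drop one tier.
    """
    last = len(TIERS) - 1
    new_placement = {}
    for page, tier in page_tier.items():
        if access_counts.get(page, 0) >= hot_threshold:
            new_placement[page] = 0                    # promote to DRAM
        else:
            new_placement[page] = min(tier + 1, last)  # demote one tier
    return new_placement


# All pages start in DRAM.
placement = {"a": 0, "b": 0, "c": 0}

# Window 1: only page "a" is accessed.
placement = waterfall_step(placement, {"a": 5})

# Window 2: pages "a" and "b" are accessed; "c" stays idle.
placement = waterfall_step(placement, {"a": 3, "b": 1})

for page, tier in sorted(placement.items()):
    print(page, TIERS[tier])
```

In this reading, a page that stays idle trickles down one tier per window, so the cost of a wrong guess is bounded: a page that turns out to be warm is caught in a cheaper-to-access tier before it reaches the densest one.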