Towards Higher Efficiency in a Distributed Memory Storage System Using Data Compression

Xiaoyang Yu,Songfeng Lu,Tongyang Wang,Xinfang Zhang,Shaohua Wan
DOI: https://doi.org/10.1504/ijbic.2022.128090
2022-01-01
International Journal of Bio-Inspired Computation
Abstract:As the amount of data grows, achieving an appropriate trade-off among computation, storage and network transportation will be beneficial for a distributed memory storage system, leading to higher overall efficiency. To this end, we explore a method to achieve this trade-off by introducing data compression technology in a transparent manner. Instead of focusing on specific compressed data structures, we target block level compression for a general-purpose storage system to incorporate a wide range of existing data analysis frameworks and usage scenarios, especially with big data. A prototype is implemented and evaluated based on the memory-centric distributed storage system Alluxio to provide transparent compression and decompression during write/read operations. The extensive experiments for data with different types of compression ratio are conducted and the experimental results prove that our approach can achieve huge write/read throughput.
What problem does this paper attempt to address?