HyF2FS: A Filesystem to Fully Exploit the Parallelism of Hybrid Storage

Jintong Zhang,Jianxi Chen,Kezheng Liu,Yongkang Zhuo,Panfei Yuan
DOI: https://doi.org/10.1109/ICCD58817.2023.00049
2023-01-01
Abstract:Hybrid storage systems can fully leverage the advantages of multiple devices to achieve better performance. However, current systems are designed primarily for a slow disk with an expensive fast device at high costs. They ignore device features and workload status while placing data. The issue of cache pollution is affecting their data hotness identification. Besides, inconsistent load status in multiple devices is overlooked during migration. These shortcomings constrain the overall performance of the system. To solve this, we propose HyF2FS, a hybrid storage filesystem based on F2FS. HyF2FS features a cache-tiering integrated architecture that stores hot data and metadata in an accelerator while asynchronously migrating cold data to the SSD, which provides cost-effective opportunities to optimize device parallelism. HyF2FS uses multidimensional scores to place data on the appropriate device to achieve high bandwidth. To improve data hotness identification, HyF2FS proposes two-level counters. Besides, a migration window is employed to minimize the impact of migration on foreground I/O. By implementing these scheduling algorithms, HyF2FS can fully exploit the parallelism of both fast and slow devices. Experimental results demonstrate significant improvements in throughput (116%-244%) and latency reduction (49%-64%) compared to F2FS and other hybrid storage systems.
What problem does this paper attempt to address?