Optimizing Hash-based Distributed Storage Using Client Choices

Peilun Li,Wei Xu
DOI: https://doi.org/10.1145/2967360.2967365
2016-01-01
Abstract:Many distributed storage systems use hash-based methods for block placement. While hashing improves scalability, it lacks the flexibility that modern applications need for performance optimization. We propose CHOICE, a design allowing clients to have multiple choices for block placement. It also provides the client with relevant server performance metrics so the clients can implement their own choice policy for performance optimization such as choosing better locality or less busy servers. CHOICE requires minimal changes to the storage server and thus easy to deploy. We have implemented it in Ceph, a popular open-source distributed storage system. On two real Ceph clusters with 45 and 176 disks respectively, we show that we can greatly improve performance using the right placement policy.
What problem does this paper attempt to address?